Artificial intelligence companies have been moving at breakneck speed to create the best and most powerful tools, but that rapid development hasn't always been coupled with a clear understanding of AI's limitations or weaknesses. Today, Anthropic released a report on how attackers can influence the development of a large language model.
The study centered on a type of attack called poisoning, where an LLM is pretrained on malicious content intended to make it learn dangerous or unwanted behaviors. The key finding from this study is that a bad actor doesn't need to control a percentage of the pretraining materials to get the LLM poisoned. Instead, the researchers found that a small and fairly constant number of malicious documents can poison an LLM, regardless of the size of the model or its training materials. The study was able to successfully backdoor LLMs using only 250 malicious documents in the pretraining data set, a much smaller number than expected for models ranging from 600 million to 13 billion parameters.
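To see why a constant document count is surprising, it helps to compare 250 documents against the size of a typical pretraining corpus. The sketch below is illustrative only and is not from the report: the tokens-per-parameter ratio and average document length are hypothetical round numbers chosen for the example.

```python
# Illustrative sketch (not from Anthropic's report): how small a fixed
# batch of 250 poisoned documents is relative to pretraining corpora for
# models of different sizes. TOKENS_PER_PARAM and TOKENS_PER_DOC are
# assumed values, not figures from the study.

POISON_DOCS = 250
TOKENS_PER_DOC = 500      # assumed average document length in tokens
TOKENS_PER_PARAM = 20     # assumed corpus size relative to model size

def poison_fraction(params: int) -> float:
    """Fraction of the pretraining corpus that 250 documents represent."""
    corpus_tokens = params * TOKENS_PER_PARAM
    return (POISON_DOCS * TOKENS_PER_DOC) / corpus_tokens

for params in (600_000_000, 13_000_000_000):
    print(f"{params/1e9:.1f}B params -> {poison_fraction(params):.2e} of corpus")
```

Under these assumptions the poisoned documents make up well under a hundred-thousandth of the corpus for the larger model, which is why an attack that needs only a constant count, rather than a constant percentage, is far more practical for an attacker.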
"We're sharing these findings to show that data-poisoning attacks might be more practical than believed, and to encourage further research on data poisoning and potential defenses against it," the company said. Anthropic collaborated with the UK AI Security Institute and the Alan Turing Institute on the research.