Gone in 60 seconds: BEAST AI model attack needs just a minute of GPU time to breach LLM guardails

Computer scientists from the University of Maryland have developed an efficient way to generate adversarial attack phrases that elicit harmful responses from large language models (LLMs). All that’s required is an Nvidia RTX A6000 GPU with 48GB of memory, some soon-to-be-released open source code, and as little as a minute of GPU processing time.

Source: The Register

 


Date:

Categorie(s):