OpenAI’s language model GPT-4o can be tricked into writing exploit code by encoding the malicious instructions in hexadecimal, which allows an attacker to jump the model’s built-in security guardrails and abuse the AI for evil purposes, according to 0Din researcher Marco Figueroa. 0Din is Mozilla’s generative AI bug bounty platform, and Figueroa is its technical product manager.
Source: The Register