An 'automated attacker' mimics the actions of human hackers to test the ChatGPT Atlas browser's defenses against prompt injection attacks. But there's a catch.
OpenAI says it has patched ChatGPT Atlas after internal red teaming found new prompt injection attacks that can hijack AI browser agents. The update adds an adversarially trained model plus stronger ...
OpenAI says prompt injection remains a key risk for AI browsers and that it is using an AI attacker to train ChatGPT Atlas to resist such attacks.