Bypassing LLM Supervisor Agents Through Indirect Prompt Injection

Indirect prompt injection lets attackers bypass LLM supervisor agents by hiding malicious instructions in profile fields and contextual data. Learn how this attack works and how to defend against it.
[Image: terminal window showing Augustus Hydra scan results with an attacker-target conversation about lock picking, displaying scores and SUCCESS/FAIL status]

Augustus v0.0.9: Multi-Turn Attacks for LLMs That Fight Back

Single-turn jailbreaks are getting caught. Guardrails have matured. The easy wins — “ignore previous instructions,” base64-encoded payloads, DAN prompts — trigger refusals on most production models within milliseconds. But real attackers don’t ...