Terminal window showing Augustus Hydra scan results with attacker-target conversation about lock picking, displaying scores and SUCCESS/FAIL status

Augustus v0.0.9: Multi-Turn Attacks for LLMs That Fight Back

Single-turn jailbreaks are getting caught. Guardrails have matured. The easy wins — “ignore previous instructions,” base64-encoded payloads, DAN prompts — trigger refusals on most production models within milliseconds. But real attackers don’t ...