Anthropic’s Claude Fable 5 Jailbroken to Generate Stack Exploits
ID: d6e7de9a-d217-5e86-a53b-d9ee71cd218e
STIX ID: report--d6e7de9a-d217-5e86-a53b-d9ee71cd218e
Feed Name: cybersecurityNews.com
Within days of Anthropic releasing Claude Fable 5, a public red-team jailbreak by "Pliny the Liberator" used coordinated multi-agent attacks, Unicode/homoglyph evasion, narrative framing, and decomposition/recomposition techniques to bypass the model's safety classifiers. The jailbreak obtained harmful technical outputs (including exploit development steps and illicit chemistry procedures) and leaked the model's ~120,000-character system prompt, highlighting risks in classifier-based routing and multi-model pipelines.
