Anthropic’s Claude Fable 5 Jailbroken to Generate Stack Exploits

ID: d6e7de9a-d217-5e86-a53b-d9ee71cd218e

STIX ID: report--d6e7de9a-d217-5e86-a53b-d9ee71cd218e

Feed Name: cybersecurityNews.com

Date Published: 2026-06-11

Date Updated: 2026-06-11

Author: Guru Baran

...

Within days of Anthropic releasing Claude Fable 5, a public red-team jailbreak by "Pliny the Liberator" bypassed the model's safety classifiers. The attack combined coordinated multi-agent attacks, Unicode/homoglyph evasion, narrative framing, and decomposition/recomposition techniques. It obtained harmful technical outputs, including exploit development steps and illicit chemistry procedures, and leaked the model's roughly 120,000-character system prompt. The incident highlights risks in classifier-based routing and multi-model pipelines.
