Anthropic launches powerful Fable 5 model publicly, while keeping Mythos restricted over cybersecurity concerns

In context: Anthropic's latest release is really a story about control, not just capability. The company is offering two versions of the same underlying model: Claude Mythos 5 for a small circle of trusted partners, and Claude Fable 5 for everyone else. The split reflects a core challenge Anthropic is still trying to solve – how to deploy an extremely capable system into the wild without simultaneously handing attackers a new class of offensive tools.

Mythos has already shown what it can do when it is not heavily restricted. Since April, when an earlier preview was sent to about 150 organizations under the banner of Project Glasswing, users have reported more than 10,000 critical security flaws in their own systems. Those same capabilities could also be used by attackers looking to break in, rather than to patch security holes.

For that reason, Mythos 5 is staying behind the glass for now. Anthropic is keeping it in the hands of a "small group of cyberdefenders and infrastructure providers," along with select biology researchers, and is coordinating with US government agencies as part of the rollout. Access is effectively on a need-to-know basis, with the company signaling that a broader "trusted access program" will come later.

Fable 5 is where Anthropic is testing what a general-purpose release of Mythos-class technology looks like under constraint. Technically, it runs on the same underlying model as Mythos 5, but with hard limits built in. The system is designed to refuse or redirect a long list of requests related to cybersecurity, biology, and chemistry. When those guardrails trigger, the query is silently routed to an older model, Claude Opus 4.8, instead.

Anthropic has also wired Fable 5 to watch for distillation, where a user tries to harvest large volumes of answers to train a smaller model of their own. If the system thinks that is happening, those requests are also redirected to Opus 4.8. In other words, the company is not only trying to control what the model will talk about, but also what others can learn from it.

Anthropic has been wrestling with these decisions for months. Diane Penn, the company's head of product management, told Wired that testing and feedback since the April preview have helped shape the current strategy, even though it is still far from perfect.

"We're trying to make improvements in a way that's beneficial, even if we don't have the perfect [solution] for every use case to start," she says. "Out of all the different approaches, this emerged as the most viable and the best one. We just ended up feeling like this was the best product choice for users to get the maximum value out of Fable 5."

For now, the filters are tuned to err on the side of over-blocking. Penn has acknowledged that some harmless queries will be routed to the older model. Anthropic says it wants to refine its classifiers over time but argues that this level of caution is the only way to justify a wider release at this stage.

The stakes are higher because Fable and Mythos are not just chatbots that respond to prompts and stop. Anthropic says both can run "unattended" for longer stretches than previous Claude models, carrying out sequences of instructions without constant supervision.

That shift toward more agent-like behavior could substantially boost software engineering and other technical work, especially given Fable 5's stronger code generation and visual capabilities. But it also raises obvious questions about what happens if those capabilities are misused.

... continue reading