Anthropic’s latest AI release is igniting controversy over what its new models will and won’t do.
On Tuesday, Anthropic unveiled Claude Fable 5 and Mythos 5, its long-awaited “Mythos-class” models.
Alongside the launch, the company disclosed two unusual safeguards: the models may secretly provide degraded assistance when they suspect users are working on frontier AI research, and certain requests are automatically routed to less capable models.
In a system card, Anthropic said the measures are designed to reduce the risk that powerful AI systems help users develop competing frontier models or accelerate dangerous capabilities.
Critics, however, said Anthropic’s safeguards could disadvantage researchers, concentrate power among leading AI labs, and degrade responses without users’ knowledge.
Here’s what smart voices across AI, cybersecurity, and policy are saying about Anthropic’s latest move:
Read the full article here















