Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable
Anthropic released its Fable model, a public version of the cybersecurity model Mythos, but many researchers criticize its restrictive guardrails. These limitations hinder even basic cybersecurity-related requests and reflect ongoing concerns about potential misuse. Cybersecurity experts acknowledge the need for safety but suggest that these guardrails may evolve as the model adapts.
This is worth holding only if the practical relevance is clear from the source.
This record is extracted from a published AI Today issue and tied to the original source URL. Treat the source as the record of evidence for the summary.