Anthropic says its new Claude Fable 5 places powerful cyber tools behind safety filters. The pitch is straightforward. The risk is also straightforward.

If the filters work, attackers get fewer clean pathways to exploit code. If they do not, the speed and reach of “assistance” could still make an operator’s job easier than it should be. DeFi is already where the damage has been hardest.

CoinDesk reports that DeFi has seen more than $840 million in hacks this year. That number matters because it sets the baseline reality for where attackers aim their tools. When the threat surface is already expensive, a single new leverage point can translate into more losses, faster.

Why safety filters are suddenly not a detail

Safety filters can limit what a model produces. But their effectiveness is not binary in the way marketing implies. Even partial bypasses can reduce the cost of finding vulnerabilities, writing exploit logic, or automating reconnaissance.

Claude Fable 5 is designed to put “powerful cyber tools” behind those guardrails, CoinDesk says. That phrasing matters. It implies the model can do things that are useful in real attacks, not just generate generic code snippets.

So the practical question shifts from “Can the model help?” to “How reliably does it refuse when the task looks like harm?” In crypto, refusals that fail occasionally still create outsize impact.

DeFi’s hack tally raises the stakes

DeFi is not a one-off target. It is a sprawling set of contracts, integrations, and permission models that interact under time pressure. CoinDesk’s reference point is blunt. More than $840 million in hacks this year tells you attackers have been able to convert technical work into direct payouts.

When a system with cyber tooling arrives, the likely consequence is not a sudden stop to hacking. It is incremental changes to attacker workflow.

That is where filters become operational. Better filtering can reduce what attackers request. Weaker filtering can still help them move faster through the steps that lead to theft.

What to watch next

CoinDesk frames the development around Anthropics’ Claude Fable 5 and its safety filters. The next real signal will not be a lab promise. It will be evidence that filters reduce harmful outcomes in practice.

For readers, the deadline is not a press release. It is the next set of security incidents tied to tooling that lowers the barrier for exploitation.

DeFi already has a high-loss year. If new cyber assistance slips through guardrails, the desk expects attackers to test it quickly, not slowly.