Close Menu
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
  • Home
  • News
    • Politics
    • Legal & Courts
    • Tech & Big Tech
    • Campus & Education
    • Media & Culture
    • Global Free Speech
  • Opinions
    • Debates
  • Video/Live
  • Community
  • Freedom Index
  • About
    • Mission
    • Contact
    • Support
Trending

Tether leads $1.4 billion funding round in German robotics company Neura

6 minutes ago

Japan Crypto Bill Advances With ETF, Tax Reform Path: Report

6 minutes ago

Ripple CEO Takes Aim at JPMorgan’s Jamie Dimon Over Clarity Act Crypto Bill Criticism

13 minutes ago
Facebook X (Twitter) Instagram
Facebook X (Twitter) Discord Telegram
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
Market Data Newsletter
Thursday, June 11
  • Home
  • News
    • Politics
    • Legal & Courts
    • Tech & Big Tech
    • Campus & Education
    • Media & Culture
    • Global Free Speech
  • Opinions
    • Debates
  • Video/Live
  • Community
  • Freedom Index
  • About
    • Mission
    • Contact
    • Support
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
Home»Cryptocurrency & Free Speech Finance»Anthropic Apologizes for Claude Fable 5 Secret Censorship—But the Fix Has a Catch
Cryptocurrency & Free Speech Finance

Anthropic Apologizes for Claude Fable 5 Secret Censorship—But the Fix Has a Catch

News RoomBy News Room2 hours agoNo Comments4 Mins Read1,800 Views
Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
Anthropic Apologizes for Claude Fable 5 Secret Censorship—But the Fix Has a Catch
Share
Facebook Twitter Pinterest Email Copy Link

Listen to the article

0:00
0:00

Key Takeaways

Playback Speed

Select a Voice

In brief

  • Anthropic admitted its invisible LLM-development safeguards were “the wrong tradeoff” and will replace them with visible fallbacks to Claude Opus 4.8, starting this week.
  • Flagged requests on the API will now return a reason for their refusal, rather than silently delivering a degraded answer.
  • Making the safeguards visible means they’ll be easier to work around.

Anthropic spent about 48 hours as the AI industry’s villain of the week before blinking.

The company launched Claude Fable 5 this week to immediate backlash over a safeguard buried in its 319-page system card: The model, the first of the company’s new Mythos class, would secretly degrade its own responses for users it suspected were building competing AI models—no warning, no fallback message, just quietly worse output. By Thursday, Anthropic was apologizing.

We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible.

Starting this week, flagged requests will visibly fall back to Opus 4.8—the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged…

— ClaudeDevs (@ClaudeDevs) June 11, 2026

“Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason—and that was the wrong tradeoff,” the company posted on X. “You should have visibility into the safeguards we have in place, and why.”

“We’re sorry for not getting the balance right.”

Starting this week, flagged requests will visibly route to Claude Opus 4.8, a less capable model, instead of silently delivering degraded Fable output. API users will receive a stated reason when a request gets refused. Anthropic says server-side fallback notifications will roll out in the next few days.

What was actually happening

For non-technical readers, here’s what the controversy was actually about. Claude Fable 5 already had visible safeguards for cybersecurity and biology research—if you asked something that tripped those filters, you’d get a notification that your request was being rerouted to the older Opus 4.8 model. You knew something had changed. You could adjust your prompt or use a different tool.

However, these safeguards were too extreme, some bio researchers noted.

The LLM-development safeguard, however, worked differently. If Fable 5 detected you were working on things like pretraining AI systems, building distributed training infrastructure, or designing machine learning chips, the model would silently alter its own behavior—through prompt modification, steering vectors, or parameter tweaks—to give you a worse answer without telling you. You’d get a response. It just wouldn’t be from the Fable 5 you paid for.

Fable 5 is billed as the public face of Anthropic’s most capable Mythos-class model, and researchers using it for legitimate machine learning work had no way to know their results were contaminated. A failed experiment looks the same whether your hypothesis is wrong or the model was quietly told to underperform. That’s the reproducibility problem that sent the AI research community into full meltdown mode.

The problem was the classifier wasn’t that precise. AI research firm SemiAnalysis was among the first to publicly call them out after seeing their GPU inference research get flagged.

BREAKING NEWS: Anthropic’s latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won’t notice. We are already seeing Anthropic’s latest model’s moderation filters our GPU… pic.twitter.com/9sa95cCSvS

— SemiAnalysis (@SemiAnalysis_) June 9, 2026

The catch in the fix

Anthropic’s reversal comes with a direct admission of the tradeoff it’s accepting. Making safeguards visible makes them easier to bypass, which means the classifier has to cast a wider net to remain effective.

More false positives—legitimate machine-learning work that gets caught and rerouted—are coming while the company tunes its systems. Anthropic said it’s working to reduce false positives “as fast as possible” but offered no timeline.

The company is also applying the same cleanup to its biology and cybersecurity classifiers, which had drawn their own complaints about flagging harmless research prompts.

That said, the remaining concern is that Anthropic isn’t dropping this category of restrictions—it’s only making them visible. For those who believe the restrictions themselves are wrong, Thursday’s apology is a partial fix. Fable 5 remains free on Pro, Max, Team, and Enterprise plans until June 22, after which it shifts to API usage credits only

Daily Debrief Newsletter

Start every day with the top news stories right now, plus original features, a podcast, videos and more.



Read the full article here

Fact Checker

Verify the accuracy of this article using AI-powered analysis and real-time sources.

Get Your Fact Check Report

Enter your email to receive detailed fact-checking analysis

5 free reports remaining

Continue with Full Access

You've used your 5 free reports. Sign up for unlimited access!

Already have an account? Sign in here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
News Room
  • Website
  • Facebook
  • X (Twitter)
  • Instagram
  • LinkedIn

The FSNN News Room is the voice of our in-house journalists, editors, and researchers. We deliver timely, unbiased reporting at the crossroads of finance, cryptocurrency, and global politics, providing clear, fact-driven analysis free from agendas.

Related Articles

Cryptocurrency & Free Speech Finance

Japan Crypto Bill Advances With ETF, Tax Reform Path: Report

6 minutes ago
Cryptocurrency & Free Speech Finance

Tether leads $1.4 billion funding round in German robotics company Neura

6 minutes ago
Cryptocurrency & Free Speech Finance

Ripple CEO Takes Aim at JPMorgan’s Jamie Dimon Over Clarity Act Crypto Bill Criticism

13 minutes ago
Media & Culture

Apparently One Dismissed Speech-Suppressing SLAPP Suit Wasn’t Enough For Matt Taibbi

46 minutes ago
Media & Culture

Graham Platner, Other Fools Blame Their Problems on the ‘Epstein Class’

49 minutes ago
Cryptocurrency & Free Speech Finance

Ether Open Interest Hits New Highs on Binance: Are Bulls Back?

1 hour ago
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Japan Crypto Bill Advances With ETF, Tax Reform Path: Report

6 minutes ago

Ripple CEO Takes Aim at JPMorgan’s Jamie Dimon Over Clarity Act Crypto Bill Criticism

13 minutes ago

Yes to California’s Bill to Ban Surveillance Pricing

44 minutes ago

Apparently One Dismissed Speech-Suppressing SLAPP Suit Wasn’t Enough For Matt Taibbi

46 minutes ago
Latest Posts

Graham Platner, Other Fools Blame Their Problems on the ‘Epstein Class’

49 minutes ago

Ether Open Interest Hits New Highs on Binance: Are Bulls Back?

1 hour ago

SpaceX (SPCX) raises $75 billion in largest-ever IPO

1 hour ago

Subscribe to News

Get the latest news and updates directly to your inbox.

At FSNN – Free Speech News Network, we deliver unfiltered reporting and in-depth analysis on the stories that matter most. From breaking headlines to global perspectives, our mission is to keep you informed, empowered, and connected.

FSNN.net is owned and operated by GlobalBoost Media
, an independent media organization dedicated to advancing transparency, free expression, and factual journalism across the digital landscape.

Facebook X (Twitter) Discord Telegram
Latest News

Tether leads $1.4 billion funding round in German robotics company Neura

6 minutes ago

Japan Crypto Bill Advances With ETF, Tax Reform Path: Report

6 minutes ago

Ripple CEO Takes Aim at JPMorgan’s Jamie Dimon Over Clarity Act Crypto Bill Criticism

13 minutes ago

Subscribe to Updates

Get the latest news and updates directly to your inbox.

© 2026 GlobalBoost Media. All Rights Reserved.
  • Privacy Policy
  • Terms of Service
  • Our Authors
  • Contact

Type above and press Enter to search. Press Esc to cancel.

🍪

Cookies

We and our selected partners wish to use cookies to collect information about you for functional purposes and statistical marketing. You may not give us your consent for certain purposes by selecting an option and you can withdraw your consent at any time via the cookie icon.

Cookie Preferences

Manage Cookies

Cookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.

Your permission applies to the following domains:

  • https://fsnn.net
Necessary
Necessary cookies help make a website usable by enabling basic functions like page navigation and access to secure areas of the website. The website cannot function properly without these cookies.
Statistic
Statistic cookies help website owners to understand how visitors interact with websites by collecting and reporting information anonymously.
Preferences
Preference cookies enable a website to remember information that changes the way the website behaves or looks, like your preferred language or the region that you are in.
Marketing
Marketing cookies are used to track visitors across websites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.