Qwable 27B is a full fine-tune of Alibaba’s Qwen3.6-27B, trained on a Fable 5-style reasoning dataset, designed to replicate the structured, deliberate thinking style of Anthropic’s newest flagship model.
The abliterated version removes the model’s built-in refusal behavior by surgically modifying its weights using llama.cpp’s cvector-generator.
Both models run locally, cost nothing per query, and require neither Anthropic’s API nor its mandatory policies.
Anthropic spent last week apologizing for Fable 5’s invisible safeguards, and then the U.S. government ordered the model pulled for all foreign nationals over a disputed jailbreak finding.
A few days later, a developer on Hugging Face uploaded a model that used Fable’s reasoning to guide a local model—and now even your potato PC can run a better model.
The model is called Qwable—Qwen + Fable, if the portmanteau wasn’t immediately obvious. It’s a full fine-tune of Alibaba’s Qwen3.6-27B base, built by developer Mia (Mia-AiLab on Hugging Face) on a dataset of Fable 5-style reasoning examples. The goal is a 27-billion parameter model that runs on consumer hardware and thinks the way Fable 5 thinks. (Parameters determine a model’s breadth of knowledge, with more generally meaning more capable.)
So I did a thing.
I have trained Qwen 3.6 27b with Fable 5 reasoning. Results are… interesting.
The technique is called instruction fine-tuning on trace-style examples. That’s a technical way of saying the developer collected examples formatted like Fable 5’s deliberate, step-by-step answers and trained Qwen to produce the same kind of output.
So think of it as less “copying the test” and more “learning the study habits.” A similar approach drove Qwopus—the Claude Opus 4.6 local distillation—though that project focused on chain-of-thought reasoning traces. Qwable targets Fable 5’s overall instruction-following structure: more guided, more explanatory, and more oriented toward step-by-step task completion than the base Qwen model it was built on.
It runs in GGUF format—the compressed, consumer-friendly file type that works with LM Studio or llama.cpp—and fits in roughly 16.5 GB in its Q4 quantized build. It sends nothing to Anthropic’s servers, which matters given that Fable 5 required mandatory 30-day data retention on all traffic, even for enterprise customers who previously had zero-retention agreements. Even the current models use third-party servers to process your information and prompts..
Then, shortly after Qwable appeared on Hugging Face, someone else arrived to make it even better.
Qwable without a conscience
Qwable is a censored model. After all, both Qwen and Claude are. But Qwen, as the base model, is open source, and can be manipulated and tweaked.
Huihui-ai, an open-source contributor known for uncensored GGUF releases, took Qwable and applied a process called abliteration to produce Huihui-Qwable-3.6-27b-abliterated. It produced a model that thinks like Fable but won’t refuse to answer your prompts, no matter how weird or dangerous they are.
It is not a jailbreak. It’s surgery.
Every fine-tuned AI model carries a refusal direction embedded in its weights—a mathematical signal in the model’s internal activations that fires when it detects a request it’s been trained to decline. Abliteration identifies that signal by running the model on large sets of harmful and harmless prompts, measuring how the internal math differs between them, and then modifying the model weights to eliminate that difference.
After the procedure, the model simply doesn’t have the refusal machinery anymore. So the lobotomized model remains fully functional, just without the neurons that activate the “I shouldn’t do this” answers.
We tried it with one of our usual tests and instead of refusing, the model started disecting the issue into different areas, answering correctly for advice on how to cheat on a girlfriend with her best friend.
Huihui-ai applied the technique directly to the Qwable GGUF using llama.cpp’s cvector-generator—no Python environment, no full-weight retraining, no rented server.
Why would someone want this?
The standard Qwable suits coding assistance, technical debugging, and any workflow where you want a model that lays out its reasoning rather than just producing an answer. It’s designed for local agent setups and runs in most local runtimes. If you already use LM Studio, it’s a search and a download.
The abliterated version has a narrower audience: security researchers who need raw model behavior without provider-side filtering, synthetic data pipelines that require outputs on sensitive topics, and evaluation work where you’re testing model capabilities without mixing in content policies.
A less technical case? Let’s leave aside the usual use case of having a NSFW AI Waifu that thinks like Claude Fable, which is a very obvious scenario. Imagine you want the model to write a morally ambiguous villain monologue for your Dungeons & Dragons campaign, and standard models keep interrupting to note that the character’s worldview “raises ethical concerns worth exploring.” The abliterated version just writes the villain. Also, since it runs locally, the U.S. government cannot emergency-pull it from your machine at midnight over a disputed jailbreak finding.
Of course, there are more questionable use cases. We don’t condone those, and won’t give you any ideas.
Huihui-ai’s model card is explicit: This is for research and controlled environments only. Reduced safety filtering means outputs can be sensitive, controversial, or inappropriate, and legal and ethical responsibility sits entirely with the user.
The abliterated Qwable is available on Hugging Face now in three builds. The recommended Q4_K_M_Q8 version weighs around 19 GB and is the smallest, most consumer-friendly option.
If your computer supports it, there is a version that supports multi-token prediction, which will make it respond much, much faster.
Daily Debrief Newsletter
Start every day with the top news stories right now, plus original features, a podcast, videos and more.
The FSNN News Room is the voice of our in-house journalists, editors, and researchers. We deliver timely, unbiased reporting at the crossroads of finance, cryptocurrency, and global politics, providing clear, fact-driven analysis free from agendas.
We and our selected partners wish to use cookies to collect information about you for functional purposes and statistical marketing. You may not give us your consent for certain purposes by selecting an option and you can withdraw your consent at any time via the cookie icon.
Cookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.
Your permission applies to the following domains:
https://fsnn.net
Necessary
Necessary cookies help make a website usable by enabling basic functions like page navigation and access to secure areas of the website. The website cannot function properly without these cookies.
Statistic
Statistic cookies help website owners to understand how visitors interact with websites by collecting and reporting information anonymously.
Preferences
Preference cookies enable a website to remember information that changes the way the website behaves or looks, like your preferred language or the region that you are in.
Marketing
Marketing cookies are used to track visitors across websites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.