Mistral Medium 3.5 is a 128 billion parameter dense model priced at $1.50 input / $7.50 output per million tokens, far above comparable Chinese alternatives.
Chinese open-source models—Qwen, GLM, MiMo-V2—dominate the leaderboard top, leaving Mistral as a lonely Western holdout.
Mistral is positioning the release as a building block toward a future large flagship model.
Mistral AI dropped Mistral Medium 3.5 on April 29. The Paris-based lab announced a dense 128-billion-parameter model, a set of agentic features—and walked straight into a wall of online “meh” reactions.
The release came in three parts. First, the model itself. Second, remote coding agents via Mistral Vibe CLI—cloud-based coding sessions that can push pull requests to GitHub and run in parallel without you sitting at a terminal. Third, Work Mode in Le Chat, Mistral’s ChatGPT-style consumer interface, which now handles multi-step autonomous tasks like email triage, research synthesis, and cross-tool workflows.
Big ambitions, but a messy benchmark reality.
Medium 3.5 scores 77.6% on SWE-Bench Verified—a coding benchmark that tests whether a model can fix real GitHub issues by generating working patches. It also hits 91.4% on τ³-Telecom, which measures agentic tool use in specialized environments. Mistral also merged three previously separate models (Medium 3.1, Magistral, and Devstral 2) into one set of weights with configurable reasoning effort per request.
Unified model replacing three is a real engineering win. The problem is what it costs and who it’s up against.
Mistral charges $1.50 per million input tokens and $7.50 per million output tokens. Alibaba’s Qwen 3.6 at 27 billion parameters—less than a quarter of Medium 3.5’s parameter count—scores 72.4% on the same SWE-Bench Verified benchmark and ships under Apache 2.0, meaning you can download and run it for free.
Did you know?
Parameters are what determine an AI’s capacity to learn, reason, and store information. The more parameters, the wider the model’s breadth of knowledge.
Scroll through the open-source leaderboards and the picture is stark. The top spots belong to Alibaba’s Qwen, GLM from China’s Zhipu AI, and MiMo-V2 from Xiaomi, all of them cheaper, more powerful and competitive than Mistral’s new release. Medium 3.5 hasn’t even ranked on major independent leaderboards yet—third-party evaluations are still pending.
The only good thing though, as some argue, is that Mistral is, at this point, the lone non-Chinese model with any serious presence in the open-source conversation.
I think Mistral has the 10th highest valuation in the whole AI scene (something like that).
All while they consistently release some of the worst models.
They have survived through European bureaucracy, lobbying and politics.
All because they’ve convinced demented bureaucrat… https://t.co/kh7ASvdi7C
Pedro Domingos, a machine learning professor at the University of Washington, wasn’t gentle:
“Regular AI companies brag about how much better their model is on benchmarks. Only Mistral brags about how much worse its one is.”
Regular AI companies brag about how much better their model is on benchmarks. Only Mistral brags about how much worse its one is. pic.twitter.com/WcAKskaVpL
He followed up with a sharper question: “I don’t know what’s worse, for Europe to not be in the AI race or for it to be represented by a laughingstock like Mistral.”
Youssof Altoukhi, founder of Yoyo Studios, did the math: Qwen 3.6, at 27 billion parameters, is 4.7 times smaller than Medium 3.5 and scores comparably on coding. Medium 3.5’s output pricing puts it alongside closed models that score significantly higher on every major benchmark.
“If it wasn’t for their political skill they would have been bankrupt by now,” he said.
Not everyone was purely dismissive. AI developer Michal Langmajer captured the ambivalence:
“I’m genuinely glad there’s still a non-US, non-Chinese lab trying to build frontier LLMs but boy we have to level up the game in Europe. Their new flagship model is basically ‘not the best’ on any benchmark, yet costs multiple times more than most competitors.”
I’m genuinely glad there’s still a non-US, non-Chinese lab trying to build frontier LLMs (@MistralAI) but boy we have to level up the game in Europe.
Their new flagship model is basically “not the best” on any benchmark, yet costs multiple times more than most competitors… pic.twitter.com/JwvR5eKWmT
Some developers argued open weights are a durability play, not a leaderboard play. A model anyone can download, fine-tune, and self-host doesn’t need to win rankings today to stay relevant. Others pointed to Mistral’s real enterprise deployments across Europe as evidence the moat isn’t purely technical.
The Geopolitical safety net
This is where Mistral’s actual pitch lives.
European enterprises under GDPR, banks handling sensitive customer data, and governments that won’t route AI workloads through Chinese infrastructure have limited options. As Decrypt reported last December, HSBC signed a multi-year deal with Mistral specifically to self-host models on its own infrastructure. The appeal of an EU-headquartered open-weight lab with a $14 billion valuation doesn’t show up in benchmark tables—but it shows up in procurement decisions.
Not the best at coding, and not the cheapest. But it is: not American, not Chinese, auditable, self-hostable, and legally safe for European enterprise.
Daily Debrief Newsletter
Start every day with the top news stories right now, plus original features, a podcast, videos and more.
The FSNN News Room is the voice of our in-house journalists, editors, and researchers. We deliver timely, unbiased reporting at the crossroads of finance, cryptocurrency, and global politics, providing clear, fact-driven analysis free from agendas.
We and our selected partners wish to use cookies to collect information about you for functional purposes and statistical marketing. You may not give us your consent for certain purposes by selecting an option and you can withdraw your consent at any time via the cookie icon.
Cookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.
Your permission applies to the following domains:
https://fsnn.net
Necessary
Necessary cookies help make a website usable by enabling basic functions like page navigation and access to secure areas of the website. The website cannot function properly without these cookies.
Statistic
Statistic cookies help website owners to understand how visitors interact with websites by collecting and reporting information anonymously.
Preferences
Preference cookies enable a website to remember information that changes the way the website behaves or looks, like your preferred language or the region that you are in.
Marketing
Marketing cookies are used to track visitors across websites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.