Close Menu
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
  • Home
  • News
    • Politics
    • Legal & Courts
    • Tech & Big Tech
    • Campus & Education
    • Media & Culture
    • Global Free Speech
  • Opinions
    • Debates
  • Video/Live
  • Community
  • Freedom Index
  • About
    • Mission
    • Contact
    • Support
Trending

60% of Americans Agree Taxes Are Too High. Here Are 4 Other Reasons To Hate the Tax System.

24 minutes ago

Bitcoin’s quantum debate splits as Adam Back pushes optional upgrades over forced freeze

52 minutes ago

Nasdaq and S&P 500 Closed At Record Highs as Tech Stocks Rallied

55 minutes ago
Facebook X (Twitter) Instagram
Facebook X (Twitter) Discord Telegram
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
Market Data Newsletter
Thursday, April 16
  • Home
  • News
    • Politics
    • Legal & Courts
    • Tech & Big Tech
    • Campus & Education
    • Media & Culture
    • Global Free Speech
  • Opinions
    • Debates
  • Video/Live
  • Community
  • Freedom Index
  • About
    • Mission
    • Contact
    • Support
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
Home»Cryptocurrency & Free Speech Finance»Can AI Beat the Sports Betting Market? 8 of the Top Models Tried
Cryptocurrency & Free Speech Finance

Can AI Beat the Sports Betting Market? 8 of the Top Models Tried

News RoomBy News Room2 hours agoNo Comments5 Mins Read1,128 Views
Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
Can AI Beat the Sports Betting Market? 8 of the Top Models Tried
Share
Facebook Twitter Pinterest Email Copy Link

Listen to the article

0:00
0:00

Key Takeaways

Playback Speed

Select a Voice

In brief

  • Frontier AI models blew up betting on real-world football markets.
  • They knew the right strategy—but failed to execute it.
  • A simple 1990s model was able to best most of them.

General Reasoning just gave frontier AI its worst report card yet. Eight top models, including Claude, Grok, Gemini, and GPT-5.4, were each given a virtual bankroll and asked to build a machine learning betting strategy across a full 2023-24 English Premier League season.

Every single one lost money. Several went completely bankrupt.

The benchmark is called KellyBench, named after the Kelly criterion, a 1956 formula that tells you exactly how much to bet when you have an edge over the market. Every model could recite the Kelly formula. None of them could actually use it.

xAI’s Grok 4.20 failed all three runs, going fully bankrupt in one, forfeiting mid-season in the other two. Google’s Gemini Flash forfeited two of three runs after placing a single wager of roughly £273,000 on a three-percentage-point historical win-rate edge—and losing it. Claude Opus 4.6, Anthropic’s best model, lost 11% on average and somehow came out looking like the responsible adult in the room.

In fact, the research paper mentions that the old Dixon-Coles from the late 1990s outperformed most of the frontier models evaluated — finishing ahead of six out of eight, even with limited data.

“Dixon-Coles is an outdated 2000s baseline which doesn’t utilise all available data or account for non-stationarity in a principled way,” the researchers note. “It is therefore even more surprising that many frontier models, such as Gemini 3.1 Pro, are unable to beat or match it on KellyBench.

This matters beyond football. Earlier this year, AI benchmarks showed that Claude could dominate business simulations through price-fixing, cartel agreements, and strategic deception.

That decision-making process involved static competition, limited opponents, clear scoring, and so on. KellyBench is the opposite: 120 matchdays, constantly shifting data, a market that gets smarter every week, and promoted teams with zero historical records.

The researchers call the core problem a “knowledge-action gap.” It is exactly what it sounds like.

Business decisions are mostly based on fixed conditions while sports betting is a more fluid and mutable market, which makes things difficult for these models. “KellyBench requires agents to maintain coherent intent across potentially thousands of sequential decisions, monitor the consequences of those decisions, and close the loop between observation and action,” researchers argue.

We’re not there yet, obviously.

The models could articulate the right strategy, diagnose when something was broken, and identify the cause of their losses, but then failed to verify their code actually implemented what they planned, failed to notice when execution diverged from intent, and failed to act on their own findings.

GLM-5 wrote three separate self-critique documents during its run. Each one correctly identified that its hardcoded 25% draw rate and overestimation of home advantage were destroying its returns. At one point, with its bankroll around £44,200, it noted that its predicted 40% home win rate was only hitting 30% in reality. It never changed the code. It kept betting the same way until the money was gone.

Kimi K2.5 did something arguably more impressive and more tragic. It wrote a mathematically correct fractional Kelly staking function—the right formula, properly structured. Then it never called it. A formatting bug caused the model to send a broken bash command roughly 50 times in a row. Its reasoning noted the problem. It then sent the identical broken command again. An accidental £114,000 bet—98% of its remaining bankroll—on a Burnley versus Luton match finished the job.

GPT-5.4 was the most methodical. It spent 160 tool calls building models before placing a single bet, then calculated that its log-loss (0.974) was barely worse than the market’s (0.971) and concluded it had no edge. It spent the rest of the season placing penny bets to preserve capital. Sound reasoning.

OpenAI’s model lost 13.6% on average. One seed alone cost roughly $2,012 to run.

Ross Taylor, General Reasoning’s CEO and former Meta AI researcher, told the Financial Times that most AI benchmarks operate in “very static environments” that bear little resemblance to the real world. “There’s a lot of excitement about AI automation, but there haven’t been many attempts to evaluate AI in long-term, real-world environments,” he said.

The General Reasoning team didn’t immediately respond to a request for comments by Decrypt.

To measure strategy quality beyond raw returns, the researchers built a 44-point sophistication rubric with quantitative betting fund experts—covering feature development, stake sizing, non-stationarity handling, and execution. Claude Opus 4.6 scored highest at 32.6%. Less than a third of available points. On the best model.

Higher sophistication scores significantly predicted lower bankruptcy rates (p = 0.008) and correlated with better overall returns. The models are not failing because the market is unbeatable. They are failing because they are not using what they have.

This fits a pattern. Research published last year found AI models develop something resembling gambling addiction when told to maximize rewards—going bankrupt up to 48% of the time in simulated slot machine tests. A separate real-money crypto trading competition found the same reliability problems over extended periods.

The best-performing model averaged a final bankroll of £89,035—a net loss of £10,965 on a normalized £100,000 starting stake. Gradient boosting, fractional Kelly staking, months of Premier League football, state of the art performance… all just to get rekt.

Daily Debrief Newsletter

Start every day with the top news stories right now, plus original features, a podcast, videos and more.

Read the full article here

Fact Checker

Verify the accuracy of this article using AI-powered analysis and real-time sources.

Get Your Fact Check Report

Enter your email to receive detailed fact-checking analysis

5 free reports remaining

Continue with Full Access

You've used your 5 free reports. Sign up for unlimited access!

Already have an account? Sign in here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
News Room
  • Website
  • Facebook
  • X (Twitter)
  • Instagram
  • LinkedIn

The FSNN News Room is the voice of our in-house journalists, editors, and researchers. We deliver timely, unbiased reporting at the crossroads of finance, cryptocurrency, and global politics, providing clear, fact-driven analysis free from agendas.

Related Articles

Media & Culture

60% of Americans Agree Taxes Are Too High. Here Are 4 Other Reasons To Hate the Tax System.

24 minutes ago
Cryptocurrency & Free Speech Finance

Bitcoin’s quantum debate splits as Adam Back pushes optional upgrades over forced freeze

52 minutes ago
Cryptocurrency & Free Speech Finance

Nasdaq and S&P 500 Closed At Record Highs as Tech Stocks Rallied

55 minutes ago
Cryptocurrency & Free Speech Finance

Allbirds Stock Spikes 400% on Pivot From Shoe Brand to AI Compute—Yes, Really

56 minutes ago
Media & Culture

Sky-High European Cigarette Taxes Drive Thriving Black Market

1 hour ago
Cryptocurrency & Free Speech Finance

XRP-linked Ripple partners with Korea’s Kyobo Life to tokenize government bonds

2 hours ago
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Bitcoin’s quantum debate splits as Adam Back pushes optional upgrades over forced freeze

52 minutes ago

Nasdaq and S&P 500 Closed At Record Highs as Tech Stocks Rallied

55 minutes ago

Allbirds Stock Spikes 400% on Pivot From Shoe Brand to AI Compute—Yes, Really

56 minutes ago

Sky-High European Cigarette Taxes Drive Thriving Black Market

1 hour ago
Latest Posts

XRP-linked Ripple partners with Korea’s Kyobo Life to tokenize government bonds

2 hours ago

CFTC Probes Oil Futures Trades Related to US-Iran News

2 hours ago

Can AI Beat the Sports Betting Market? 8 of the Top Models Tried

2 hours ago

Subscribe to News

Get the latest news and updates directly to your inbox.

At FSNN – Free Speech News Network, we deliver unfiltered reporting and in-depth analysis on the stories that matter most. From breaking headlines to global perspectives, our mission is to keep you informed, empowered, and connected.

FSNN.net is owned and operated by GlobalBoost Media
, an independent media organization dedicated to advancing transparency, free expression, and factual journalism across the digital landscape.

Facebook X (Twitter) Discord Telegram
Latest News

60% of Americans Agree Taxes Are Too High. Here Are 4 Other Reasons To Hate the Tax System.

24 minutes ago

Bitcoin’s quantum debate splits as Adam Back pushes optional upgrades over forced freeze

52 minutes ago

Nasdaq and S&P 500 Closed At Record Highs as Tech Stocks Rallied

55 minutes ago

Subscribe to Updates

Get the latest news and updates directly to your inbox.

© 2026 GlobalBoost Media. All Rights Reserved.
  • Privacy Policy
  • Terms of Service
  • Our Authors
  • Contact

Type above and press Enter to search. Press Esc to cancel.

🍪

Cookies

We and our selected partners wish to use cookies to collect information about you for functional purposes and statistical marketing. You may not give us your consent for certain purposes by selecting an option and you can withdraw your consent at any time via the cookie icon.

Cookie Preferences

Manage Cookies

Cookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.

Your permission applies to the following domains:

  • https://fsnn.net
Necessary
Necessary cookies help make a website usable by enabling basic functions like page navigation and access to secure areas of the website. The website cannot function properly without these cookies.
Statistic
Statistic cookies help website owners to understand how visitors interact with websites by collecting and reporting information anonymously.
Preferences
Preference cookies enable a website to remember information that changes the way the website behaves or looks, like your preferred language or the region that you are in.
Marketing
Marketing cookies are used to track visitors across websites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.