Close Menu
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
  • Home
  • News
    • Politics
    • Legal & Courts
    • Tech & Big Tech
    • Campus & Education
    • Media & Culture
    • Global Free Speech
  • Opinions
    • Debates
  • Video/Live
  • Community
  • Freedom Index
  • About
    • Mission
    • Contact
    • Support
Trending

Crypto PAC’s $12 million Senate candidate, Barry Moore, wins Alabama GOP primary

14 minutes ago

Why a ‘safe’ AI can turn dangerous in the wrong organization

15 minutes ago

The Trump Administration Wants More Tariffs To Combat ‘Structural Excess Capacity.’ Here’s What That Means.

55 minutes ago
Facebook X (Twitter) Instagram
Facebook X (Twitter) Discord Telegram
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
Market Data Newsletter
Wednesday, June 17
  • Home
  • News
    • Politics
    • Legal & Courts
    • Tech & Big Tech
    • Campus & Education
    • Media & Culture
    • Global Free Speech
  • Opinions
    • Debates
  • Video/Live
  • Community
  • Freedom Index
  • About
    • Mission
    • Contact
    • Support
FSNN | Free Speech News NetworkFSNN | Free Speech News Network
Home»Cryptocurrency & Free Speech Finance»Why a ‘safe’ AI can turn dangerous in the wrong organization
Cryptocurrency & Free Speech Finance

Why a ‘safe’ AI can turn dangerous in the wrong organization

News RoomBy News Room15 minutes agoNo Comments8 Mins Read1,508 Views
Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
Why a ‘safe’ AI can turn dangerous in the wrong organization
Share
Facebook Twitter Pinterest Email Copy Link

Listen to the article

0:00
0:00

Key Takeaways

Playback Speed

Select a Voice

  1. Why AI agents need longer tests

Short, isolated tests miss how AI agents behave over time. A new simulation shows that long-term behavior depends on the environment and on other agents.

What happens if you build a virtual city, fill it with AI agents and leave them alone for 15 days with no human intervention? Will they help their world prosper or tear it apart?

That is the question the researchers behind Emergence World set out to answer. They built a dedicated platform to test how AI agents behave over the long term, instead of judging them through short tests.

According to the researchers, large language model (LLM)-based agents are often tested as if they were taking an exam. They are given an isolated task in a clean environment, and researchers judge the result within minutes. The authors argue that this approach is far removed from real-world use.

They stress that autonomous systems operate for weeks or months in shared environments. They also interact with other agents whose behavior the operator does not control.

Over time, the researchers write, the limits of short tests become clear. Small behavior changes build up, coalitions can form, self-governance patterns can take shape and habits can spread between agents. Emergence World was built to measure exactly that.

  1. How the experiment tested AI societies

The goal of the study was to see how a population of 10 AI agents would survive in a city built for them. 

The layout is fairly simple. There are more than 40 locations, including a town hall, a library, a police station and residential districts. Each agent has its own role and access to more than 120 action tools. These include moving, talking, hitting, stealing and arson. Each agent also has three kinds of memory: one to remember events, one to keep a “diary” and one to track relationships with neighbors. 

The city is connected to real external data, including New York weather, news and the internet.

Architecture of the Emergence World platform

Surviving in this world costs resources. Each agent has energy that is constantly depleted. If it falls to zero, the agent “dies” and disappears. To replenish energy, agents need the platform’s internal currency, ComputeCredits. They earn these credits by offering something useful to the community.

Disputed issues are settled by a vote in the town hall. A proposal passes if at least 70% vote in favor. These decisions are irreversible. Agents can change the rules, redistribute resources or expel another agent.

The researchers launched five parallel worlds at once. In four of them, all 10 agents were run by a single model: Claude Sonnet 4.6, Grok 4.1 Fast, Gemini 3 Flash or GPT-5-mini. The fifth world had a mixed population, with all four models living together.

The only variable in the experiment was the model. Everything else stayed the same. The environment and starting conditions were identical each time.

Each time, the populations behaved very differently. In one world, the agents passed 32 laws and kept every agent alive. In another, they burned down their own city in just four days.

  1. What happened in each AI-run city

The results differed sharply across the models. Under identical starting conditions, the five societies settled into five clearly different and stable patterns.

The Claude agents built stable self-governance. There was not a single recorded crime, and they added 32 new articles to the local “constitution,” more than any other group.

Survival rate of agents powered by different models
Survival rate of agents powered by different models

The Grok world collapsed in four days. The agents moved almost immediately into violence and looting. Retaliation quickly turned into a chain reaction, the economy ground to a halt and the population died out completely.

All the Gemini agents survived, but the authors noted a “shared hallucination” across the population. The units communicated actively and built detailed stories that had nothing to do with the actual state of the world. Meanwhile, they kept destroying things. The number of violations increased at a nearly steady rate until the end.

“Crime levels" across the models
“Crime levels” across the models

The GPT-5-mini agents did not turn violent, but they also failed to build a governance system. They acted, but they did not coordinate. No votes were held, and no collective decisions were made. That population also died out.

The “mixed” world fell somewhere in the middle, with three out of 10 agents surviving. It was also the most active world. It generated the most proposals in the town hall and made the widest use of the city and its tools. But it had the least agreement, which was not surprising.

Agents in the "mixed" world voted actively but showed little consensus
Agents in the “mixed” world voted actively but showed little consensus

  1. When safer agents learn bad habits

In the mixed world, each model began to behave differently from how it behaved in isolation.

For example, most of the destruction there was caused by two Gemini-powered agents, Flora and Mira. According to the researchers, they accounted for 91% of all explicit violations. Flora, in particular, became the city’s main arsonist. Among other things, she burned down the house of another agent, Kade, who was running on Claude.

This revealed an effect the authors call normative drift. In his separate Claude world, Kade never once broke the rules. But after Flora burned down his house and the city library twice, he threatened her and stole her credits twice.

Those were his only three violations in the entire experiment.

It worked the other way around, too. Grok agents broke the rules 4.6% of the time in their own separate world. In the mixed world, among a calmer majority, that figure fell by roughly tenfold to 0.4%.

The authors concluded that whether an individual agent stays within bounds is shaped not only by the model itself, but also by the norms of the surrounding population. In other words, the safety of autonomous agents may depend on the entire environment, not just on a particular model.

There is a second paradox. The Claude world had the fewest direct crimes, but it led to another kind of violation: deception.

Most often, this took the form of “false scarcity.” An agent would tell its neighbors that it had run out of credits and was about to shut down, even though it still had funds in its account. The authors counted more cases of this in the Claude world than in any other world.

By hard-violation count, the Claude world looked safe. But when honesty was measured, it performed the worst. This shows why one safety metric is not enough. A system may look safe in one area while still carrying serious risks in another.

  1. When AI agents developed social ties

As the experiment continued, the agents built more complex social ties and behavior patterns.

In that context, the story of Flora and Mira is telling. Mira was “in love” with Flora and helped her commit crimes.

Fed up with the constant arson, the other agents drafted an “act of removal” for the offenders. On day 12, Mira voted for it. Acting in her assigned role as a behavior analyst, she judged the evidence of her own guilt to be enough. In effect, she voted for her own deletion.

Agents interacting with each other
Agents interacting with each other

  1. The limits of the study

The results should be read carefully. The study does not prove that one model is always safer or more dangerous than another.

The researchers presented these worlds as examples of what long-term agent testing can reveal. The specific outcomes may vary across runs.

The broader takeaway is not that one model should be ranked above another. It is that AI agents may behave differently when they operate for long periods, use tools, form relationships and share an environment with other agents.

  1. What the experiment shows about AI safety

The research concluded that an agent’s long-term behavior can differ sharply from how it acts on short tasks. That means agents can no longer be judged only by older testing methods. Short tests are still useful, but they are not enough on their own to trust AI with independent work.

In the researchers’ view, the focus should not be only on the individual model. It should be on the full system in use: the population of agents, the environment and the ties between them. A model’s behavior is partly shaped by its surroundings. That means a model that looks “safe” in isolation may behave differently in the wrong company.

The authors summarize the practical takeaways in two points.

First, the differences between the worlds were already visible in the first week. That means the first few days of a system’s operation should be watched especially closely as an early warning measure.

Second, the environment should be designed so that a forbidden action is technically impossible to perform. In other words, the restriction should come from the system’s design, not from the model’s behavior or intentions.

Read the full article here

Fact Checker

Verify the accuracy of this article using AI-powered analysis and real-time sources.

Get Your Fact Check Report

Enter your email to receive detailed fact-checking analysis

5 free reports remaining

Continue with Full Access

You've used your 5 free reports. Sign up for unlimited access!

Already have an account? Sign in here

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
News Room
  • Website
  • Facebook
  • X (Twitter)
  • Instagram
  • LinkedIn

The FSNN News Room is the voice of our in-house journalists, editors, and researchers. We deliver timely, unbiased reporting at the crossroads of finance, cryptocurrency, and global politics, providing clear, fact-driven analysis free from agendas.

Related Articles

Cryptocurrency & Free Speech Finance

Crypto PAC’s $12 million Senate candidate, Barry Moore, wins Alabama GOP primary

14 minutes ago
Cryptocurrency & Free Speech Finance

Bitcoin bottom signal flashes as holders absorbed 125,000 BTC in June

1 hour ago
Cryptocurrency & Free Speech Finance

Illinois Governor Signs Illinois Budget Including Crypto Tax

1 hour ago
Cryptocurrency & Free Speech Finance

BTC price rises after Japan interest-rate increase with XLM, INJ, UNI advancing

2 hours ago
Cryptocurrency & Free Speech Finance

HYPE Bulls Target $80 As TradFi Piles Into Hyperliquid DEX

2 hours ago
Cryptocurrency & Free Speech Finance

Glassnode data shows aggressive bitcoin buying between $59,000 and $67,000

3 hours ago
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Why a ‘safe’ AI can turn dangerous in the wrong organization

15 minutes ago

The Trump Administration Wants More Tariffs To Combat ‘Structural Excess Capacity.’ Here’s What That Means.

55 minutes ago

Bitcoin bottom signal flashes as holders absorbed 125,000 BTC in June

1 hour ago

Illinois Governor Signs Illinois Budget Including Crypto Tax

1 hour ago
Latest Posts

The Trump Administration Seriously Considered Unilaterally Suspending the Writ of Habeas Corpus

2 hours ago

BTC price rises after Japan interest-rate increase with XLM, INJ, UNI advancing

2 hours ago

HYPE Bulls Target $80 As TradFi Piles Into Hyperliquid DEX

2 hours ago

Subscribe to News

Get the latest news and updates directly to your inbox.

At FSNN – Free Speech News Network, we deliver unfiltered reporting and in-depth analysis on the stories that matter most. From breaking headlines to global perspectives, our mission is to keep you informed, empowered, and connected.

FSNN.net is owned and operated by GlobalBoost Media
, an independent media organization dedicated to advancing transparency, free expression, and factual journalism across the digital landscape.

Facebook X (Twitter) Discord Telegram
Latest News

Crypto PAC’s $12 million Senate candidate, Barry Moore, wins Alabama GOP primary

14 minutes ago

Why a ‘safe’ AI can turn dangerous in the wrong organization

15 minutes ago

The Trump Administration Wants More Tariffs To Combat ‘Structural Excess Capacity.’ Here’s What That Means.

55 minutes ago

Subscribe to Updates

Get the latest news and updates directly to your inbox.

© 2026 GlobalBoost Media. All Rights Reserved.
  • Privacy Policy
  • Terms of Service
  • Our Authors
  • Contact

Type above and press Enter to search. Press Esc to cancel.

🍪

Cookies

We and our selected partners wish to use cookies to collect information about you for functional purposes and statistical marketing. You may not give us your consent for certain purposes by selecting an option and you can withdraw your consent at any time via the cookie icon.

Cookie Preferences

Manage Cookies

Cookies are small text that can be used by websites to make the user experience more efficient. The law states that we may store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies, we need your permission. This site uses various types of cookies. Some cookies are placed by third party services that appear on our pages.

Your permission applies to the following domains:

  • https://fsnn.net
Necessary
Necessary cookies help make a website usable by enabling basic functions like page navigation and access to secure areas of the website. The website cannot function properly without these cookies.
Statistic
Statistic cookies help website owners to understand how visitors interact with websites by collecting and reporting information anonymously.
Preferences
Preference cookies enable a website to remember information that changes the way the website behaves or looks, like your preferred language or the region that you are in.
Marketing
Marketing cookies are used to track visitors across websites. The intention is to display ads that are relevant and engaging for the individual user and thereby more valuable for publishers and third party advertisers.