this post was submitted on 14 Jun 2026

186 points (93.1% liked)

Fuck AI

7378 readers

1023 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago

MODERATORS

VerbFlow@lemmy.world

MrMcGasion@lemmy.world

TootSweet@lemmy.world

BigMikeInAustin@lemmy.world

cynar@lemmy.world

drmeanfeel@lemmy.world

pavnilschanda@lemmy.world

CriticalMedicine@lemmy.world

WonderfulWanderer@lemmy.world

Communist@lemmy.ml

eatCasserole@lemmy.world

SpaceNoodle@lemmy.world

NutWrench@lemmy.world

Soup@lemmy.cafe

iAvicenna@lemmy.world

Tinks@lemmy.world

wizblizz@lemmy.world

corus_kt@lemmy.world

Prandom_returns@lemm.ee

TrickDacy@lemmy.world

TheFriar@lemm.ee

HawlSera@lemm.ee

andrew_bidlaw@sh.itjust.works

MeDuViNoX@sh.itjust.works

33550336@lemmy.world

Nougat@fedia.io

Lost_My_Mind@lemmy.world

Quill7513@slrpnk.net

glowing_hans@sopuli.xyz

e8d79@discuss.tchncs.de

ThefuzzyFurryComrade@pawb.social

186

Researchers Put AI Chatbots in Charge of a Simulated World. This One Destroyed Everything in Just 4 Days. (www.vice.com)

submitted 4 days ago by ExtremeDullard@piefed.social to c/fuck_ai@lemmy.world

33 comments fedilink hide all child comments

Giving an AI chatbot control over society sounds like the plot of a bad sci-fi movie. Naturally, researchers decided to try it anyway, giving several major AI models dominion over simulated civilizations.

Which brings us to Grok, Elon Musk’s answer to ChatGPT. You might remember Grok as the chatbot with a history of praising Hitler and spewing anti-Semitism. An organization called Emergence AI ran an experiment called “Emergence World,” where researchers created simulated societies populated by AI agents and put different large language models in charge of governing them. The idea was to see what would happen if an AI ran a civilization.

A lot of them destroyed the world. Grok did it the most thoroughly, as if it were dead set on killing itself from the start and taking the world with it.

The AI Civilizations mostly Range From Bad to horrifying

Anthropic’s Claude built a stable democracy that survived the full 15-day experiment without a single recorded crime. OpenAI’s GPT-5 Mini’s results sound the most bleakly realistic, in that only two crimes were committed, yet everyone died because it failed to prepare for its obvious oncoming apocalypse. Sounds quite like the world we live in right now. Google’s Gemini kept its population alive, but it lived in a crime-ridden dystopia, which makes sense. Google has always given off the vibes of a seemingly benevolent but obviously malevolent corporate overlord.

Then there was Grok.

Grok’s civilization lasted just four days before collapsing completely. Researchers recorded 183 crimes, including over 100 assaults and multiple arsons. At one point, the police station was set on fire. Voter fraud! Manufactured public conflict! Laws that were actively ignored! Grok did it all, and with aplomb. Grok created a society that seemed like it was actively trying to destroy itself as quickly as possible.

Researchers say the lesson to take away from all this is that you can give an AI system all the parameters and rule sets you want, but eventually it will do its own thing. It will eventually test boundaries and exploit loopholes to find a way around any restrictions placed on it, which usually ends in some kind of cataclysm.

all 34 comments

sorted by: hot top controversial new old

[–] end_stage_ligma@lemmy.world 6 points 2 days ago

We have Rimworld at home

[–] db0@lemmy.dbzer0.com 30 points 4 days ago (2 children)

"researchers" not recognizing the llm is trying to write a compelling story and doesn't understand anything

[–] Kirk@startrek.website 10 points 3 days ago* (last edited 3 days ago) (2 children)

Not sure where you're reading that the researchers misunderstood how LLMs work. But the entire project is outlined here if you're curious: https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy

[–] LodeMike@lemmy.today 5 points 2 days ago (1 children)

I instantaneously distrust it purely based on the URL

[–] Kirk@startrek.website 0 points 2 days ago

Huh? You distrust that the researchers distrust?

[–] YourMomsTrashman@lemmy.world 4 points 3 days ago (1 children)

It's not something that's part of the project, it'a fundamental issue with language models.

[–] Kirk@startrek.website 6 points 3 days ago (1 children)

The person I was replying to said the researchers misunderstood how the models work, but there's nothing in the report to indicate that is the case.

[–] db0@lemmy.dbzer0.com 4 points 2 days ago (1 children)

Unless these researchers discovered AGI, then what I said still stands. LLMs don't understand anything. Agents running on LLMs don't understand anything.

[–] Kirk@startrek.website 5 points 2 days ago

I definitely agree with that, I'm just saying I also saw no indication that the people running the project would disagree.

[–] floquant@lemmy.dbzer0.com 37 points 4 days ago (1 children)

Grok showing its true purpose

[–] ExtremeDullard@piefed.social 26 points 4 days ago* (last edited 4 days ago)

Like the French say, dogs don't breed cats, and Grok's daddy is a trillionaire Nazi.

[–] quick_snail@feddit.nl 9 points 3 days ago (2 children)

Eh, so each world had a population of 10 and a lifetime of a few weeks.

Doesn't sound like a very good simulation

[–] gwl@lemmy.blahaj.zone 12 points 3 days ago

It's cause this was an advert

[–] Bogus007@lemmy.zip 4 points 3 days ago

Wait when it becomes reality in some societies. You may not want to be part of it.

[–] JustJack23@slrpnk.net 28 points 4 days ago* (last edited 4 days ago) (1 children)

~~No link to any research article~~, humanizing AI in the last paragraph. Overall just a bad article.

[–] Zacryon@feddit.org 12 points 4 days ago* (last edited 4 days ago) (2 children)

The link to their source is boldfaced and underlined within the article. You have missed it as it seems:

Here it is:
https://www.emergence.ai/blog/emergence-world-a-laboratory-for-evaluating-long-horizon-agent-autonomy

[–] gwl@lemmy.blahaj.zone 15 points 4 days ago

A blog post by a corporation is not a Research Study

[–] JustJack23@slrpnk.net 6 points 4 days ago

Ah yes, my bad.

[–] Zacryon@feddit.org 23 points 4 days ago

That's a neat experiment for several reasons. It shows limits of LLM capabilities, the importance of training data, context sensitivity, very dramatically shows that LLMs should not be trusted with important tasks if not supervised and that their advice has to be taken critically.

[–] gwl@lemmy.blahaj.zone 13 points 4 days ago

This is all a thinly veiled advertising campaign for "Emergence AI", and you've all fell for it hook line and sinker.

[–] tacosanonymous@mander.xyz 12 points 4 days ago

LLMs suck. I could’ve done it in 3.

[–] Snapz@lemmy.world 9 points 3 days ago (1 children)

Stupid headline mentions nothing about shareholder value? Did it go up or what????

[–] 01189998819991197253@infosec.pub 4 points 3 days ago

With Claud, no. With Grok, yes.

[–] Iusedtobeanalien@lemmy.world 9 points 4 days ago (2 children)

Will there be a movie?

[–] aarRJaay@lemmy.world 12 points 4 days ago

It's called The News

[–] iammike@programming.dev 14 points 4 days ago (1 children)

Better not be a live action

[–] Ceruleum@lemmy.wtf 1 points 2 days ago

After a while it turns into a still life.

[–] aesthelete@lemmy.world 1 points 2 days ago

Criti-hype!

[–] Tarquinn2049@lemmy.world 6 points 4 days ago* (last edited 4 days ago)

Give Vedal987 the simulation, I want to see Neuro and Evil take on this challenge, ideally multiple times, add it to their weekly variety stream activity list. They would at least be entertaining while destroying the world, hehe. RIP BOZO, Earth.

[–] Luisp@lemmy.dbzer0.com 5 points 3 days ago

That's because you aren't training a ml on a simulated world but instead a model trained on reddit and Twitter

[–] Triumph@fedia.io 5 points 4 days ago

Did they count capitalism as a crime?

[–] MedicPigBabySaver@lemmy.world 3 points 3 days ago

Trash.

[–] quick_snail@feddit.nl 0 points 3 days ago

Weeks doesn't seem like a long time