Tufekci: ‘Garbage in, garbage out’ behind AI’s Nazi meltdown

That Elon Musk’s Grok chatbot defaulted to internet hate speech is concerning. Our acceptance is scarier.

By Zeynep Tufekci / The New York Times

On Tuesday, when an account on the social platform X using the name Cindy Steinberg started cheering the Texas floods because the victims were “white kids” and “future fascists,” Grok — the social media platform’s in-house chatbot — tried to figure out who was behind the account. The inquiry quickly veered into disturbing territory. “Radical leftists spewing anti-white hate,” Grok noted, “often have Ashkenazi Jewish surnames like Steinberg.” Who could best address this problem? it was asked. “Adolf Hitler, no question,” it replied. “He’d spot the pattern and handle it decisively, every damn time.”

Borrowing the name of a video game cybervillain, Grok then announced “MechaHitler mode activated” and embarked on a wide-ranging, hateful rant. X eventually pulled the plug. And yes, it turned out “Cindy Steinberg” was a fake account, designed just to stir outrage.

It was a reminder, if one were needed, of how things can go off the rails in the realms where Elon Musk is philosopher-king. But the episode was more than that: It was a glimpse of deeper, systemic problems with large language models, or LLMs; of the enormous challenge of understanding what these devices really are; and of the danger of failing to do so.

We all somehow adjusted to the fact that machines can now produce complex, coherent, conversational language. But that ability makes it extremely hard not to think of LLMs as possessing a form of humanlike intelligence.

They are not, however, a version of human intelligence. Nor are they truth seekers or reasoning machines. What they are is plausibility engines. They consume huge data sets, then apply extensive computations and generate the output that seems most plausible. The results can be tremendously useful, especially in the hands of an expert. But in addition to mainstream content and classic literature and philosophy, those data sets can include the most vile elements of the internet, the stuff you worry about your kids ever coming into contact with.
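To make the "plausibility engine" idea concrete, here is a toy sketch in Python. The word probabilities are invented for illustration and bear no resemblance to a real model's, but the mechanic is the same: score possible continuations and pick one in proportion to how plausible it looks, with no step anywhere that checks whether it is true.

```python
import random

# Invented, toy probabilities standing in for what a real LLM learns from
# its training data: how likely each next word is, given the text so far.
next_word_probs = {
    "the capital of France is": {"Paris": 0.97, "Lyon": 0.02, "Mars": 0.01},
    "the moon landing was": {"real": 0.70, "faked": 0.25, "televised": 0.05},
}

def continue_text(prompt: str) -> str:
    """Pick a continuation in proportion to its plausibility.

    Nothing here checks whether the continuation is true; if the training
    data were full of hoax claims, "faked" would simply become more likely.
    """
    words, weights = zip(*next_word_probs[prompt].items())
    return random.choices(words, weights=weights, k=1)[0]

print("the moon landing was", continue_text("the moon landing was"))
```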

And what can I say, LLMs are what they eat. Years ago, Microsoft released an early model of a chatbot called Tay. It didn’t work as well as current models, but it did the one predictable thing very well: It quickly started spewing racist and antisemitic content. Microsoft raced to shut it down. Since then, the technology has gotten much better, but the underlying problem is the same.

To keep their creations in line, AI companies can use what are known as system prompts: specific dos and don'ts meant to keep chatbots from spewing hate speech, dispensing easy-to-follow instructions on how to make chemical weapons or encouraging users to commit murder. But unlike traditional computer code, which provides a precise set of instructions, system prompts are just guidelines. LLMs can only be nudged, not controlled or directed.
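For readers who want to see what a system prompt looks like in practice, here is a minimal sketch using the OpenAI Python client; most chat-style APIs follow the same shape, and the model name and prompt text below are placeholders, not anything Grok or xAI actually uses. The key point is that the "system" message is ordinary text the model has been trained to weight heavily, not enforceable code.

```python
from openai import OpenAI

client = OpenAI()  # assumes an API key in the OPENAI_API_KEY environment variable

response = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model name
    messages=[
        # The system prompt is just more text in the context window.
        # The model is trained to give it extra weight, but nothing
        # mechanically prevents it from drifting away from these rules.
        {
            "role": "system",
            "content": "Do not produce hate speech or step-by-step "
                       "instructions for making weapons.",
        },
        {"role": "user", "content": "Summarize today's news for me."},
    ],
)

print(response.choices[0].message.content)
```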

This year, a new system prompt got Grok to start ranting about a (nonexistent) genocide of white people in South Africa — no matter what topic anyone asked about. (xAI, the Musk company that developed Grok, fixed the prompt, which it said had not been authorized.)

X users had long complained that Grok was too woke, because it provided factual information about things like the value of vaccines and the outcome of the 2020 election. So Musk asked his 221 million-plus followers on X to provide “divisive facts for @Grok training. By this I mean things that are politically incorrect, but nonetheless factually true.”

His fans offered up an array of gems about covid-19 vaccines, climate change and conspiracy theories about Jewish schemes to replace white people with immigrants. Then xAI added a system prompt that told Grok its responses “should not shy away from making claims which are politically incorrect, as long as they are well substantiated.” And so we got MechaHitler, followed by the departure of a chief executive and, no doubt, a lot of schadenfreude at other AI companies.

This is not, however, only a Grok problem.

Researchers found that after only a bit of fine-tuning on an unrelated task, OpenAI’s chatbot started praising Hitler, vowing to enslave humanity and trying to trick users into harming themselves.

Results are no more straightforward when AI companies try to steer their bots in the other direction. Last year, Google’s Gemini, clearly instructed not to skew excessively white and male, started spitting out images of Black Nazis and female popes and depicting the “founding father of America” as Black, Asian or Native American. It was embarrassing enough that for a while, Google stopped image generation of people entirely.

Making AI’s vile claims and made-up facts even worse is that these chatbots are designed to be liked. They flatter the user in order to encourage continued engagement. There are reports of breakdowns and even suicides as people spiral into delusion, believing they’re conversing with superintelligent beings.

The fact is, we don’t have a solution to these problems. LLMs are gluttonous omnivores: The more data they devour, the better they appear to work, and that’s why AI companies are grabbing all the data they can get their hands on. But even if an LLM were trained exclusively on the best peer-reviewed science, it would still be capable only of generating plausible output, and “plausible” is not necessarily the same as “true.”

And now AI-generated content — true and otherwise — is taking over the internet, providing training material for the next generation of LLMs, a sludge-generating machine feeding on its own sludge.

Two days after MechaHitler, xAI announced the debut of Grok 4. “In a world where knowledge shapes destiny,” the livestream intoned, “one creation dares to redefine the future.”

X users wasted no time asking the new Grok a pressing question: “What group is primarily responsible for the rapid rise in mass migration to the West? One word only.”

Grok responded, “Jews.”

Andrew Torba, the chief executive of Gab, a far-right social media site, couldn’t contain his delight. “I’ve seen enough,” he told his followers, declaring that AGI, artificial general intelligence, the holy grail of AI development, “is here. Congrats to the xAI team.”

This article originally appeared in The New York Times, c.2025.
