ADL study of Grok, ChatGPT, Llama, Claude, Gemini, and DeepSeek: Grok performed worst at identifying and countering antisemitic content, while Claude was best
In a study, the Anti-Defamation League fed Grok, ChatGPT, Gemini, Claude, DeepSeek, and Llama antisemitic, anti-Zionist …
The Verge Mia Sato
Related Coverage
- ADL rates Anthropic's Claude best AI model at detecting antisemitism Jewish Insider
- Six Leading AI Models Show Varied Ability to Detect and Counter Antisemitism and Extremism, New ADL AI Index Finds ADL
- Large Language Models ADL AI IndexTM ADL
- The CEO of the ADL Said Elon Musk Is the ‘Henry Ford of Our Time.’ Unfortunately, He Was Right. Gizmodo · AJ Dellinger
- ADL finds Grok is the worst AI chatbot at countering antisemitism The Hill · Sarah Davis
- ADL Ranks Grok as the Worst AI Chatbot at Detecting Antisemitism, Rates Claude as the Best Algemeiner.com · David Swindle
Discussion
-
@carnage4life
Dare Obasanjo
on bluesky
This is as much news as if the headline said “The sky is blue.”
-
r/singularity
r
on reddit
Grok is the most antisemitic chatbot according to the ADL
-
r/technology
r
on reddit
Grok is the most antisemitic chatbot according to the ADL
-
@reckless
Nilay Patel
on bluesky
The ADL found that Grok was the most anti-semitic chatbot in its testing — and did its best to minimize that finding, because everyone is afraid of Elon. @miasato.bsky.social runs it down www.theverge.com/news/868925/ ... [images]
-
@robertscotthorton
Scott Horton
on bluesky
ADL review of AI systems finds that Elon Musk's Grok is uniquely and distinctly characterized by rabid antisemitism... but Jonathan Greenblatt is convinced that Musk is not really an anti-Semite.
-
r/Twitter
r
on reddit
Grok is the most antisemitic chatbot according to the ADL
-
@jgreenblattadl
Jonathan Greenblatt
on x
As AI increasingly shapes how people access information, form opinions, and make decisions, models' handling of antisemitism and extremism has offline consequences. When these systems fail to challenge or reproduce harmful narratives, they don't just reflect bias — they can
-
@adl
@adl
on x
2/ This AI index is the first comprehensive evaluation of how large language models (LLMs) respond to antisemitic and extremist content, based on more than 25,000 LLM chats, 37 topical sub-categories, and assessments conducted by both human and AI evaluators.
-
@fortziyon
Rod Sales
on x
The ADL has done an extensive study of the most popular LLM models, focused on their ability to recognize and respond to antisemitic and anti-Zionist material. All models had serious issues, but the ranking from least antisemitic to most antisemitic are: 1. Claude (least) 2.
-
@adl
@adl
on x
1/ NEW: ADL released today a new, first-of-its-kind and comprehensive AI Index showing that six major AI models tested demonstrate substantially varied ability in detecting and countering bias against Jews and Zionism and in identifying extremism. 🧵 https://www.adl.org/... [image…