2025-05-01
TechCrunch
7 related
A study from Cohere, Stanford, MIT, and Ai2 accuses LMArena of helping Meta, OpenAI, Google, and Amazon game its popular crowdsourced AI benchmark Chatbot Arena
A new paper from AI lab Cohere, Stanford, MIT, and Ai2 accuses LM Arena, the organization behind the popular crowdsourced AI …
2024-09-08
TechCrunch
A look at LMSYS' Chatbot Arena and the issues surrounding the crowdsourced LLM benchmark platform, including biases, lack of transparency, and commercial ties
Kyle Wiggers / TechCrunch. On X, Woojin Kim / @woojinrad: The AI industry is obsessed with Chatbot Arena, but it might not be the best benchmark | @TechCrunch Human raters bring their bi...