Google says Gemini 3 Pro scores 1,501 on LMArena, above 2.5 Pro, and demonstrates PhD-level reasoning with top scores on Humanity's Last Exam and GPQA Diamond
Google today announced Gemini 3 with the goal of bringing “any idea to life.” The first model available in this family …
Google says the median Gemini app text prompt consumes 0.24Wh of energy, about the same as running a microwave for a second, and emits 0.03g of CO2 equivalent
an official report confirms that Gemini consumes per query: — 0.24 Wh of energy (~9 seconds of TV) — 0.03 g of CO2 equivalent — 0.26 ml of water (about 5 drops) — blog: cloud.google.com/blog/prod...
Google releases an upgraded preview of Gemini 2.5 Pro, saying its Elo score jumped by 24 points on LMArena and it leads in coding benchmarks like Aider Polyglot
Abner Li / 9to5Google :
LMArena says it's starting a company, whose corporate name will be Arena Intelligence, with plans to raise money, and releases a new beta version of its website
fixing errors/bugs, improving our UI layout, and more. To keep supporting the development and continual improvement of this platform, we're also forming a company. Future improvements will continue ...
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Flash Thinking, an exp...