VOICE ARCHIVE

@artificialanlys

66 posts
2026-03-04
Google has released Gemini 3.1 Flash-Lite Preview! … Key takeaways:
➤ Improved intelligence over Gemini 2.5 Flash-Lite: @GoogleDeepMind's Gemini 3.1 Flash-Lite Preview scores 34 on the Artificial Analysis Intelligence Index, up 12 points from Gemini 2.5 Flash-Lite (09-25). However, Gemini 3.1 Flash-Lite Preview had limited gains in tool use capabilities, matching Gemini 2.5 Flash-Lite (09-25) on Tau2-Telecom with 31% and scoring 958 on GDPval-AA, 12 points behind gpt-oss-120b (high).
➤ Leading speed and latency: Gemini 3.1 Flash-Lite Preview maintains the same high speeds and low latency as Gemini 2.5 Flash-Lite (09-25), measuring at over 360 output tokens/s with an average answer latency of 5.1s. To measure latency for reasoning models, we use time to first answer token, which accounts for both prefill processing and thinking time.
2026-03-04 View on X
The Keyword

Google launches Gemini 3.1 Flash-Lite, which it says delivers “enhanced performance at a fraction of the cost of larger models” and outperforms Gemini 2.5 Flash

Get best-in-class intelligence for your highest-volume workloads. … Today, we're introducing Gemini 3.1 Flash-Lite …
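The latency methodology mentioned in the post above (time to first answer token, covering both prefill processing and thinking time) can be sketched generically. The `(kind, token)` stream interface below is a hypothetical stand-in for illustration, not any vendor's actual streaming API:

```python
import time

def measure_stream_metrics(stream):
    """Measure time-to-first-answer-token and decode throughput for a
    streamed model response.

    `stream` is any iterator yielding (kind, token) pairs, where kind is
    "thinking" or "answer" -- a hypothetical interface, not a real API.
    For reasoning models, the clock to the first *answer* token includes
    both prefill processing and thinking time.
    """
    start = time.monotonic()
    first_answer_at = None  # seconds until the first answer token
    answer_tokens = 0
    for kind, _token in stream:
        now = time.monotonic()
        if kind == "answer":
            if first_answer_at is None:
                first_answer_at = now - start  # prefill + thinking time
            answer_tokens += 1
    total = time.monotonic() - start
    # Output tokens/sec over the decode phase (after the first answer token).
    if first_answer_at is not None and total > first_answer_at:
        tokens_per_sec = answer_tokens / (total - first_answer_at)
    else:
        tokens_per_sec = 0.0
    return first_answer_at, tokens_per_sec
```

Measuring only time to first token would ignore thinking time entirely, which is why time to first answer token is the fairer latency proxy for reasoning models.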


2026-02-21
Google is once again the leader in AI: Gemini 3.1 Pro Preview leads the Artificial Analysis Intelligence Index, 4 points ahead of Claude Opus 4.6 while costing less than half as much to run. @GoogleDeepMind gave us pre-release access to Gemini 3.1 Pro Preview. It leads 6 of the 10 evaluations that make up the Artificial Analysis Intelligence Index and improves significantly over Gemini 3 Pro Preview across capabilities, with the biggest gains in reasoning and knowledge, coding, and hallucination reduction.
2026-02-21 View on X
9to5Google

Google rolls out Gemini 3.1 Pro, which it says is “a step forward in core reasoning”, for all users in the Gemini app; the .1 increment is a first for Google

In November, Google introduced Gemini 3 Pro in preview, with Gemini 3 Flash following a month later.

2026-02-18
The performance and token use increases for Claude Sonnet 4.6 mean that it is now clustered with Opus 4.6 on the ELO vs. Cost to Run curve despite 40% lower per-token prices. Sonnet is back at the Pareto frontier, but now positioned at a higher cost and performance point while [image]
2026-02-18 View on X
Anthropic

Anthropic launches Claude Sonnet 4.6 with improvements in coding, computer use, instruction following, and more; it features a 1M token context window in beta

Claude Sonnet 4.6 is our most capable Sonnet model yet.  It's a full upgrade of the model's skills across coding, computer use …

Claude Sonnet 4.6 is the new leader in GDPval-AA, slightly ahead of Anthropic's Opus 4.6 on agentic performance of real-world knowledge work tasks, less than two weeks after its launch. In our pre-release testing with @AnthropicAI, Sonnet 4.6 reached an ELO of 1633 using the [image]
2026-02-18 View on X

Claude Sonnet 4.6 substantially improves on the aesthetic capabilities of Sonnet 4.5 for tasks like presentation and document generation in GDPval-AA. While we see effective analysis, and in some cases content similarities, between the two versions, the visual elements are [image]
2026-02-18 View on X

2026-02-12
GLM-5 is on the Pareto curve of the Intelligence vs. Cost to Run the Intelligence Index chart, driven by lower per-token pricing compared to proprietary peers (e.g. Claude Opus, Google Gemini and OpenAI GPT-5.2). GLM-5 cost ~$547 (based on the median per token price of [image]
2026-02-12 View on X
Z.ai

Z.ai launches GLM-5, saying its flagship open-weight model has “best-in-class performance among all open-source models” in reasoning, coding, and agentic tasks

We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks.  Scaling is still one of the most important ways …
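Pareto-frontier claims like the one above (intelligence vs. cost to run) reduce to a simple dominance test: a model is off the frontier if some other model scores at least as high for no more cost, and strictly better on one axis. The model names and numbers below are illustrative placeholders, not benchmark data:

```python
def pareto_frontier(models):
    """Return the names of models not dominated on (score, cost).

    `models` maps name -> (score, cost). A model is dominated if another
    model scores at least as high at equal-or-lower cost, and is strictly
    better on at least one of the two axes.
    """
    frontier = []
    for name, (score, cost) in models.items():
        dominated = any(
            s >= score and c <= cost and (s > score or c < cost)
            for other, (s, c) in models.items()
            if other != name
        )
        if not dominated:
            frontier.append(name)
    return sorted(frontier)

# Hypothetical example: "C" is dominated by "B" (lower score, higher cost).
example = {"A": (60, 900), "B": (50, 400), "C": (45, 500)}
```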

GLM-5 uses fewer output tokens than GLM-4.7 to run the Artificial Analysis Intelligence Index [image]
2026-02-12 View on X

GLM-5 is the new leading open weights model! GLM-5 leads the Artificial Analysis Intelligence Index amongst open weights models and makes large gains over GLM-4.7 in GDPval-AA, our agentic benchmark focused on economically valuable work tasks. GLM-5 is @Zai_org's first new [image]
2026-02-12 View on X
Reuters

Z.ai says it will raise prices by at least 30% for new GLM coding plan subscribers to accommodate surging demand for its AI coding tools

GLM-5 demonstrates improvement in AA-Omniscience Index, driven by lower hallucination. This means the model is abstaining more from answering questions it does not know [image]
2026-02-12 View on X
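The abstention behaviour described above, where a model scores better by declining to answer questions it does not know rather than hallucinating, can be illustrated with a toy scoring rule. This is a generic sketch of an abstention-aware metric, not the actual AA-Omniscience formula:

```python
def abstention_aware_score(results):
    """Score a list of outcomes, each "correct", "wrong", or "abstain".

    Correct answers gain a point, wrong answers (hallucinations) lose a
    point, and abstentions are neutral. Under this rule a model that
    abstains on unknown questions outscores one that guesses and is
    often wrong. Returns the mean score, in [-1, 1].
    """
    points = {"correct": 1, "wrong": -1, "abstain": 0}
    return sum(points[r] for r in results) / len(results)
```

With this rule, a model that answers 5 of 10 questions correctly and guesses wrong on the rest scores 0.0, while one that answers the same 5 correctly but abstains on the rest scores 0.5.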


2026-01-27
Moonshot's Kimi K2.5 is the new leading open weights model, now closer than ever to the frontier, with only OpenAI, Anthropic and Google models ahead. Key takeaways:
➤ Impressive performance on agentic tasks: @Kimi_Moonshot's Kimi K2.5 achieves an Elo of 1309 on our GDPval-AA [image]
2026-01-27 View on X
Kimi

Moonshot says Kimi K2.5 builds on K2 with “pretraining over ~15T mixed visual and text tokens” and “can self-direct an agent swarm with up to 100 sub-agents”

Today, we are introducing Kimi K2.5, the most powerful open-source model to date.