Google launches Gemini 3.1 Flash-Lite, which it says delivers “enhanced performance at a fraction of the cost of larger models” and outperforms Gemini 2.5 Flash
Get best-in-class intelligence for your highest-volume workloads. … Today, we're introducing Gemini 3.1 Flash-Lite …
The Keyword
Related Coverage
- Gemini 3.1 Flash-Lite's thinking levels: The feature that solves AI's speed vs. intelligence problem Digit · Vyom Ramani
- Google launches Gemini 3.1 Flash-Lite on AI Studio and Vertex AI TestingCatalog · Erin
- Google launches Gemini 3.1 Flash Lite as fastest and cheapest Gemini 3 model Crypto Briefing · Estefano Gomez
- Google launches speedy Gemini 3.1 Flash-Lite model in preview SiliconANGLE · Maria Deutscher
- Google Launches Gemini 3.1 Flash-Lite for Enterprise Scale WinBuzzer · Markus Kasanmascheff
- Google releases Gemini 3.1 Flash Lite at 1/8th the cost of Pro VentureBeat · Carl Franzen
- Google's fastest and cheapest model Gemini 3.1 Flash-Lite got smarter but also tripled the price The Decoder · Matthias Bastian
- Google Drops Gemini 3.1 Flash-Lite: A Cost-efficient Powerhouse with Adjustable Thinking Levels Designed for High-Scale Production AI MarkTechPost · Asif Razzaq
- Google just launched Gemini 3.1 Flash-Lite — 7 prompts to test its new ‘Thinking’ mode Tom's Guide · Amanda Caswell
- Google launches Gemini 3.1 Flash-Lite, its fastest Gemini 3 model yet The New Stack · Frederic Lardinois
- OpenAI, Google target lighter models The Deep View · Nat Rubio-Licht
- Claude is down, but Gemini 3.1 Flash-Lite is rolling out right now OnMSFT · Ron
- Big news for fast and cost-efficient AI! Gemini 3.1 Flash-Lite is here: — ⚡️ 2.5X faster Time to First Token — 📉 $0.25 per 1M input tokens … Karl Weinmeister
- Google launches Gemini 3.1 Flash-Lite, its “fastest and most cost-efficient” AI model The Economic Times
- Altman faces the fallout from OpenAI's Pentagon deal The Rundown AI
- Google Launches Gemini 3.1 Flash-Lite, Its Fastest and Cheapest AI Model Yet eWeek · Liz Ticong
- Google reveals dev-focused Gemini 3.1 Flash Lite, promises ‘best-in-class intelligence for your highest-volume workloads’ TechRadar · Craig Hale
Discussion
-
@koraykv
Koray Kavukcuoglu
on x
Gemini 3.1 Flash-Lite is available now! It takes an unbelievable amount of complex engineering to make AI feel instantaneous, enabling exciting new frontiers for experimentation! [video]
-
@googledeepmind
@googledeepmind
on x
Gemini 3.1 Flash-Lite has landed. It's our most cost-efficient Gemini 3 series model yet, built for intelligence at scale. Here's what's new 🧵 [video]
-
@google
@google
on x
Developers can now preview Gemini 3.1 Flash-Lite, our fastest and most cost-efficient Gemini 3 series model yet. With a 45% increase in output speed, it outperforms 2.5 Flash and features dynamic thinking levels to match task complexity. Rolling out in preview today in [video]
-
@artificialanlys
@artificialanlys
on x
Google has released Gemini 3.1 Flash-Lite Preview!...Key takeaways: ➤ Improved intelligence over Gemini 2.5 Flash-Lite: @GoogleDeepMind's Gemini 3.1 Flash-Lite Preview scores 34 on the Artificial Analysis Intelligence Index, up 12 points from Gemini 2.5 Flash-Lite (09-25). Howev…
-
@noamshazeer
Noam Shazeer
on x
📢Introducing Gemini 3.1 Flash-Lite, our fastest and most efficient model, built for high-volume workloads. It outperforms 2.5 Flash in reasoning, reliability, and scalability at a lower cost. This model also introduces thinking levels. You can adjust compute by complexity of
-
@sundarpichai
Sundar Pichai
on x
Gemini 3.1 Flash-Lite is the fastest and most cost-efficient Gemini 3 series model⚡️ It outperforms 2.5 Flash with a 2.5X faster Time to First Answer Token and a 45% increase in output speed, at a fraction of the cost of larger models!
-
@scaling01
@scaling01
on x
Gemini 3.1 Flash Lite Benchmarks [image]
-
@googledevs
@googledevs
on x
Today, we're introducing Gemini 3.1 Flash Lite (in preview) ⚡️ Now available via the Gemini API, our fastest and most cost-efficient Gemini 3 series model: - Features dynamic thinking for scaled reasoning - Delivers enhanced performance at a lower cost (priced at $0.25/1M input […
-
@_philschmid
Philipp Schmid
on x
Gemini 3.1 Flash-Lite is here! Our fastest, most cost-efficient gemini model built for high-volume workloads at scale. 💰 Priced at $0.25/M input, $1.50/M output tokens 🧠 Matches 2.5 Flash quality at Flash-Lite cost ⚡2.5x TFT and 45% faster output vs 2.5 Flash 💽 Enables low-late…
-
@chanduthota
Chandu Thota
on x
Excited to introduce Gemini 3.1 Flash-Lite, our fastest and most cost-efficient Gemini 3 series model designed for high-volume workloads. It's 2.5X faster Time to First Answer Token than 2.5 Flash (along with a 45% increase in output speed) ⚡️ [image]
-
@teksedge
David Hendrickson
on x
Here are the benchmarks for the newly released Gemini 3.1 Flash-Lite Preview, now available in AI Studio. Good pricing @ $1.5/1M tokens on par with new Chinese OS models. It beats Qwen3.5 397B (~$3/1M) in shared benchmarks, so a great deal. It DOES NOT beat GLM-5 (~$2.5/1M).
-
@googleai
@googleai
on x
Smarter. Faster. Gemini 3.1 Flash-Lite is here⚡ The model offers uncompromising speed & intelligence at scale by focusing on: — Cost-efficiency: Priced at just $0.25/1M input and $1.50/1M output tokens, it gets work done faster at a fraction of the cost of larger models, [video]
-
@veermasrani
Veer Masrani
on x
Gemini 3.1 Flash-Lite Just Dropped & Destroyed the Budget Model Tier Google's cheapest model and it's somehow the fastest in its class. The numbers at $0.25 input: > 363 tok/s — Flash-Lite > 108 tok/s — Claude Haiku ($1.00) > 71 tok/s — GPT-5 mini ($0.25) Adjustable “thinking le…
-
@dynamicwebpaige
@dynamicwebpaige
on x
smol but incredibly mighty! Gemini 3.1 Flash-Lite is an absolute speed demon (417 tokens/s!! 🏃♀️💨) but still punches far above its weight class. 💸 $0.25 per 1M input tokens 🧠 86.9% on GPQA Diamond ⚡️ 2.5X faster time-to-first-token A game changer for high-volume and agent
-
@altryne
Alex Volkov
on x
New Gemini is out! 3.1 Flash 3.1 looks like it beats the previous ⚡ models while also becoming cheaper!
-
@kimmonismus
@kimmonismus
on x
Gemini 3.1 Flash lite released and its next level price-performance-ratio Gemini 3.1 Flash-Lite, its fastest and most cost-efficient Gemini 3 model yet, built for high-volume developer workloads. Priced at just $0.25 per 1M input tokens and $1.50 per 1M output tokens, it [image]
-
@joshwoodward
Josh Woodward
on x
Meet Flash-Lite: the really, really fast one
-
@googledeepmind
@googledeepmind
on x
3.1 Flash-Lite outperforms 2.5 Flash with faster performance at a lower price. New ‘thinking levels’ let you dial in reasoning to adapt for different tasks, while still being able to handle complex workloads - like generating UI and dashboards or creating simulations. [image]
-
@tulseedoshi
Tulsee Doshi
on x
3.1 Flash-Lite is now live in the Gemini API 3.1 Flash-Lite outperforms 2.5 Flash across a majority of benchmarks, while being a little speedster! ⚡️⚡️⚡️ [video]
-
@googlecloudtech
@googlecloudtech
on x
Introducing Gemini 3.1 Flash-Lite, our fastest and most cost-efficient Gemini 3 series model. Built for high-volume workloads at scale, 3.1 Flash-Lite delivers high quality for its price and model tier. Rolling out in preview via Vertex AI → https://cloud.google.com/... [image]
-
@dasanaike
Noah Dasanaike
on x
socOCRbench has been updated to include the new Gemini 3.1 Flash-Lite, which achieves SOTA performance (by far) at this price point (and is best across all models for print tables!), as well as the new small Qwen 3.5 models, which are best in their respective size classes. [image…
-
@marmaduke091
@marmaduke091
on x
Gemini 3.1 Flash Lite seems to be a lot more expensive than its predecessor [image]
-
@mark_k
Mark Kretschmann
on x
Gemini 3.1 Flash-Lite was just released by @GoogleDeepMind, and the initial benchmarks are highly impressive for a model in this weight class. It is designed to be a high-volume, cost-sensitive workhorse that doesn't sacrifice quality. Based on the benchmark data, here is how it …
-
@jeffdean
Jeff Dean
on x
⚡ Excited to announce Gemini 3.1 Flash-Lite! We've set a new standard for efficiency and capability to give developers our fastest, most cost-effective Gemini 3 model yet. We engineered this model with thinking levels, allowing it to handle high-volume queries instantly, while [i…
-
@jeffdean
Jeff Dean
on x
Here's a side-by-side speed and accuracy comparison comparing Gemini 3.1 Flash Lite against our older Gemini 2.5 Flash model. Not only is the new model significantly faster in terms of tokens/s, it also needs many fewer tokens to accomplish complex tasks (~1/3 as many in this [vi…
-
@officiallogank
Logan Kilpatrick
on x
Introducing Gemini 3.1 Flash-Lite 🔦, a huge step forward on the boundary of intelligence, beating 2.5 Flash on many tasks. [image]
-
@tristanhurleb
Tristan Hurlebaus
on x
Here we are Gemini 3.1 Flash lite a supper fast small model from Google. [image]
-
r/Bard
r
on reddit
Gemini 3.1 Flash-Lite: Built for intelligence at scale