Nvidia debuts Nemotron 3 Super, a 120B-parameter hybrid MoE open-weight model; filing: Nvidia plans to spend $26B over the next five years to build open models
The move could position the AI infrastructure powerhouse to quickly compete with OpenAI, Anthropic, and DeepSeek.
Wired Will Knight
Related Coverage
- Jack Dorsey Praises Nvidia's $26 Billion Bet On Open AI Models: ‘This Would Be Excellent’ Benzinga · Ananya Gairola
- Introducing Nemotron 3 Super: An Open Hybrid Mamba-Transformer MoE for Agentic Reasoning NVIDIA Technical Blog
- NVIDIA Nemotron 3 Super NVIDIA Nemotron
- Nvidia Is Making a Massive $26 Billion Bet on the Future of Artificial Intelligence (AI) Motley Fool · Danny Vena
- New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI NVIDIA · Kari Briski
- Nvidia's new open weights Nemotron 3 super combines three different architectures to beat gpt-oss and Qwen in throughput VentureBeat · Carl Franzen
- Nvidia boosts open models with Nemotron 3 Super The Deep View
- Nvidia's Nemotron Super 3 model for agentic systems launches with five times higher throughput SiliconANGLE · Mike Wheatley
- NVIDIA Nemotron 3 Super now available on Workers AI Cloudflare
- Thrilled to release Nemotron 3 Super (120B/12B Active)! — Designed for Blackwell, it features a Hybrid SSM Latent-MoE architecture with MTP … Mostofa Patwary
- NVIDIA Nemotron 3 Super is here! — Our team worked incredibly hard to build this model. It is designed for Blackwell and pretrained in NVFP4. … Jiantao Jiao
- This is awesome. Nvidia makes great open-source models and glad they have one in their repertoire with a 1M token context window. … Samuel G. Rodriques
- Nvidia launches Nemotron 3 Super to power enterprise AI agents InfoWorld · Prasanth Aby Thomas
- Nvidia launches Nemotron 3 Super, a 120B open model for large-scale AI systems The New Stack · Frederic Lardinois
- Nvidia Nemotron: Much needed open-source model champion in US Constellation Research · Larry Dignan
- Nvidia launches Nemotron 3 Super, an open model to build cheaper and accurate AI agents The Indian Express
- Nvidia Commits $26 Billion to Open-Weight AI, Ships Nemotron 3 Super Implicator.ai · Marcus Schuler
- Nvidia steps into the open-source AI gap that OpenAI, Meta, and Anthropic left behind The Decoder · Maximilian Schreiner
- Nvidia Just Changed the Economics of AI Agents Shelly Palmer
- Forget basketball. Next week's Nvidia GTC is the real March Madness for AI Fortune · Sharon Goldman
- Palantir and Nvidia build sovereign, on-premises AI reference architecture Constellation Research · Larry Dignan
- Nvidia Bets $26 Billion On Open-Source AI Revolution Benzinga · Anusuya Lahiri
- Australian investors offered bite at Nvidia-backed DeepSeek AI rival Australian Financial Review · Gus McCubbing
Discussion
-
@kuchaev
Oleksii Kuchaiev
on x
Nemotron 3 Super is here — 120B total / 12B active, Hybrid SSM Latent MoE, designed for Blackwell. Truly open: permissive license, open data, open training infra. See analysis on @ArtificialAnlys Details in thread 🧵below: [image]
-
@artificialanlys
@artificialanlys
on x
NVIDIA has released Nemotron 3 Super, a 120B (12B active) open weights reasoning model that scores 36 on the Artificial Analysis Intelligence Index with a hybrid Mamba-Transformer MoE architecture We were given access to this model ahead of launch and evaluated it across [image]
-
@ctnzr
Bryan Catanzaro
on x
Announcing NVIDIA Nemotron 3 Super! 💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell 💚36 on AAIndex v4 💚up to 2.2X faster than GPT-OSS-120B in FP4 💚Open data, open recipe, open weights Models, Tech report, etc. here: https://research.nvidia.com/ ... And yes, Ultra is comin…
-
@ggerganov
Georgi Gerganov
on x
In collaboration with NVIDIA we announce support for the new NVIDIA Nemotron 3 Super model in llama.cpp NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.
-
@nvidiaaidev
@nvidiaaidev
on x
This latest addition to the Nemotron family isn't just a bigger Nano. ✅ Up to 5x higher throughput and 2x accuracy than the previous version ✅ Latent MoE that calls 4x as many expert specialists for the same inference cost ✅ Multi-token prediction that dramatically reduces [imag…
-
@nvidianewsroom
@nvidianewsroom
on x
NVIDIA Nemotron 3 Super is here to accelerate the era of agentic AI. Optimized for NVIDIA Blackwell, this 120B open model uses a hybrid Mixture-of-Experts (MoE) architecture that delivers 5x higher throughput and 2x higher accuracy. The model combines advanced reasoning with a
-
@miles_brundage
Miles Brundage
on x
I don't think there's a *super* strong reason to take this more seriously than Meta's earlier commitment to open source which was walked back, but a weak reason to think it's real is that NVIDIA benefits from model commoditization more than Meta did https://x.com/...
-
@manuelfaysse
Manuel Faysse
on x
If you ever wondered how LLMs became so good at MMLU, the Nvidia Nemotron 3 Super reports that 11.1% of Pretraining Phase 1 data (20T tokens) is MMLU-style SFT data, so over 2T tokens of synthetic tokens specifically designed to reach the coveted 86% performance. [image]
-
@samhogan
Sam Hogan
on x
We've been testing Nemotron 3 Super for the last few weeks. TL;DR: it's easily the best Open Source American model for its size. Super fast. Great for agents and tool-calling use cases. We'll be shipping a series of post-trained Nemtron models in the coming weeks.
-
@_albertgu
Albert Gu
on x
as always, exciting to see NVIDIA continue to invest in Mamba hybrids and true open source. very impressive results!
-
@jiantaoj
Jiantao Jiao
on x
Nemotron 3 Super arrived! With efficiency in mind (Hybrid SSM Latent MoE, designed for Blackwell), the accuracy is also incredible. The most important aspect is scaling RL, utilizing the highly efficient and scalable Nemo Gym backend for RL environments and Nemo RL for model
-
@danprimack
Dan Primack
on x
Nvidia is both funding and competing with the LLM giants
-
@nvidiaai
@nvidiaai
on x
“We're an American company, but we work with companies across the world,” @ctnzr says. “It's in our interest to make the ecosystem diverse and strong everywhere.” Read more on how we are committed to helping the AI ecosystem develop through our open source models.
-
@igtmn
Igor Gitman
on x
Nemotron 3 Super is out! It's really good and it will only get better from here. And we release all the details - tech report, training code, training data, model weights. Everything you need to build a model like this yourself!
-
@mweinbach
Max Weinbach
on x
Trying out the new Nvidia Nemotron 3 Super model on my Mac Studio! [image]
-
@kevinbankston
Kevin Bankston
on x
Good. And smart move.
-
@jack
@jack
on x
this would be excellent
-
@cloudflaredev
@cloudflaredev
on x
Building multi-agent systems? @NVIDIA's Nemotron 3 Super (120B A12B) is now on Workers AI. - Reasoning and tool calling for complex multi-agent workflows - Built for code, finance, cybersecurity, and search agent use cases Learn more: https://developers.cloudflare.com/ ...
-
@nvidia
@nvidia
on x
New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI
-
@natolambert
Nathan Lambert
on x
This looks like a model that's competitive with GPT OSS 120B or similar Qwen3.5 models on intelligence & speed, while coming with tons of open data + training details. Is a huge contribution for the ecosystem. Congrats Nvidia on the Nemotron 3 Super release!
-
@dr_alphalyrae
Vega Shah
on x
Today we launch NVIDIA's Nemotron Super 3, a 120B param open model designed to run agentic AI systems across scientific, enterprise and industrial applications. Partners working with us include Dassault Systèmes, Palantir Technologies, Lila Sciences and Edison Scientific Key [ima…
-
@nvidiaaidev
@nvidiaaidev
on x
🦞These innovations come together to create a model that is well suited for long-running autonomous agents. On PinchBench—a benchmark for evaluating LLMs as @OpenClaw coding agents—Nemotron 3 Super scores 85.6% across the full test suite, making it the best open model in its [imag…
-
@zeffmax
Max Zeff
on x
great scoop from will. for context, Anthropic said in a court filing a few days ago that it spent $10 billion on AI model training and inference in its whole lifetime... can't count out nvidia's models in the coming years!
-
@kimmonismus
@kimmonismus
on x
NVIDIA just dropped Nemotron 3 Super - and the architecture is wild. I was able to check it out early, and I love it (thanks, @nvidia) -120B parameters, but only 12B active. -A hybrid Mamba-Transformer MoE design that squeezes serious intelligence out of minimal compute. What [im…
-
@nvidiaaidev
@nvidiaaidev
on x
Introducing NVIDIA Nemotron 3 Super 🎉 Open 120B-parameter (12B active) hybrid Mamba-Transformer MoE model Native 1M-token context Built for compute-efficient, high-accuracy multi-agent applications Plus, fully open weights, datasets and recipes for easy customization and [video]
-
@benitoz
Ben Pouladian
on x
Nemotron 3 Super ships exactly what I mapped in December: Mamba hybrid, Latent MoE, multi-token prediction, NVFP4 on Blackwell 120B params, 12B active, 5x throughput Full-stack co-design, silicon to model No paywall👇🏽 https://bepresearch.substack.com/ ...
-
r/technology
r
on reddit
Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show
-
r/ArtificialInteligence
r
on reddit
Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show
-
r/nvidia
r
on reddit
Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show
-
@willknight
Will Knight
on x
Scoop from me: Nvidia will spend a total of $26 billion over the next five years building the world's best open source models. America is back in the open source AI race! https://www.wired.com/...
-
@vllm_project
@vllm_project
on x
🎉 Congrats to @nvidia on the release of Nemotron 3 Super — day-0 support in vLLM v0.17.1! Verified on NVIDIA GPUs. 120B hybrid MoE, only 12B active at inference. Big upgrades over the previous Nemotron Super: - 5x higher throughput - 2x higher accuracy on Artificial Analysis [ima…
-
@ollama
@ollama
on x
NVIDIA Nemotron 3 Super is now available on Ollama. ollama run nemotron-3-super:cloud 🦞Try it with OpenClaw: ollama launch openclaw —model nemotron-3-super:cloud Run it locally on your device: ollama run nemotron-3-super > 120B mixture of experts model with 12B active >
-
@kevinsxu
Kevin S. Xu
on x
This move makes business, technical, and geopolitical sense — a rare strategic trifecta
-
@andrewwhite01
@andrewwhite01
on x
We've been testing Nemotron 3 Super for a bit prerelease and it's very competitive with top open models while also being EXTREMELY fast. Congrats to the team! I'm very optimistic about the seriousness of NVIDIA nemotron team and this is great news for open models
-
@jeffreyhuber
Jeff Huber
on x
anyone who tells you they know how this market will turn out is lying
-
@rokomijic
Roko
on x
NVidia is commoditizing its complement
-
@vincentweisser
Vincent Weisser
on x
Really awesome model release by Nvidia!!
-
@novasarc01
Λux
on x
leave everything and read the nemotron 3 super technical report!! [image]
-
@alexfinn
Alex Finn
on x
This is the best news of 2026 Nvidia is going all in on open models. Something no other American AI company has had the balls to do Open source is the American way. Democratized, equal opportunity for all. Yet China has been dominating on this. No more Here's what this means:
-
@jacquesthibs
Jacques
on x
This was the obvious outcome. Better open-source models means more competition, which means more demand for GPUs. This is good news imo, makes sovereign AI outside of the US more of a possibility.
-
@scaling01
@scaling01
on x
Nvidia released Nemotron 3 Super - a 120B-A12B hybrid Mamba model with LatentMoE and MTP - pre-trained on 25T tokens in NVFP4 - context up to 1M - 2.2X faster inference than GPT-OSS-120B - 7.5X faster inference than Qwen3.5-122B https://huggingface.co/... [image]
-
@bubbleboi
Bubble Boi
on x
What does this mean for labs lol?
-
@opencode
@opencode
on x
NVIDIA's new open source model is now free on OpenCode Zen Nemotron 3 Super is a mid sized model that is - fast - fully open source - 1M context
-
@sgrodriques
Sam Rodriques
on x
Nemotron 3 super is out today. It is blazingly fast, has long context, and is very competitive. Nvidia has a very serious team working on Nemotron, and we have a deep partnership with Nvidia training specialist agents on their models. We're excited to continue.
-
@nvidiaaidev
@nvidiaaidev
on x
We 💚 open models
-
@mark_k
Mark Kretschmann
on x
NVIDIA just announced their latest release: Nemotron 3 Super. It is an impressive open 120B-parameter model built specifically for compute-efficient, high-accuracy multi-agent applications. Under the hood, it utilizes a hybrid Mamba-Transformer Mixture-of-Experts (MoE) [image]
-
@ianandrewsdc
Ian Andrews
on x
TL;DR Nemotron 3 Super punches far above its weight class. In spite of being less than a tenth of the size of frontier models, it was able to fluently use the harness, call tools, navigate the codebase using Bash and identify some surprising bugs in the code.
-
@kwindla
@kwindla
on x
NVIDIA Nemotron 3 Super launches today! We've been building voice agents with Super's pre-release checkpoints and running all our various tests and benchmarks. Nemotron 3 Super matches both GPT-5.4 and GPT-4.1 in tool calling and instruction following performance on our realtime …
-
@cedric_chee
Cedric
on x
NVIDIA has dropped Nemotron 3 Super, an open weights reasoning model with a hybrid Mamba-Transformer MoE architecture. Efficient.
-
@sudoraohacker
Arun Rao
on x
Exciting release of Nemotron's mid-sized “Super” (120b) model, with strong benchmark scores vs GLM 4.5 and a detailed tech report that is worth reading. The model has been optimized for accuracy, inference cost, and speed. Personally, I find the use of synthetic data generation, …
-
@joshpurtell
Josh
on x
RL's back on the menu boys
-
@artificialanlys
@artificialanlys
on x
NVIDIA is focused on efficient intelligence for the Nemotron family, and we tested inference performance against peer models to see the impact of the architecture choices. We ran self-hosted throughput tests across a range of peer models using a simple methodology with workloads …
-
@soubhik_deb
@soubhik_deb
on x
the whole AI stack is having its moment of vertical integration > closed AI labs going down vertical to design own chips > closed AI labs going down vertical to establish own datacenters > vibecoding and finetuning platforms going down vertical to build own models > nvidia