Nvidia debuts Nemotron 3 Super, a 120B-parameter hybrid MoE open-weight model; filing: Nvidia plans to spend $26B over the next five years to build open models

The move could position the AI infrastructure powerhouse to quickly compete with OpenAI, Anthropic, and DeepSeek.

Wired 2026-03-12 Will Knight

Discussion

@kuchaev Oleksii Kuchaiev on x
Nemotron 3 Super is here — 120B total / 12B active, Hybrid SSM Latent MoE, designed for Blackwell. Truly open: permissive license, open data, open training infra. See analysis on @ArtificialAnlys Details in thread 🧵below: [image]
@artificialanlys @artificialanlys on x
NVIDIA has released Nemotron 3 Super, a 120B (12B active) open weights reasoning model that scores 36 on the Artificial Analysis Intelligence Index with a hybrid Mamba-Transformer MoE architecture We were given access to this model ahead of launch and evaluated it across [image]
@ctnzr Bryan Catanzaro on x
Announcing NVIDIA Nemotron 3 Super! 💚120B-12A Hybrid SSM Latent MoE, designed for Blackwell 💚36 on AAIndex v4 💚up to 2.2X faster than GPT-OSS-120B in FP4 💚Open data, open recipe, open weights Models, Tech report, etc. here: https://research.nvidia.com/ ... And yes, Ultra is comin…
@ggerganov Georgi Gerganov on x
In collaboration with NVIDIA we announce support for the new NVIDIA Nemotron 3 Super model in llama.cpp NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.
@nvidiaaidev @nvidiaaidev on x
This latest addition to the Nemotron family isn't just a bigger Nano. ✅ Up to 5x higher throughput and 2x accuracy than the previous version ✅ Latent MoE that calls 4x as many expert specialists for the same inference cost  ✅ Multi-token prediction that dramatically reduces [imag…
@nvidianewsroom @nvidianewsroom on x
NVIDIA Nemotron 3 Super is here to accelerate the era of agentic AI. Optimized for NVIDIA Blackwell, this 120B open model uses a hybrid Mixture-of-Experts (MoE) architecture that delivers 5x higher throughput and 2x higher accuracy. The model combines advanced reasoning with a
@miles_brundage Miles Brundage on x
I don't think there's a *super* strong reason to take this more seriously than Meta's earlier commitment to open source which was walked back, but a weak reason to think it's real is that NVIDIA benefits from model commoditization more than Meta did https://x.com/...
@manuelfaysse Manuel Faysse on x
If you ever wondered how LLMs became so good at MMLU, the Nvidia Nemotron 3 Super reports that 11.1% of Pretraining Phase 1 data (20T tokens) is MMLU-style SFT data, so over 2T tokens of synthetic tokens specifically designed to reach the coveted 86% performance. [image]
@samhogan Sam Hogan on x
We've been testing Nemotron 3 Super for the last few weeks. TL;DR: it's easily the best Open Source American model for its size. Super fast. Great for agents and tool-calling use cases. We'll be shipping a series of post-trained Nemtron models in the coming weeks.
@_albertgu Albert Gu on x
as always, exciting to see NVIDIA continue to invest in Mamba hybrids and true open source. very impressive results!
@jiantaoj Jiantao Jiao on x
Nemotron 3 Super arrived! With efficiency in mind (Hybrid SSM Latent MoE, designed for Blackwell), the accuracy is also incredible. The most important aspect is scaling RL, utilizing the highly efficient and scalable Nemo Gym backend for RL environments and Nemo RL for model
@danprimack Dan Primack on x
Nvidia is both funding and competing with the LLM giants
@nvidiaai @nvidiaai on x
“We're an American company, but we work with companies across the world,” @ctnzr says. “It's in our interest to make the ecosystem diverse and strong everywhere.” Read more on how we are committed to helping the AI ecosystem develop through our open source models.
@igtmn Igor Gitman on x
Nemotron 3 Super is out! It's really good and it will only get better from here. And we release all the details - tech report, training code, training data, model weights. Everything you need to build a model like this yourself!
@mweinbach Max Weinbach on x
Trying out the new Nvidia Nemotron 3 Super model on my Mac Studio! [image]
@kevinbankston Kevin Bankston on x
Good. And smart move.
@jack @jack on x
this would be excellent
@cloudflaredev @cloudflaredev on x
Building multi-agent systems? @NVIDIA's Nemotron 3 Super (120B A12B) is now on Workers AI. - Reasoning and tool calling for complex multi-agent workflows - Built for code, finance, cybersecurity, and search agent use cases Learn more: https://developers.cloudflare.com/ ...
@nvidia @nvidia on x
New NVIDIA Nemotron 3 Super Delivers 5x Higher Throughput for Agentic AI
@natolambert Nathan Lambert on x
This looks like a model that's competitive with GPT OSS 120B or similar Qwen3.5 models on intelligence & speed, while coming with tons of open data + training details. Is a huge contribution for the ecosystem. Congrats Nvidia on the Nemotron 3 Super release!
@dr_alphalyrae Vega Shah on x
Today we launch NVIDIA's Nemotron Super 3, a 120B param open model designed to run agentic AI systems across scientific, enterprise and industrial applications. Partners working with us include Dassault Systèmes, Palantir Technologies, Lila Sciences and Edison Scientific Key [ima…
@nvidiaaidev @nvidiaaidev on x
🦞These innovations come together to create a model that is well suited for long-running autonomous agents. On PinchBench—a benchmark for evaluating LLMs as @OpenClaw coding agents—Nemotron 3 Super scores 85.6% across the full test suite, making it the best open model in its [imag…
@zeffmax Max Zeff on x
great scoop from will. for context, Anthropic said in a court filing a few days ago that it spent $10 billion on AI model training and inference in its whole lifetime... can't count out nvidia's models in the coming years!
@kimmonismus @kimmonismus on x
NVIDIA just dropped Nemotron 3 Super - and the architecture is wild. I was able to check it out early, and I love it (thanks, @nvidia) -120B parameters, but only 12B active. -A hybrid Mamba-Transformer MoE design that squeezes serious intelligence out of minimal compute. What [im…
@nvidiaaidev @nvidiaaidev on x
Introducing NVIDIA Nemotron 3 Super 🎉 Open 120B-parameter (12B active) hybrid Mamba-Transformer MoE model Native 1M-token context Built for compute-efficient, high-accuracy multi-agent applications Plus, fully open weights, datasets and recipes for easy customization and [video]
@benitoz Ben Pouladian on x
Nemotron 3 Super ships exactly what I mapped in December: Mamba hybrid, Latent MoE, multi-token prediction, NVFP4 on Blackwell 120B params, 12B active, 5x throughput Full-stack co-design, silicon to model No paywall👇🏽 https://bepresearch.substack.com/ ...
r/technology r on reddit
Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show
r/ArtificialInteligence r on reddit
Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show
r/nvidia r on reddit
Nvidia Will Spend $26 Billion to Build Open-Weight AI Models, Filings Show
@willknight Will Knight on x
Scoop from me: Nvidia will spend a total of $26 billion over the next five years building the world's best open source models. America is back in the open source AI race! https://www.wired.com/...
@vllm_project @vllm_project on x
🎉 Congrats to @nvidia on the release of Nemotron 3 Super — day-0 support in vLLM v0.17.1! Verified on NVIDIA GPUs. 120B hybrid MoE, only 12B active at inference. Big upgrades over the previous Nemotron Super: - 5x higher throughput - 2x higher accuracy on Artificial Analysis [ima…
@ollama @ollama on x
NVIDIA Nemotron 3 Super is now available on Ollama. ollama run nemotron-3-super:cloud 🦞Try it with OpenClaw: ollama launch openclaw —model nemotron-3-super:cloud Run it locally on your device: ollama run nemotron-3-super > 120B mixture of experts model with 12B active >
@kevinsxu Kevin S. Xu on x
This move makes business, technical, and geopolitical sense — a rare strategic trifecta
@andrewwhite01 @andrewwhite01 on x
We've been testing Nemotron 3 Super for a bit prerelease and it's very competitive with top open models while also being EXTREMELY fast. Congrats to the team! I'm very optimistic about the seriousness of NVIDIA nemotron team and this is great news for open models
@jeffreyhuber Jeff Huber on x
anyone who tells you they know how this market will turn out is lying
@rokomijic Roko on x
NVidia is commoditizing its complement
@vincentweisser Vincent Weisser on x
Really awesome model release by Nvidia!!
@novasarc01 &LAMBDA;ux on x
leave everything and read the nemotron 3 super technical report!! [image]
@alexfinn Alex Finn on x
This is the best news of 2026 Nvidia is going all in on open models. Something no other American AI company has had the balls to do Open source is the American way. Democratized, equal opportunity for all. Yet China has been dominating on this. No more Here's what this means:
@jacquesthibs Jacques on x
This was the obvious outcome. Better open-source models means more competition, which means more demand for GPUs. This is good news imo, makes sovereign AI outside of the US more of a possibility.
@scaling01 @scaling01 on x
Nvidia released Nemotron 3 Super - a 120B-A12B hybrid Mamba model with LatentMoE and MTP - pre-trained on 25T tokens in NVFP4 - context up to 1M - 2.2X faster inference than GPT-OSS-120B - 7.5X faster inference than Qwen3.5-122B https://huggingface.co/... [image]
@bubbleboi Bubble Boi on x
What does this mean for labs lol?
@opencode @opencode on x
NVIDIA's new open source model is now free on OpenCode Zen Nemotron 3 Super is a mid sized model that is - fast - fully open source - 1M context
@sgrodriques Sam Rodriques on x
Nemotron 3 super is out today. It is blazingly fast, has long context, and is very competitive. Nvidia has a very serious team working on Nemotron, and we have a deep partnership with Nvidia training specialist agents on their models. We're excited to continue.
@nvidiaaidev @nvidiaaidev on x
We 💚 open models
@mark_k Mark Kretschmann on x
NVIDIA just announced their latest release: Nemotron 3 Super. It is an impressive open 120B-parameter model built specifically for compute-efficient, high-accuracy multi-agent applications. Under the hood, it utilizes a hybrid Mamba-Transformer Mixture-of-Experts (MoE) [image]
@ianandrewsdc Ian Andrews on x
TL;DR Nemotron 3 Super punches far above its weight class. In spite of being less than a tenth of the size of frontier models, it was able to fluently use the harness, call tools, navigate the codebase using Bash and identify some surprising bugs in the code.
@kwindla @kwindla on x
NVIDIA Nemotron 3 Super launches today! We've been building voice agents with Super's pre-release checkpoints and running all our various tests and benchmarks. Nemotron 3 Super matches both GPT-5.4 and GPT-4.1 in tool calling and instruction following performance on our realtime …
@cedric_chee Cedric on x
NVIDIA has dropped Nemotron 3 Super, an open weights reasoning model with a hybrid Mamba-Transformer MoE architecture. Efficient.
@sudoraohacker Arun Rao on x
Exciting release of Nemotron's mid-sized “Super” (120b) model, with strong benchmark scores vs GLM 4.5 and a detailed tech report that is worth reading. The model has been optimized for accuracy, inference cost, and speed. Personally, I find the use of synthetic data generation, …
@joshpurtell Josh on x
RL's back on the menu boys
@artificialanlys @artificialanlys on x
NVIDIA is focused on efficient intelligence for the Nemotron family, and we tested inference performance against peer models to see the impact of the architecture choices. We ran self-hosted throughput tests across a range of peer models using a simple methodology with workloads …
@soubhik_deb @soubhik_deb on x
the whole AI stack is having its moment of vertical integration > closed AI labs going down vertical to design own chips > closed AI labs going down vertical to establish own datacenters > vibecoding and finetuning platforms going down vertical to build own models > nvidia

Chronicles

Nvidia debuts Nemotron 3 Super, a 120B-parameter hybrid MoE open-weight model; filing: Nvidia plans to spend $26B over the next five years to build open models

Related Coverage

Discussion