Sources: Nvidia plans to unveil a new AI inference chip at its GTC conference in March; the system will have a Groq-designed chip and OpenAI is a customer
Under pressure from rivals, the chip giant is set to offer a new product focused on rapid processing of AI queries for ‘inference’ demand
As inference splits into prefill and decode, Nvidia's Groq deal could enable a “Rubin SRAM” variant optimized for ultra-low latency agentic reasoning workloads
Nvidia is buying Groq for two reasons imo. 1) Inference is disaggregating into prefill and decode.
The Groq deal secures key talent for Nvidia, including CEO Jonathan Ross, creator of the TPU, and keeps that talent away from companies that may try to make their own chips
Twas the night before Christmas and all through the house, not a creature was stirring, not even a... wait. What's that?
Nvidia agrees to a licensing deal with Groq; CEO Jonathan Ross and other top executives will join Nvidia; Groq says it will continue operating independently
Nvidia Corp. agreed to a licensing deal with artificial intelligence startup Groq, furthering its investments in companies connected …
Sources: Saudi Arabia's sovereign wealth fund-backed AI company Humain picked US-based chipmaker Groq for inference; Groq plans to expand its Dammam data center
THE SCOOP — Saudi Arabia's sovereign wealth fund-backed artificial intelligence company HUMAIN has selected US chipmaker …