Alibaba Cloud details a GPU pooling system that it claims reduced the number of Nvidia H20s required by 82% when serving dozens of LLMs of up to 72B parameters
up to 9x increase in output lets 213 GPUs perform like 1,192 ACM Digital Library : Aegaeon: Effective GPU Pooling for Concurrent LLM Serving on the Market Rounak Jain / Benzinga : Alibaba Cloud's New ...
Sources: the US Commerce Department's BIS approves several billion dollars' worth of Nvidia GPU exports to the UAE, a first step in May 2025's bilateral AI deal
Mackenzie Hawkins / Bloomberg :
Source: Nvidia is in advanced talks to acquire Lepton AI, which rents Nvidia GPU-based servers, for several hundred million dollars; Lepton raised $11M in 2023
Nvidia is in advanced talks to buy Lepton AI, a two-year-old startup that rents out servers powered by Nvidia's artificial intelligence chips …
Cloud-based Nvidia GPU provider CoreWeave files for an IPO on the Nasdaq under CRWV and says 2024 revenue was up 737% YoY to $1.92B and it had an $863M net loss
CoreWeave, a provider of cloud-based Nvidia processors to companies including Meta and Microsoft, is headed for the public market.
An interview with Arkady Volozh, CEO of Nebius, formerly Yandex NV, on selling its Russian assets and pivoting to become a full-stack AI infrastructure provider
Yandex plans to triple its Nvidia GPU deployments
Enfabrica, which sells “hub and spoke” networking chips to scale Nvidia GPU-based AI data centers, raised a $125M Series B led by Atreides Management
Stephen Nellis / Reuters :
Microsoft adds an entry-level $1,699 Surface Book with Nvidia GPU and a $3,199 model with 1TB solid state storage and top-end specs to its lineup
Paul Lilly / Maximum PC :
Microsoft adds an entry-level $1,699 Surface Book with Nvidia GPU and a $3,199 model with 1TB solid state storage and top-end specs to its lineup
Paul Lilly / Maximum PC :