Moonshot says Kimi K2.5 builds on K2 with “pretraining over ~15T mixed visual and text tokens” and “can self-direct an agent swarm with up to 100 sub-agents”
Today, we are introducing Kimi K2.5, the most powerful open-source model to date.
Chinese startup Moonshot releases Kimi K2.5, saying the model can process text, images, and videos simultaneously and beats its open-source peers in some tests
Alibaba Group Holding Ltd.-backed Moonshot AI released an upgrade of its flagship model, heating up a domestic arms race ahead …
A source close to Thinking Machines Lab alleges ex-CTO Barret Zoph, who is returning to OpenAI, had shared confidential company information with competitors
The departures are a blow for Thinking Machines Lab. Two narratives are already emerging about why they happened.
Thinking Machines Lab parts ways with CTO Barret Zoph, with Soumith Chintala taking over the role; sources say the termination is due to “unethical conduct”
BREAKING: Thinking Machines has terminated its CTO, Barret Zoph, due to unethical conduct according to two sources familiar with the matter. CEO Mira Murati announced the news at a...
Essential AI, whose CEO co-wrote Google's Attention Is All You Need paper, unveils Rnj-1, an 8B-parameter open model with SWE-bench performance close to GPT-4o
The long-term advancement and equitable diffusion of AI technologies crucially depend on their development in the Open.
Z.ai releases GLM-4.6, an open-weights model with a context window of up to 200K tokens, claiming near parity with Claude Sonnet 4 on coding and reasoning tasks
Today, we are releasing the latest version of our flagship model: GLM-4.6. Compared with GLM-4.5, this generation brings several key improvements:
Xiaomi unveils open-source AI reasoning model MiMo, joining other Chinese tech leaders hoping to make a splash in the burgeoning AI field endorsed by Beijing
Xiaomi debuted MiMo a day after Alibaba unveiled the latest version of its own flagship model, amplifying a race between China's tech players …
Google unveils Gemma 3, the “world's best single-accelerator model”, running on a single GPU, in 1B, 4B, 12B, and 27B sizes, and says it outperforms Llama-405B
Following version 1 in February 2024 and 2 in May, Google today announced Gemma 3 as its latest open model for developers.
Alibaba's Qwen team releases Qwen2.5-VL, a new series of AI models that can control PCs and phones, as well as perform a number of text and image analysis tasks
QWEN CHAT GITHUB HUGGING FACE MODELSCOPE DISCORD Anusuya Lahiri / Benzinga : Not Just DeepSeek - Alibaba Unveils AI Model To Rival OpenAI's Operator Markus Kasanmascheff / WinBuzze...
Google releases Gemini 2.0 Flash Thinking, an experimental “reasoning” model that “explicitly shows its thoughts” and can use them to strengthen its reasoning
Quick: what sort of prompts should you run against GPT-4o vs Gemini 1.5 Flash vs o1 vs o1-pro vs gemini-2.0-flash-thinking-exp? X: Jeff Dean / @jeffdean : Introducing Gemini 2.0 Fl...