Sources: Meta's new AI model, codenamed Avocado, may launch in spring 2026 as a “closed” model, and was trained using Google's Gemma, OpenAI's gpt-oss, and Qwen
Meta Platforms Inc.'s Mark Zuckerberg, months into building one of the priciest teams in technology history …
Mira Murati's Thinking Machines Lab launches its first product, Tinker, an API for fine-tuning language models, in private beta, with support for Qwen and Llama
Today, we are launching Tinker, a flexible API for fine-tuning language models. Moneycontrol : Ex-OpenAI CEO Mira Murati stealth AI lab launches its first ever product Matthias Bastian / The Decoder :...
Alibaba releases the Qwen3-VL vision models, the Qwen3Guard “safety moderation” models, and three closed-weight models, including Qwen3-Max with 1T+ parameters
Julian Nabil / Forbes Middle East : Alibaba Introduces Qwen3-Max AI Model With Over 1T Parameters Markus Kasanmascheff / WinBuzzer : Alibaba Releases Qwen3-VL O...
Alibaba releases Qwen3-Next, a new model architecture optimized for long-context understanding, large parameter scale, and better computational efficiency
the FUTURE of efficient LLMs is here! 🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B (esp. @ 32K+ context!) 🔹 Hybrid Architecture: Gated Delta...
OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM
gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good OpenAI on GitHub : ...
Alibaba debuts the Qwen3-Coder model for agentic coding, including a 480B-parameter MoE variant, and open sources Qwen Code, a CLI tool adapted from Gemini CLI
Coco Feng / South China Morning Post : Alibaba upgrades flagship Qwen3 model to outperform OpenAI, DeepSeek in maths, c...
Alibaba releases Qwen2.5-Omni-3B, a scaled-down 3B-parameter variant of its flagship 7B-parameter multimodal model, designed to run on consumer PCs and laptops
Qwen2.5-Omni — Qwen2.5-Omni is an end-to-end multimodal model designed … Asif Razzaq / MarkTechPost : Multimodal AI on Developer GPUs: Alibaba Releases Qwen2.5-Omni-3B with 50% Lower VRAM Usage and ...
Alibaba releases open-source reasoning model QwQ-32B on Hugging Face and ModelScope, claiming comparable performance to DeepSeek-R1 but with lower compute needs
Introduction QwQ is the reasoning model of the Qwen series. Paul Barker / InfoWorld : Alibaba says its new AI model rivals DeepSeek's R1, OpenAI's o1 Jose Antonio Lanz / Decrypt : Alibaba's Latest A...
Alibaba's Qwen team releases Qwen2.5-VL, a new series of AI models that can control PCs and phones, as well as perform a number of text and image analysis tasks
Anusuya Lahiri / Benzinga : Not Just DeepSeek - Alibaba Unveils AI Model To Rival OpenAI's Operator Markus Kasanmascheff / WinBuzzer : Alibaba Qwen Cha...
Alibaba releases QvQ-72B-Preview, an experimental research model focused on “enhancing visual reasoning capabilities”, built on Qwen2-VL-72B
QVQ-72B-Preview is an experimental research model developed by the Qwen team … QwenLM on GitHub : Qwen2-VL — Introduction After a year's relentless efforts, today we are thrilled to release Qwen2-VL...