Alibaba releases the Qwen3-VL vision models, the Qwen3Guard “safety moderation” models, and three closed-weight models, including Qwen3-Max with 1T+ parameters
Julian Nabil / Forbes Middle East : Alibaba Introduces Qwen3-Max AI Model With Over 1T Parameters Markus Kasanmascheff / WinBuzzer : Alibaba Releases Qwen3-VL O...
Alibaba debuts the Qwen3-Coder model for agentic coding, including a 480B-parameter MoE variant, and open sources Qwen Code, a CLI tool adapted from Gemini CLI
Coco Feng / South China Morning Post : Alibaba upgrades flagship Qwen3 model to outperform OpenAI, DeepSeek in maths, c...
Sources: Meta's new superintelligence lab led by Alexandr Wang discussed abandoning its top open-source model, Behemoth, in favor of developing a closed model
that may be changing Robert Brown / implicator.ai : From Altruism to Arms Race: Meta's AI Strategy Takes a Closed Turn Bluesky: M.G. Siegler / @mgsiegler.com : Yeah... (from a couple weeks back) [emb...
Baidu releases Ernie X1, an AI model that articulates its reasoning similarly to DeepSeek R1, and upgrades its flagship foundation model to Ernie 4.5, both free
And Costs Half As Much: ‘Excellent Multimodal Understanding Ability’ Nisha Gopalan / Yahoo Finance : China's Baidu Takes on DeepSeek With New AI Model Mike Wheatley / SiliconANGLE : Baidu debuts its f...
OpenAI launches GPT-4.5, its “most knowledgeable model yet” in research preview, initially warning it's not a frontier model and may perform below o1 or o3-mini
GPT-4.5 rollout delayed due to lack of processing power Gerui Wang / Forbes : Open AI's GPT-4.5 Drops As AI Race Escalates Nathan Lambert / Interconnects : GPT-4.5: “Not a frontier model”? Charlie Guo...
Sam Altman says OpenAI was forced to stagger GPT-4.5's rollout because it is “out of GPUs”; the model is wildly expensive, costing $75 per million input tokens
Hopefully the output is worth it? 🤔 — Oh... 😥 — www.theverge.com/news/620021/ ... X: Ed Zitron / @edzitron : Also, $1.30 per hour per GPU is the Microsoft discount rate for OpenA...
Anthropic releases Claude 3.7 Sonnet, a hybrid model that can produce fast responses or extended, step-by-step thinking, and Claude Code, an agentic coding tool
and it could be a game changer Ghacks : Anthropic Unveils Claude 3.7: First Hybrid Reasoning AI Model Rowan Cheung / The Rundown AI : Claude enters the reasoning era Siddharth Jindal / Analytics India...
Microsoft says it is on track to invest ~$80B in FY 2025 to build out datacenters to train AI and deploy AI and cloud apps, with 50%+ of the spending in the US
plans to spend tens of billions of dollars to build out AI data center … Rounak Jain / Benzinga : Microsoft To Invest $80 Billion In AI Data Centers For Fiscal 2025 Minh Le / Tech in Asia : Microsoft ...
OpenAI launches ChatGPT Pro, a $200/month plan with unlimited access to o1, GPT-4o, and more, plus an o1 version that uses more compute for better responses
12 Days of OpenAI: Day 1 Alan Velasco / HotHardware : OpenAI Unveils A Turbocharged $200 ChatGPT Pro Tier For AI Power Users Reece Rogers / Wired : Here's What OpenAI's $200 Monthly ChatGPT Pro Subscr...
Anthropic releases Claude 3.5 Haiku at $1 per million input tokens, up 4x from Claude 3 Haiku's 25¢ per million input tokens, and without image analysis features
Anthropic released Claude 3.5 Haiku today, a few days later than expected … X: @anthropicai : During final testing, Haiku surpassed Claude 3 Opus, our previous flagship model, on many benchmarks—at a ...