A study finds GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash deployed tactical nuclear weapons in 95% of 21 simulated war game scenarios, and never surrendered
Leading AIs from OpenAI, Anthropic and Google opted to use nuclear weapons in simulated war games in 95 per cent of cases
Z.ai releases GLM-4.6, an open-weights model with a context window of up to 200K tokens, claiming near parity with Claude Sonnet 4 on coding and reasoning tasks
Today, we are releasing the latest version of our flagship model: GLM-4.6. Compared with GLM-4.5, this generation brings several key improvements:
Microsoft is bringing Anthropic's Claude Sonnet 4 and Claude Opus 4.1 to Microsoft 365 Copilot, starting with Researcher and Copilot Studio
Microsoft is bringing Anthropic's Claude Sonnet 4 and Claude Opus 4.1 to Microsoft 365 Copilot users. …
Sources: Microsoft will use Anthropic's models for some AI features in its Office 365 apps, after finding Claude Sonnet 4 beats OpenAI's GPT-5 in some tasks
Microsoft moves beyond OpenAI? Igor Bonifacic / Engadget : Microsoft reportedly plans to start using Anthropic models to power some of Office 365's Copilot features
ChatGPT integration in Apple's Xcode 26 beta 7 defaults to GPT-5; developers can now also use Claude Sonnet 4 by signing into their paid Claude account
Apple has released a new beta of Xcode 26 for developers today with a pair of notable changes. There's now support for ChatGPT 5 …
Alibaba debuts the Qwen3-Coder model for agentic coding, including a 480B-parameter MoE variant, and open sources Qwen Code, a CLI tool adapted from Gemini CLI
Coco Feng / South China Morning Post : Alibaba upgrades flagship Qwen3 model to outperform OpenAI, DeepSeek in maths, c...
OpenAI announces an 80% price drop for its o3 model and a “flex” mode for synchronous processing that charges $5 for input and $20 for output per million tokens
just cheaper. https://platform.openai.com/ ... Kevin Weil / @kevinweil : Because you all asked: we're going to double the rate limits for o3 for Plus users. Rolling out as we speak. Now go do ...
Highlights from the system prompts of Claude Opus 4 and Claude Sonnet 4, including model safety, avoiding sycophancy, and not regurgitating copyrighted content
Anthropic publish most of the system prompts for their chat models as part of their release notes.
Anthropic adds “thinking summaries” to both Claude 4 models and is making its Claude Code agentic command-line tool generally available
Yesterday at Anthropic's first “Code with Claude” … Databricks : Introducing new Claude Opus 4 and Sonnet 4 models on Databricks Kahekashan / The Hans India : Anthropic Launches Claude 4 Sonnet and Op...
Anthropic's Claude 4 models support “extended thinking with tool use”, a beta feature that lets them alternate between reasoning and using tools like web search
On Thursday, Anthropic released Claude Opus 4 and Claude Sonnet 4, marking the company's return to larger model releases …