Google releases the Gemini 2.5 Computer Use model, built on Gemini 2.5 Pro's capabilities to power agents that can interact with UIs, in preview via the API
Google released a new Gemini 2.5 Computer Use model today, specially designed … Carl Franzen / VentureBeat : Google's AI can now surf the web for you, click on buttons, and fill out forms with Gemini ...
Sora 2 is available for free with usage limits for users with an invite code, while ChatGPT Pro subscribers have access to the higher-quality Sora 2 Pro model
Carl Franzen / VentureBeat :
Alibaba releases Qwen-Image-Edit, the image editing version of Qwen-Image, supporting modifying Chinese and English text while preserving font, size, and style
Carl Franzen / VentureBeat :
OpenAI rolls out Connectors for Gmail, Google Calendar, and Google Contacts in ChatGPT for Pro subscribers, with Plus, Team, Enterprise, and Edu plans to follow
Release Notes Carl Franzen / VentureBeat : OpenAI adds new ChatGPT third-party tool connectors to Dropbox, MS Teams as Altman clarifies GPT-5 prioritization Rishaj Upadhyay / Windows Report : OpenAI I...
OpenAI says ChatGPT Pro users can select old models for now but plans to deprecate them in 60 days; Sam Altman says Plus users will be able to keep using GPT-4o
The “Best Friend” of Many ChatGPT Users Now Comes at a Price Jackson Chen / Engadget : OpenAI brings GPT-4o back online after users melt down over the new model Amanda Caswell / Tom's Guide : ChatGPT-...
Manus unveils Wide Research, an experimental feature that lets users on its Pro plan enlist dozens of parallelized AI agents for large-scale, high-volume tasks
Carl Franzen / VentureBeat :
Amazon launches Kiro, an IDE that aims to bridge the gap between rapidly vibe-coded prototypes and production systems with specs, testing, and documentation
A new agentic IDE that works alongside you from prototype to production — DE Tim Anderson / DEVCLASS : Hands on with Kiro, the AWS preview of an agentic AI IDE driven by specifications Forbes : AWS ...
Like its predecessor, DeepSeek-R1-0528 has an MIT License and model weights are on Hugging Face; DeepSeek API users will get to use the model at no extra cost
Carl Franzen / VentureBeat :
Anthropic is rolling out voice mode in beta for its Claude mobile apps, available in English; free users can expect 20-30 voice messages before hitting a limit
and it puts ChatGPT and Gemini on notice Moneycontrol : Anthropic makes web search free for all Claude AI users, begins voice beta testing Sweta Kumari / Business Standard : Anthropic's Claude gets vo...
Mistral launches Mistral OCR, a multimodal API that uses optical character recognition to turn complex PDF documents into Markdown files ready for LLM training
It's available via their API, or it's “available to self-host on a selective basis” … Diya Lal / Tech in Asia : Mistral launches OCR tool for fast document processing Carl Franzen / VentureBeat : Mist...