AI coding agents made a huge leap forward since December, completing complex projects with minimal oversight, meaning “programming is becoming unrecognizable”
AI coding agents made a huge leap forward since December, completing complex projects with minimal oversight, meaning “programming is becoming unrecognizable”
It is hard to communicate how much programming has changed due to AI in the last 2 months: not gradually and over time in the …
Google unveils Gemini 3 Flash, which it says has Pro-grade reasoning with lower latency, outperforming 2.5 Pro “while being 3x faster at a fraction of the cost”
Following last month's launch of Gemini 3 Pro, Google today announced Gemini 3 Flash for consumers and developers.
Google unveils Gemini 3, its “most intelligent” and “factually accurate” model yet, with improvements across coding and reasoning, and offering less “flattery”
The flagship Gemini 3 Pro model is coming to the Gemini app and Search, with improvements across coding, reasoning, and less ‘flattery.’
Samsung introduces the Tiny Recursion Model, a 7M-parameter model that can outperform LLMs 10,000x larger, like Gemini 2.5 Pro and o3-mini, on specific problems
The trend of AI researchers developing new, small open source generative models that outperform far larger …
OpenAI announces apps that work inside ChatGPT, piloting Booking.com, Canva, Coursera, Figma, Expedia, Spotify, and Zillow for logged-in users outside of the EU
A new generation of apps you can chat with and the tools for developers to build them. — Try in ChatGPT(opens in a new window)Start building apps(opens in a new window)
OpenAI launches AgentKit, a toolkit for building and deploying AI agents, including Agent Builder, which Sam Altman described as like Canva for building agents
New tools for building, deploying, and optimizing agents. NDTV Profit : What Is AI Agent Builder And How Does It Work? OpenAI Launches New Set Of Tools For Developers Aman Gupta / ...
Alibaba releases the Qwen3-VL vision models, the Qwen3Guard “safety moderation” models, and three closed-weight models, including Qwen3-Max with 1T+ parameters
Qwen 50.6k — Safetensors qwen3_vl_moe Julian Nabil / Forbes Middle East : Alibaba Introduces Qwen3-Max AI Model With Over 1T Parameters Markus Kasanmascheff / WinBuzzer : Alibaba...
Alibaba's Hong Kong-listed shares hit a nearly four-year high after CEO Eddie Wu announced plans to increase AI spending beyond the $53B target over three years
Alibaba Group Holding Ltd.'s shares surged to their highest in nearly four years after revealing plans to ramp up AI spending past …
OpenAI rolls out Codex, an AI coding agent powered by codex-1, a version of o3 optimized for software engineering, for ChatGPT Pro, Enterprise, and Team users
OpenAI announced on Friday it's launching a research preview of Codex, the company's most capable AI coding agent yet.
OpenAI rolls out Codex, an AI coding agent powered by codex-1, a version of o3 optimized for software engineering, for ChatGPT Pro, Enterprise, and Team users
OpenAI announced on Friday it's launching a research preview of Codex, the company's most capable AI coding agent yet.
Meta VP of Generative AI Ahmad Al-Dahle denies a rumor that the company trained Llama 4 Maverick and Scout on test sets, saying that Meta “would never do that”
but the EU doesn't get everything Pascale Davies / Euronews : From a political shift to a more powerful AI: Everything to know about Meta's Llama 4 models Jay Bonggolto / Android C...
LMArena says it is updating its leaderboard policies after a Llama 4 Maverick version, which Meta said in fine print is not public, secured the number two spot
With Llama 4, Meta fudged benchmarks to appear as though its new AI model is better than the competition.
LLMs, unlike prior transformative tech like the internet, disproportionately benefit regular people, with a muted, slower impact on corporations and governments
> They had to permit personal devices to be used work, but wanted to see enough justification to do that = catch-22. Took a while before an employee could use gmail, calendar, slac...
Meta launches Llama 4 Maverick with 400B parameters and Scout with 109B parameters and a 10M context window, and previews Behemoth with 2T total parameters
Takeaways — We're sharing the first models in the Llama 4 herd, which will enable people to build more personalized multimodal experiences.
Google says it is rolling out Gemini 2.5 Pro Experimental to all Gemini users, after initially launching the model for Gemini Advanced subscribers on March 25
No Subscription Needed The Economic Times : Google rolls out experimental version of Gemini 2.5 Pro for free users The Hindu : Google's Gemini 2.5 Pro rolled out to all users days ...
OpenAI debuts a Responses API to help developers build agents that search the web, scan for files, and perform tasks on PCs, and an Agents SDK for orchestration
But really, why isn't OpenAI building their own agents if their tech is so powerful? [embedded post] X: Atty Eleti / @athyuttamre : Introducing the Responses API: the new primitive...
xAI unveils DeepSearch, a reasoning chatbot that explains its thought process for queries and is capable of doing research, brainstorming, and data analysis
Elon Musk's artificial intelligence startup xAI showed off the updated Grok-3 model, showcasing a version of the chatbot technology …
xAI launches Grok-3 beta and Grok-3 mini, its latest AI models with reasoning, trained on 200K GPUs, or “10x” more compute than Grok-2, for X Premium+ users
Elon Musk's AI company, xAI, late on Monday released its latest flagship AI model, Grok 3, and unveiled new capabilities for the Grok iOS and web apps.
Stanford and University of Washington AI researchers claim they trained AI reasoning model s1, distilled from a Gemini 2.0 model, for under $50 in cloud compute
AI researchers at Stanford and the University of Washington were able to train an AI “reasoning” model for under $50 in cloud compute credits …