OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost
OpenAI eyes January exit from “code red” John Werner / Forbes : The Wonder And The Promise Of GPT 5.2 Is Here Benj Edwards / Ars Technica : OpenAI releases GPT-5.2 after “code red” Google threat alert...
DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
the new frontier of OCR from @deepseek_ai, exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model su...
Andrej Karpathy unveils nanochat, a full-stack training and inference implementation of an LLM in a single, dependency-minimal codebase, deployable in 4 hours
It provides a full ChatGPT-style LLM, including training, inference and a web UI … X: Clem / @clementdelangue : Am I wrong in sensing a paradigm shift in AI? Feels like we're moving from a world obses...
Mira Murati's Thinking Machines Lab launches its first product, Tinker, an API for fine-tuning language models, in private beta, with support for Qwen and Llama
Today, we are launching Tinker, a flexible API for fine-tuning language models. Moneycontrol : Ex-OpenAI CEO Mira Murati stealth AI lab launches its first ever product Matthias Bastian / The Decoder :...
Tencent open sources translation models Hunyuan-MT-7B and Hunyuan-MT-Chimera-7B, which support 33 languages, claiming they beat established models in benchmarks
Ben Jiang / South China Morning Post : Tencent's open-source translation model beats Google, OpenAI in top globa...
OpenAI says GPT-5 is a unified system with an efficient model for most questions, a reasoning model for harder problems, and a router that decides which to use
All You Need To Know Lakshay Kumar / Business Today : What is GPT-5? How OpenAI is upgrading your ChatGPT experience Tsveta Ermenkova / PhoneArena : You can now chat with a PhD-level AI that knows whe...
Google DeepMind says Gemini Diffusion, an experimental text diffusion model demoed at Google I/O and available by waitlist, generates 1,000-2,000 tokens/second
Our state-of-the-art, experimental text diffusion model Jose Antonio Lanz / Decrypt : Google Doubles Down on AI: Veo 3, Imagen 4 and Gemini Diffusion Push Creative Boundaries Matthias Bastian / The De...
Sam Altman says OpenAI plans to “release a powerful new open-weight language model with reasoning in the coming months”, its first open-weight model since GPT-2
just look at the “T” in ChatGPT, which comes from the Transformer architecture openly shared by Google. Then came Garry Tan / @garrytan : Open weights 🚀 Alexander Doria / @dorialexander : Ok, this one...
Browser Use, whose open-source tool converts website elements into a “text-like” format that AI agents can better understand, raised a $17M seed led by Felicis
They've raised $17M in seed funding and I'm curious about the business model as it's Open Source and anyone can host it. Mastodon: Dare Obasanjo / @carnage4life@mas.to : Browser Use extracts the struc...
A look at Manus, which its Chinese creators claim is the world's first fully autonomous AI agent, as some say it might be China's second DeepSeek moment
Manus is a general AI agent that bridges minds and actions: it doesn't just think, it delivers results. Luiza Jarovsky / Luiza's Newsletter : ✋ Manus AI: Why Everyone Should Worry ecns : ECNS Wire — ...