OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost
OpenAI eyes January exit from “code red”; John Werner / Forbes : The Wonder And The Promise Of GPT 5.2 Is Here; Benj Edwards / Ars Technica : OpenAI releases GPT-5.2 after “code red” Google threat alert...
DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
The new frontier of OCR from @deepseek_ai, exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G), powered by vllm==0.8.5 for day-0 model su...
Andrej Karpathy unveils nanochat, a full-stack training and inference implementation of an LLM in a single, dependency-minimal codebase, deployable in 4 hours
It provides a full ChatGPT-style LLM, including training, inference and a web UI … X: Clem / @clementdelangue : Am I wrong in sensing a paradigm shift in AI? Feels like we're moving from a world obses...
Mira Murati's Thinking Machines Lab launches its first product, Tinker, an API for fine-tuning language models, in private beta, with support for Qwen and Llama
Today, we are launching Tinker, a flexible API for fine-tuning language models. Moneycontrol : Ex-OpenAI CEO Mira Murati stealth AI lab launches its first ever product; Matthias Bastian / The Decoder :...
Tencent open sources translation models Hunyuan-MT-7B and Hunyuan-MT-Chimera-7B, which support 33 languages, claiming they beat established models in benchmarks
Ben Jiang / South China Morning Post : Tencent's open-source translation model beats Google, OpenAI in top globa...
OpenAI says GPT-5 is a unified system with an efficient model for most questions, a reasoning model for harder problems, and a router that decides which to use
All You Need To Know; Lakshay Kumar / Business Today : What is GPT-5? How OpenAI is upgrading your ChatGPT experience; Tsveta Ermenkova / PhoneArena : You can now chat with a PhD-level AI that knows whe...
Sources: Meta is looking to raise $3B in equity and $26B in debt from private capital firms, including Apollo Global and KKR, to fund its data center buildout
www.ft.com/content/aff1... @ivanthek : WTF? This is not exactly a vote of confidence in their own strategy. www.ft.com/content/aff1... X: Luke Kawa / @ljkawa : Private credit and AI, ...
Google DeepMind says Gemini Diffusion, an experimental text diffusion model demoed at Google I/O and available by waitlist, generates 1,000-2,000 tokens/second
Our state-of-the-art, experimental text diffusion model; Jose Antonio Lanz / Decrypt : Google Doubles Down on AI: Veo 3, Imagen 4 and Gemini Diffusion Push Creative Boundaries; Matthias Bastian / The De...
OpenAI says its new o3 and o4-mini AI models hallucinate more often than its previous reasoning and traditional models, and the company doesn't know why
OpenAI's internal tests show o3 hallucinated on 33% of person-related questions, double the rate of previous models. Even worse, o4-mini hit 48%. Mastodon: Aulia Masna / @aulia@mementomori.social : “...
Source: OpenAI is in talks to acquire Windsurf, an AI coding tool formerly known as Codeium, for ~$3B; Windsurf was valued at $1.25B in a 2024 funding deal
they are the black hole of Startups; People still don't get it; Evil institution; Deedy / @deedydas : @EMostaque Perhaps the time (say even 6mos) it takes to build and grow a Windsurf like product and GT...