Sources: DeepSeek plans to release its multimodal model V4 next week and worked with Huawei and Chinese AI chipmaker Cambricon to optimize V4 for their products
Sources: DeepSeek did not share its upcoming V4 model with US chipmakers, including AMD and Nvidia, but granted early access to Chinese companies like Huawei
DeepSeek, the Chinese artificial intelligence lab whose low-cost model rattled global markets last year, has not shown U.S. chipmakers …
Sources: DeepSeek did not share its upcoming V4 model with US chipmakers, including AMD and Nvidia, but granted early access to Chinese companies like Huawei
DeepSeek, the Chinese artificial intelligence lab whose low-cost model rattled global markets last year, has not shown U.S. chipmakers …
Sources say DeepSeek will launch V4, its next-generation model, in the coming weeks and say it outperformed Anthropic's Claude and OpenAI's GPT series in coding
Chinese AI startup DeepSeek is expected to launch its next-generation AI model that features strong coding capabilities in the coming weeks …
Sources say DeepSeek will launch V4, its next-generation model, in the coming weeks and say it outperformed Anthropic's Claude and OpenAI's GPT series in coding
Chinese AI startup DeepSeek is expected to launch its next-generation AI model that features strong coding capabilities in the coming weeks …
DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model su...
Mastodon plans to roll out quote posts next week, with safety features that give users several ways to control how their posts can be quoted, to avoid “dunking”
techcrunch.com/2025/09/12/m... Aram Sinnreich / @aramsinn : The inability to quote post on Mastodon is one of the main reasons my post volume is higher on Bluesky. Seems like that's about to end. — ...
DeepSeek details V3.1 and says it surpasses R1 on key benchmarks and is customized to work with next-gen Chinese-made AI chips, after unveiling it on August 19
Introducing DeepSeek-V3.1: our first step toward the agent era! 🚀 Tobias Mann / The Register : DeepSeek's new V3.1 release points to potent new Chinese chips coming soon Hugging Face : DeepSeek-V3.1 ...
OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller gpt-oss-20b can run locally on a device with 16GB+ of RAM
gpt-oss-120b and gpt-oss-20b push the frontier of open-weight reasoning models Simon Willison / Simon Willison's Weblog : OpenAI's new open weight (Apache 2) models are really good OpenAI on GitHub : ...
Meta VP of Generative AI Ahmad Al-Dahle denies a rumor that the company trained Llama 4 Maverick and Scout on test sets, saying that Meta “would never do that”
but the EU doesn't get everything Pascale Davies / Euronews : From a political shift to a more powerful AI: Everything to know about Meta's Llama 4 models Jay Bonggolto / Android Central : Meta is com...