DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model su...
Baidu releases Ernie X1, an AI model that articulates its reasoning similarly to DeepSeek R1, and upgrades its flagship foundation model to Ernie 4.5, both free
And Costs Half As Much: ‘Excellent Multimodal Understanding Ability’ Nisha Gopalan / Yahoo Finance : China's Baidu Takes on DeepSeek With New AI Model Mike Wheatley / SiliconANGLE : Baidu debuts its f...
Alibaba releases 32.5B-parameter QwQ-32B-Preview under Apache 2.0 and claims the “reasoning” AI model beats OpenAI's o1-preview on the AIME and MATH tests
Introduction QwQ-32B-Preview is an experimental research model developed … Ananya Gairola / Benzinga : Alibaba's New AI Model Outperforms OpenAI's o1 In Specific Benchmarks, Now Available For Free Dow...