2025-10-21
The Decoder
12 related
DeepSeek releases DeepSeek-OCR, a vision language model designed for efficient vision-text compression, enabling longer contexts with less compute
the new frontier of OCR from @deepseek_ai , exploring optical context compression for LLMs, is running blazingly fast on vLLM ⚡ (~2500 tokens/s on A100-40G) — powered by vllm==0.8.5 for day-0 model su...
2025-09-02
The Decoder
6 related
Tencent open sources translation models Hunyuan-MT-7B and Hunyuan-MT-Chimera-7B, which support 33 languages, claiming they beat established models in benchmarks
Tencent 5.61k — Translation Transformers Safetensors hunyuan_v1_dense text-generation Ben Jiang / South China Morning Post : Tencent's open-source translation model beats Google, OpenAI in top globa...
2024-12-26
Qwen
7 related
Alibaba releases QvQ-72B-Preview, an experimental research model focused on “enhancing visual reasoning capabilities”, built on Qwen2-VL-72B
QVQ-72B-Preview is an experimental research model developed by the Qwen team … QwenLM on GitHub : Qwen2-VL — Introduction After a year's relentless efforts, today we are thrilled to release Qwen2-VL...
Loading articles...