2026-04-25
TechCrunch
1 related
LinkedIn profile review shows Thinking Machines Lab has been hiring more researchers from Meta than from any other employer; TML's headcount now stands at ~140
Weiyao Wang spent eight years at Meta — his first job out of college — helping build multimodal perception systems and contributing …
2026-03-31
The Information
6 related
Alibaba's new Qwen3.5-Omni multimodal model, which processes text, audio, images, and video, is proprietary, marking a shift away from its open-source strategy
Alibaba Group has released the new generation of its large language model that can understand text, audio, images and video.
2026-03-17
Mistral AI
13 related
Mistral releases Small 4, its first model to unify the reasoning, multimodal, and coding capabilities of its flagship Magistral, Pixtral, and Devstral models
Today, we are announcing Mistral Small 4. This model is the next major release in the Mistral Small family.
2026-02-28
Financial Times
1 related
Sources: DeepSeek plans to release its multimodal model V4 next week and worked with Huawei and Chinese AI chipmaker Cambricon to optimize V4 for their products
2026-02-17
Reuters
14 related
Alibaba debuts Qwen3.5, a 397B-parameter open-weight multimodal AI model that it says is 60% cheaper to use and 8x better at large workloads than Qwen3
2024-07-13
The Information
1 related
Source: Meta plans to release the largest version of its Llama 3 model, expected to have 405B parameters and multimodal capabilities, on July 23
2024-05-22
Windows Central
20 related
Microsoft ships Azure AI Studio in general availability, adds support for OpenAI's GPT-4o, and announces a new multimodal model in its lightweight Phi-3 family
2024-05-15
Engadget
5 related
Google announces Gemini 1.5 Flash, which is more lightweight and cheaper than Gemini Pro, but has the same multimodal capabilities and 1M-token context window
2024-04-15
VentureBeat
15 related
Elon Musk's xAI previews Grok-1.5 Vision, its first multimodal model, and says the AI model will be available soon to “early testers and existing Grok users”
2023-12-26
VentureBeat
13 related
In October, researchers from Apple and Columbia University released Ferret, an open-source multimodal LLM that can recognize and describe any shape in an image