multimodal (Technology)

Apple Machine Learning Research 15 related

Apple unveils new Apple Foundation Models: two on-device models, including a 20B-parameter multimodal model called AFM 3 Core Advanced, and three cloud models

2026-06-10 View

Apple Machine Learning Research 23 related

Apple unveils new Apple Foundation Models: two on-device models, including a 20B-parameter multimodal model called AFM 3 Core Advanced, and three cloud models

Apple Machine Learning Research:

2026-06-09 View

Google Developers Blog 19 related

Google releases macOS versions of AI Edge Gallery, which lets users run open models on their devices, and AI Edge Eloquent, an on-device voice dictation app

Google DeepMind's latest open model, Gemma 4 12B, is designed to bring agentic, multimodal intelligence directly to your laptop.

2026-06-04 View

VentureBeat 27 related

Google releases Gemma 4 12B, an 11.95B-parameter unified, encoder-free open multimodal model that can run locally on devices with 16GB of VRAM or unified memory

While many AI open source model providers are pursuing larger and more powerful models, Google is still giving attention to the smaller, more local side of the market.

2026-06-04 View

VentureBeat 7 related

Google introduces Gemma 4 12B, a unified, encoder-free open multimodal model that can run locally on devices with 16GB of VRAM or unified memory

While many AI open source model providers are pursuing larger and more powerful models, Google is still giving attention to the smaller, more local side of the market.

2026-06-03 View

VentureBeat 3 related

Alibaba releases Qwen3.7-Plus, a multimodal proprietary model with a 1M-token context window, costing $2 per 1M tokens, 60% less than the text-only Qwen3.7-Max

However, like its immediate predecessor Qwen3.7-Plus is available only under a “closed” commercial license via proprietary application …

2026-06-03 View

VentureBeat 39 related

Google launches Gemini Omni, a multimodal model it says can “create anything from any input”, starting with video generation, for Google AI Plus, Pro, and Ultra

Although it was already discovered by intrepid AI power users weeks ahead of the official unveiling today at Google's annual …

2026-05-20 View

VentureBeat 31 related

Google launches the Gemini Omni multimodal model, saying it can “create anything from any input”, starting with video generation, for Google AI subscribers

Although it was already discovered by intrepid AI power users weeks ahead of the official unveiling today at Google's annual …

2026-05-19 View

SiliconANGLE 16 related

Nvidia launches Nemotron 3 Nano Omni, an open multimodal model with a 30B-A3B hybrid MoE architecture; the Nemotron 3 family saw 50M+ downloads in the past year

Nvidia Corp. today launched a powerful reasoning artificial intelligence model that unifies text, vision and speech …

2026-04-29 View

SiliconANGLE 7 related

Nvidia launches Nemotron 3 Nano Omni, an open multimodal model with a 30B-A3B hybrid MoE architecture; the Nemotron 3 family saw 50M+ downloads in the past year

Nvidia Corp. today launched a powerful reasoning artificial intelligence model that unifies text, vision and speech …

2026-04-28 View

multimodal

Patterns

Related Entities

Top Voices

Explore Further

Coverage Timeline

Apple unveils new Apple Foundation Models: two on-device models, including a 20B-parameter multimodal model called AFM 3 Core Advanced, and three cloud models

Apple unveils new Apple Foundation Models: two on-device models, including a 20B-parameter multimodal model called AFM 3 Core Advanced, and three cloud models

Google releases macOS versions of AI Edge Gallery, which lets users run open models on their devices, and AI Edge Eloquent, an on-device voice dictation app

Google releases Gemma 4 12B, an 11.95B-parameter unified, encoder-free open multimodal model that can run locally on devices with 16GB of VRAM or unified memory

Google introduces Gemma 4 12B, a unified, encoder-free open multimodal model that can run locally on devices with 16GB of VRAM or unified memory

Alibaba releases Qwen3.7-Plus, a multimodal proprietary model with a 1M-token context window, costing $2 per 1M tokens, 60% less than the text-only Qwen3.7-Max

Google launches Gemini Omni, a multimodal model it says can “create anything from any input”, starting with video generation, for Google AI Plus, Pro, and Ultra

Google launches the Gemini Omni multimodal model, saying it can “create anything from any input”, starting with video generation, for Google AI subscribers

Nvidia launches Nemotron 3 Nano Omni, an open multimodal model with a 30B-A3B hybrid MoE architecture; the Nemotron 3 family saw 50M+ downloads in the past year

Nvidia launches Nemotron 3 Nano Omni, an open multimodal model with a 30B-A3B hybrid MoE architecture; the Nemotron 3 family saw 50M+ downloads in the past year

Quarterly Coverage

Top Sources

Narrative

Key Moments

Relationships