Microsoft launches in-house AI models MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2, built by its superintelligence team, as it pursues “AI self-sufficiency”
and businesses will love it
VentureBeat Michael Nuñez
Related Coverage
- Introducing MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 in Microsoft Foundry Microsoft Tech Community · Naomi Moneypenny
- Microsoft releases 3 new models on MAI Playground TestingCatalog · Alexey Shabanov
- Microsoft takes on AI rivals with three new foundational models TechCrunch · Rebecca Szkutak
- Microsoft unveils three new AI models for speech and imaging: What they can do Digit · Ayushi Jain
- Microsoft calls it superintelligence. The spreadsheet says cost reduction. Implicator.ai · Marcus Schuler
- Microsoft builds its own AI stack to help wean it from its reliance on OpenAI Computerworld · Taryn Plumb
- Microsoft launches new high-speed voice and image models SiliconANGLE · Maria Deutscher
- The Netflix Football League Afterthoughts · M.G. Siegler
- Microsoft releases MAI-Transcribe-1, the most accurate transcription model in the world Neowin · Pradeep Viswanathan
- Microsoft's New AI Models Go Beyond Just Text CNET · Katelyn Chedraoui
- Microsoft now has an AI that can turn hours of audio into text instantly — and businesses will love it Windows Central · Kevin Okemwa
- We're bringing our growing MAI model family to every developer in Foundry, including ... • MAI-Transcribe-1, most accurate transcription model in world across 25 languages … Satya Nadella
Discussion
-
@mustafasuleyman
Mustafa Suleyman
on x
Three models. Three top-tier results. All shipped within just a few months by the @MicrosoftAI team. - MAI-Transcribe-1 dropped today, the most accurate transcription model in the world across 25 languages according to FLEURS WER benchmark. - MAI-Voice-1 sets a new standard f…
-
@angrytomtweets
@angrytomtweets
on x
Microsoft just dropped MAI-Transcribe-1, a new SOTA speech-to-text model. The model is built to deliver high quality transcription in messy, real-world environments, while remaining incredibly fast and efficient. MAI-Transcribe-1 delivers SOTA speech-to-text transcription acros…
-
@patrickmoorhead
Patrick Moorhead
on x
This is something Microsoft should be really good at. Did the foundational research on voice synthesis decades ago.
-
@amandaksilver
Amanda Silver
on x
Developers, developers, developers! Three new models from @MicrosoftAI now in Foundry: speech→text, text→speech, and text→image. Less integration tax. Build agents with voice, captions, call analytics and automate support and creative workflows! @AIFoundryDevs @MSAzureDev
-
@nandodf
Nando de Freitas
on x
Artists and creators: You can now access a growing set of the most powerful speech tools in the world for the lowest price at @Azure. With the tools, it is up to you how to create or monetise your own ideas. @LarryJackson @1benm @tapmusic You could build something like the
-
@microsoftai
@microsoftai
on x
The most accurate model across 25 languages, faster transcription speeds, and stronger performance in real-world noise. MAI-Transcribe-1 sets a new bar for speech recognition. Learn more + try it today: https://microsoft.ai/... [image]
-
@azure
@azure
on x
Today, we announced the public preview of MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 on Microsoft Foundry, bringing our first-party AI models directly into the hands of developers. Read more: https://microsoft.ai/... [video]
-
@michaelfnunez
Michael Nuñez
on x
BREAKING: Mustafa Suleyman just told me Microsoft will build a frontier large language model to compete directly with OpenAI's GPT — and revealed that until October 2025, Microsoft was contractually banned from even trying. 🔥🤖 Full story: https://venturebeat.com/... #AI #Microsof…
-
@satyanadella
Satya Nadella
on x
We're bringing our growing MAI model family to every developer in Foundry, including ... · MAI-Transcribe-1, most accurate transcription model in world across 25 languages · MAI-Voice-1, natural, expressive speech generation · MAI-Image-2, our most capable image model yet Start […
-
@nandodf
Nando de Freitas
on x
MSI is the most fun team I've ever worked with. This team ships. This team creates. This team innovates. This team believes in work-life balance, and none of that 70 hour or 996 bullsh*t We must build AI responsibly and sustainably, put users first, put our teams first, put
-
@microsoftai
@microsoftai
on x
MAI-Transcribe-1 makes speech-to-text clearer, faster, and more reliable even in noisy audio. Ranked #1 on the industry-standard FLEURS word error rate benchmark. Now in public preview. Learn more: https://microsoft.ai/... [video]
-
@mustafasuleyman
Mustafa Suleyman
on x
One place MAI-Image-2 really knocks it out of the park is surrealist images. Try this one: Close-up zoomed in macro photo of a bright orange clownfish hiding among stark white peonies with bright yellow stamens. High contrast, shallow depth of field, vibrant wildlife [image]
-
@mustafasuleyman
Mustafa Suleyman
on x
Been awesome to have MAI-Image-2 out in the world and see people's creations. Wanted to start sharing some favorite prompts the team has come up with so you can test them out for yourself 👀 Will keep adding to this (and share yours too)