Microsoft launches in-house AI models MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2, built by its superintelligence team, as it pursues “AI self-sufficiency”
Microsoft on Wednesday launched three new foundational AI models it built entirely in-house — a state …
VentureBeat · Michael Nuñez
Related Coverage
- Microsoft Unveils MAI-Transcribe-1, Its Own Speech-to-Text Model The AI Economy · Ken Yeung
- Microsoft Releases AI Models for Transcription, Speech, Image Generation The Information · Aaron Holmes
- Microsoft's New AI Models Go Beyond Just Text CNET · Katelyn Chedraoui
- Microsoft launches ‘mid-class’ AI model as compute limits bite Financial Times
- Microsoft (MSFT) Unveils Three Proprietary AI Models in Major Strategic Shift Blockonomi · Trader Edge
- Microsoft now has an AI that can turn hours of audio into text instantly — and businesses will love it Windows Central · Kevin Okemwa
- We're bringing our growing MAI model family to every developer in Foundry, including ... • MAI-Transcribe-1, most accurate transcription model in world across 25 languages … Satya Nadella
- Microsoft takes on AI rivals with three new foundational models TechCrunch · Rebecca Szkutak
- Microsoft launches 3 AI models for transcription, image, and speech generation The Economic Times
- Microsoft Builds Its Own AI Model Stack To Reduce OpenAI Dependence Forbes · Mustafa Suleyman
- Microsoft's MAI-Transcribe-1 runs 2.5x faster than its predecessor at $0.36 per audio hour The Decoder · Matthias Bastian
- Microsoft releases MAI-Transcribe-1, the most accurate transcription model in the world Neowin · Pradeep Viswanathan
- Introducing MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 in Microsoft Foundry Microsoft Tech Community · Naomi Moneypenny
- Microsoft launches new high-speed voice and image models SiliconANGLE · Maria Deutscher
- Microsoft aims to create large cutting-edge AI models by 2027 Australian Financial Review · Matt Day
- Microsoft Releases AI Models for Transcription, Voice and Image Generation Dow Jones Newswires · Kelly Cloonan
- Microsoft building its own high-powered AI models as it looks to slash dependence on OpenAI Yahoo Finance · Daniel Howley
- Microsoft shivs OpenAI with three new AI models for speech and images The Register · Thomas Claburn
- Microsoft Declares ‘AI Self-Sufficiency’ with Launch of In-House Frontier Models Techstrong.ai · Jon Swartz
- Microsoft Launches 3 New AI Models in a Direct Challenge to OpenAI and Google Gadget Review · Rex Freiberger
- Microsoft releases new AI models to expand further beyond OpenAI GeekWire · Todd Bishop
- The Netflix Football League Afterthoughts · M.G. Siegler
Discussion
- Satya Nadella (@satyanadella) on X: We're bringing our growing MAI model family to every developer in Foundry, including ... · MAI-Transcribe-1, most accurate transcription model in world across 25 languages · MAI-Voice-1, natural, expressive speech generation · MAI-Image-2, our most capable image model yet Start […
- @microsoftai on X: MAI-Transcribe-1 makes speech-to-text clearer, faster, and more reliable even in noisy audio. Ranked #1 on the industry-standard FLEURS word error rate benchmark. Now in public preview. Learn more: https://microsoft.ai/... [video]
- @microsoftai on X: The most accurate model across 25 languages, faster transcription speeds, and stronger performance in real-world noise. MAI-Transcribe-1 sets a new bar for speech recognition. Learn more + try it today: https://microsoft.ai/... [image]
- Mustafa Suleyman (@mustafasuleyman) on X: One place MAI-Image-2 really knocks it out of the park is surrealist images. Try this one: Close-up zoomed in macro photo of a bright orange clownfish hiding among stark white peonies with bright yellow stamens. High contrast, shallow depth of field, vibrant wildlife [image]
- Mustafa Suleyman (@mustafasuleyman) on X: Three models. Three top-tier results. All shipped within just a few months by the @MicrosoftAI team. - MAI-Transcribe-1 dropped today, the most accurate transcription model in the world across 25 languages according to FLEURS WER benchmark. - MAI-Voice-1 sets a new standard for […
- Mustafa Suleyman (@mustafasuleyman) on X: Been awesome to have MAI-Image-2 out in the world and see people's creations. Wanted to start sharing some favorite prompts the team has come up with so you can test them out for yourself 👀 Will keep adding to this (and share yours too)
- Nando de Freitas (@nandodf) on X: Artists and creators: You can now access a growing set of the most powerful speech tools in the world for the lowest price at @Azure. With the tools, it is up to you how to create or monetise your own ideas. @LarryJackson @1benm @tapmusic You could build something like the
- Patrick Moorhead (@patrickmoorhead) on X: This is something Microsoft should be really good at. Did the foundational research on voice synthesis decades ago.
- @angrytomtweets on X: Microsoft just dropped MAI-Transcribe-1, a new SOTA speech-to-text model. The model is built to deliver high quality transcription in messy, real-world environments, while remaining incredibly fast and efficient. MAI-Transcribe-1 delivers SOTA speech-to-text transcription [video]
- Amanda Silver (@amandaksilver) on X: Developers, developers, developers! Three new models from @MicrosoftAI now in Foundry: speech→text, text→speech, and text→image. Less integration tax. Build agents with voice, captions, call analytics and automate support and creative workflows! @AIFoundryDevs @MSAzureDev
- Nando de Freitas (@nandodf) on X: MSI is the most fun team I've ever worked with. This team ships. This team creates. This team innovates. This team believes in work-life balance, and none of that 70 hour or 996 bullsh*t We must build AI responsibly and sustainably, put users first, put our teams first, put
- @azure on X: Today, we announced the public preview of MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 on Microsoft Foundry, bringing our first-party AI models directly into the hands of developers. Read more: https://microsoft.ai/... [video]
- Michael Nuñez (@michaelfnunez) on X: BREAKING: Mustafa Suleyman just told me Microsoft will build a frontier large language model to compete directly with OpenAI's GPT — and revealed that until October 2025, Microsoft was contractually banned from even trying. 🔥🤖 Full story: https://venturebeat.com/... #AI #Microsof…
- Gary Marcus (@garymarcus) on X: Talk about moving goalposts. This one from MSFT's Suleyman may well take the cake. “Superintelligence” just went from intelligence beyond all humans to merely “delivering product value”. 🙄
- Prinz (@deredleritt3r) on X: Microsoft can pursue superintelligence independently after OpenAI declares AGI, using AGI's weights as the starting point. Fully automated AI research, combined with the sheer amount of compute available to Microsoft, could suddenly turn it into a frontier lab. [image]
- Shakeel (@shakeelhashim) on X: New terrible definition of superintelligence just dropped: [image]
- Tom Warren (@tomwarren) on X: Microsoft's new “superintelligence” game plan is all about business. It's launching a new transcription model today that is a step towards those goals, says Microsoft AI CEO Mustafa Suleyman https://www.theverge.com/...
- Hayden Field (@haydenfield) on X: Mustafa Suleyman says renegotiating Microsoft's contract with OpenAI “unlocked [Microsoft's] ability to pursue superintelligence.” Though his new JD at Microsoft AI was only made public last month, he'd been preparing for the transition for 6-9 months. https://www.theverge.com/..…
- @zephyr_z9 on X: ok, so that's why the news about “Microsoft will release strong models in 2027” came out Looks like OAI is on course to hit AGI by 2027
- Hayden Field (@haydenfield) on Bluesky: Mustafa Suleyman says renegotiating Microsoft's contract with OpenAI “unlocked [Microsoft's] ability to pursue superintelligence.” Though his new job description at Microsoft AI was only made public last month, he'd been preparing for the transition for 6-9 months. www.theverge.…