Microsoft has released three artificial intelligence models developed under its MAI research unit, marking a significant step in the company's ambition to build its own AI capabilities independently of its partnership with OpenAI.
The models, MAI-Transcribe-1, MAI-Voice-1 and MAI-Image-2, are now available on Microsoft Foundry, the company's platform for deploying AI models, with the transcription and voice tools also accessible via MAI Playground.
The MAI Superintelligence team, led by Mustafa Suleyman, developed all three models as part of what Microsoft describes as a multimodal model stack, covering audio, speech and image generation.
MAI-Transcribe-1 is a speech transcription model supporting 25 languages that Microsoft says runs 2.5 times faster than its existing Azure Fast option, priced from $0.36 per hour.
MAI-Voice-1 generates audio output, producing 60 seconds of audio in one second and enabling users to create custom voices; it is priced from $22 per one million characters.
MAI-Image-2, an image-generation model that had previously appeared on MAI Playground, is priced at $5 per one million tokens for text input and $33 per one million tokens for image output.
Suleyman described the models as reflecting a "humanist AI" philosophy, centred on practical communication and human-centred design.
The launch comes as Microsoft seeks to develop its own AI research capabilities while maintaining its partnership with OpenAI, the ChatGPT maker in which Microsoft has invested more than $13 billion.
Suleyman told technology news site The Verge that a renegotiation of the OpenAI alliance had given Microsoft the freedom to pursue its own superintelligence research, while separately reaffirming the partnership's importance in an interview with VentureBeat.
"You'll see more models from us soon in Foundry and directly in Microsoft products and experiences," Suleyman said.
The recap
- Microsoft releases three MAI foundation models covering transcription, voice and image generation.
- MAI-Transcribe-1 supports 25 languages and runs 2.5× faster than Azure's Fast transcription option.
- Company says more models will appear in Foundry and products.