Microsoft’s new AI models signal its independence while challenging OpenAI and Google

The news: Microsoft launched three in-house AI models for speech transcription, voice generation, and image creation last week, per Venture Beat.

Microsoft’s latest in-house AI models—developed independently from OpenAI—underscore its push for greater AI independence, even as the company remains one of OpenAI’s largest investors and long‑term cloud partners.

Why it’s worth watching: The timing for this release is critical. Microsoft just had its worst quarter since 2008 as investors demand an AI spending payoff, per CNBC.

These new Microsoft models—which are available immediately through Microsoft Foundry—as well as the MAI Playground, are the company’s first response.

In a demonstration of software engineering agility, Microsoft relied on small teams of fewer than 10 engineers to develop the three models that rival or beat OpenAI and Google on key performance and output benchmarks, using half the GPU compute resources.

By the numbers:

  • MAI-Transcribe-1 beats OpenAI’s Whisper and Google’s Gemini across 25 languages with a 3.8% word error rate.
  • MAI-Voice-1 generates 60 seconds of audio per second, priced at $22 per million characters.
  • MAI-Image-2 delivers twice faster image generation, priced at $5 per million input tokens.

Zooming out: Microsoft now competes on price and efficiency while its ecosystem of productivity tools and services provides a formidable distribution advantage.

  • Microsoft has more AI in the pipeline. The company has plans for a full frontier LLM to achieve complete independence from OpenAI by 2032, per Venture Beat.
  • Its other advantages include a history of productizing tools and solutions for business and consumer markets.
  • In context, enterprise partners like WPP, one of the world's largest advertising holding companies, are already building with MAI-Image-2.

Implications for marketers: Test Microsoft’s Foundry API immediately for transcription and image generation to see if it serves creative and marketing workflows. Determine price and subscription advantages for scaled deployments.

Brands will likely re-evaluate vendor lock-in with OpenAI or Google. Model pricing could become even more competitive for enterprise adopters through bundles. Industrywise, we can expect cheaper, faster AI infrastructure from Microsoft within 12 to 18 months.

This content is part of EMARKETER’s subscription Briefings, where we pair daily updates with data and analysis from forecasts and research reports. Our Briefings prepare you to start your day informed, to provide critical insights in an important meeting, and to understand the context of what’s happening in your industry. Non-clients can click here to get a demo of our full platform and coverage.

You've read 0 of 2 free articles this month.

Get more articles - create your free account today!