Microsoft introduces a pair of in-house AI models

Microsoft is expanding its AI footprint with the release of two new models that its teams trained completely in-house. MAI-Voice-1 is the company's first natural speech generation model, while MAI-1-preview is a text-based model and its first foundation model trained end-to-end. MAI-Voice-1 is currently being used in the Copilot Daily and Podcast features. Microsoft has made MAI-1-preview available for public testing on LMArena, and will begin previewing it for select Copilot use cases in the coming weeks.

In an interview with Semafor, Microsoft AI division leader Mustafa Suleyman said the pair of models was developed with a focus on efficiency and cost-effectiveness. MAI-Voice-1 runs on a single GPU, and MAI-1-preview was trained on about 15,000 Nvidia H100 GPUs. For context, other models, such as xAI's Grok, took more than 100,000 of those chips to train. "Increasingly, the art and craft of training models is selecting the perfect data and not wasting any of your flops on unnecessary tokens that didn’t actually teach your model very much," Suleyman said.