Amazon plays catch-up with new Nova AI models to generate voices and video
Published on: 2025-05-08 18:28:38
The author is a news writer fond of the electric vehicle lifestyle and things that plug in via USB-C. He spent over 15 years in IT support before joining The Verge.
Amazon is showing off new AI technology this week: a more conversational voice model meant to compete with the likes of Gemini Live and OpenAI’s Advanced Voice Mode, and an update to its video-generation model.
The new Nova Sonic voice model handles real-time speech processing and AI voice generation for conversational applications, Amazon says. Nova Sonic uses a “unified model architecture” that Amazon claims is better than other approaches that interconnect separate models to handle speech recognition, speech-to-text conversion, response generation, and then text-to-audio. Amazon says Nova Sonic can also better detect someone’s tone and deliver more natural responses.
Nova Sonic is available to try through Amazon’s Bedrock developer platform, and the company says it can be used to make things like custom ...