Amazon launches Nova AI models for voice and video generation to compete

This week, Amazon unveiled groundbreaking AI innovations featuring a conversational voice assistant designed to rival offerings like Gemini Live and OpenAI’s Advanced Voice Mode, along with enhanced video generation capabilities.
The Nova Sonic voice model delivers real-time speech processing and AI-generated responses for natural conversations, according to Amazon. Unlike traditional multi-model architectures, Nova Sonic operates on a unified system that streamlines speech recognition, text conversion, response generation, and audio synthesis. Amazon emphasizes its superior ability to interpret vocal nuances and produce lifelike interactions.
Currently available via Amazon Bedrock for developers, Nova Sonic powers applications ranging from customer service chatbots to specialized AI assistants in healthcare, education, and travel sectors. Rohit Prasad, Amazon's SVP and AGI lead, confirmed to TechCrunch that core Nova Sonic technology already enhances the new Alexa Plus assistant.
On the video front, Amazon introduced Nova Reel 1.1 with significant quality upgrades and reduced latency compared to its predecessor. The system now maintains visual coherence across six-second scene segments, enabling seamless two-minute video production.
Related article
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
YouTube expands AI deepfake detection to politicians, government officials, and journalists
On Tuesday, YouTube announced it is expanding its deepfake detection technology to a select group of government officials, political candidates, and journalists. The tool identifies AI-generated likenesses and lets pilot participants request the remo
The Real Difference: Not One Thing, but Another
Sometimes, things are not only one thing but also another. The phrase "It's not just this — it's that" has become so common in AI-generated writing that it now serves as more than a hint of synthetic content — it's nearly a certainty.That's why, when
Related Special Topic Recommendations
Comments (1)
0/500

This week, Amazon unveiled groundbreaking AI innovations featuring a conversational voice assistant designed to rival offerings like Gemini Live and OpenAI’s Advanced Voice Mode, along with enhanced video generation capabilities.
The Nova Sonic voice model delivers real-time speech processing and AI-generated responses for natural conversations, according to Amazon. Unlike traditional multi-model architectures, Nova Sonic operates on a unified system that streamlines speech recognition, text conversion, response generation, and audio synthesis. Amazon emphasizes its superior ability to interpret vocal nuances and produce lifelike interactions.
Currently available via Amazon Bedrock for developers, Nova Sonic powers applications ranging from customer service chatbots to specialized AI assistants in healthcare, education, and travel sectors. Rohit Prasad, Amazon's SVP and AGI lead, confirmed to TechCrunch that core Nova Sonic technology already enhances the new Alexa Plus assistant.
On the video front, Amazon introduced Nova Reel 1.1 with significant quality upgrades and reduced latency compared to its predecessor. The system now maintains visual coherence across six-second scene segments, enabling seamless two-minute video production.
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
YouTube expands AI deepfake detection to politicians, government officials, and journalists
On Tuesday, YouTube announced it is expanding its deepfake detection technology to a select group of government officials, political candidates, and journalists. The tool identifies AI-generated likenesses and lets pilot participants request the remo
The Real Difference: Not One Thing, but Another
Sometimes, things are not only one thing but also another. The phrase "It's not just this — it's that" has become so common in AI-generated writing that it now serves as more than a hint of synthetic content — it's nearly a certainty.That's why, when





Home






