Google integrates Chirp 3 voice model into Vertex AI

Generative AI has largely been about text-based interfaces for creating text, images, and more. But now, voice is stepping into the spotlight, and it's coming on strong. Google just dropped some big news: starting next week, they're rolling out Chirp 3 — their latest speech-to-text and HD text-to-speech models — on their Vertex AI platform.
Just last week, Google kinda snuck in an announcement that Chirp 3 would introduce eight new voices across 31 languages. This tech is perfect for building voice assistants, whipping up audiobooks, and even creating support agents and voice-overs for videos. They shared this at an event at Google’s DeepMind offices in London.
Google's not the only one jumping on the voice AI bandwagon. Last week, Sesame, the folks behind the super-realistic AI voices "Maya" and "Miles," announced they're letting developers build their own custom apps and services using their tech.
Google's trying to keep things in check with Chirp 3 by putting some usage restrictions in place to prevent misuse. "We're just working through some of these things with our safety team," said Thomas Kurian, the CEO of Google Cloud, at a news event today.
ElevenLabs is another big player in the AI voice game, having raked in millions to grow their voice services.
With Chirp 3, Google's bringing it into the same family as their latest versions of the LLM Gemini, which are still in testing, along with their image-generation model Imagen and the pricey Veo 2 video generation tool.
It's still up in the air whether Chirp 3 will sound as "real" as some other AI voices out there (Sesame's work is pretty impressive). But as Demis Hassabis, the CEO of DeepMind, pointed out, this is a marathon, not a sprint.
"In the near term ... this idea that [AI is] a silver bullet to everything in the next couple of years, I don’t see that happening just yet. I think we’re still quite a few years away from something like AGI happening," he said. "It’s going to change things ... over the next decade, so the medium to longer term. It’s one of those interesting moments in time."
Google kicked off Vertex AI back in 2021 as a spot for developers to build machine learning services in the cloud. That was way before AI, and especially generative AI, became the hot topic it is now, thanks to OpenAI’s GPT services.
Since then, Google's been pushing Vertex AI hard, trying to keep up with the likes of Microsoft and Amazon, who are also building generative AI tools for developers. With Vertex AI, developers can not only build on top of Gemini but also classify data, train models, and get them ready for production. It'll be interesting to see if Google decides to open up their garden to models from other creators.
Google's been at this "Chirp" voice thing for years, starting way back when they used it as a code name to take on Amazon's Alexa.
Related article
Imagen 4 is Google’s newest AI image generator
Google has just unveiled its latest image-generating AI model, Imagen 4, promising users an even better visual experience than its predecessor, Imagen 3. Announced at Google I/O 20
Google's Gemini Code Assist Enhances AI Coding with New Agentic Capabilities
Gemini Code Assist, Google's AI-powered coding companion, is rolling out exciting new "agentic" features in a preview mode. At the recent Cloud Next conference, Google unveiled how
Google’s AI Futures Fund may have to tread carefully
Google’s New AI Investment Initiative: A Strategic Shift Amid Regulatory ScrutinyGoogle's recent announcement of an AI Futures Fund marks a bold move in the tech giant's ongoing qu
Comments (50)
0/200
DonaldBrown
April 10, 2025 at 12:00:00 AM GMT
Chirp 3 is a game-changer for voice AI! The integration with Vertex AI is smooth, but the HD text-to-speech is where it shines. Only wish it was a bit faster. Still, a solid step forward for voice tech!
0
PaulLopez
April 10, 2025 at 12:00:00 AM GMT
Chirp 3は音声AIのゲームチェンジャーです!Vertex AIとの統合はスムーズですが、HDテキスト読み上げが特に優れています。もう少し速ければいいのにと思います。それでも、音声技術にとって前進の一歩です!
0
CarlHill
April 10, 2025 at 12:00:00 AM GMT
Chirp 3는 음성 AI의 게임 체인저입니다! Vertex AI와의 통합이 부드럽고, HD 텍스트 음성 변환이 특히 뛰어납니다. 조금 더 빨랐으면 좋겠어요. 그래도 음성 기술에 있어 한 걸음 앞서 나갔습니다!
0
RyanLee
April 10, 2025 at 12:00:00 AM GMT
Chirp 3 é um divisor de águas para a IA de voz! A integração com o Vertex AI é suave, mas o texto para fala em HD é onde ele brilha. Só gostaria que fosse um pouco mais rápido. Ainda assim, um passo sólido para a tecnologia de voz!
0
RoyYoung
April 10, 2025 at 12:00:00 AM GMT
¡Chirp 3 es un cambio de juego para la IA de voz! La integración con Vertex AI es suave, pero donde realmente brilla es en la conversión de texto a voz en HD. Solo desearía que fuera un poco más rápido. Aún así, un paso sólido hacia adelante para la tecnología de voz!
0
MichaelAdams
April 10, 2025 at 12:00:00 AM GMT
Google's move to integrate Chirp 3 into Vertex AI is exciting! Finally, we're getting more voice-based AI tools. I'm curious to see how well it handles different accents and languages. Hope it's not just another overhyped feature that fizzles out!
0
Generative AI has largely been about text-based interfaces for creating text, images, and more. But now, voice is stepping into the spotlight, and it's coming on strong. Google just dropped some big news: starting next week, they're rolling out Chirp 3 — their latest speech-to-text and HD text-to-speech models — on their Vertex AI platform.
Just last week, Google kinda snuck in an announcement that Chirp 3 would introduce eight new voices across 31 languages. This tech is perfect for building voice assistants, whipping up audiobooks, and even creating support agents and voice-overs for videos. They shared this at an event at Google’s DeepMind offices in London.
Google's not the only one jumping on the voice AI bandwagon. Last week, Sesame, the folks behind the super-realistic AI voices "Maya" and "Miles," announced they're letting developers build their own custom apps and services using their tech.
Google's trying to keep things in check with Chirp 3 by putting some usage restrictions in place to prevent misuse. "We're just working through some of these things with our safety team," said Thomas Kurian, the CEO of Google Cloud, at a news event today.
ElevenLabs is another big player in the AI voice game, having raked in millions to grow their voice services.
With Chirp 3, Google's bringing it into the same family as their latest versions of the LLM Gemini, which are still in testing, along with their image-generation model Imagen and the pricey Veo 2 video generation tool.
It's still up in the air whether Chirp 3 will sound as "real" as some other AI voices out there (Sesame's work is pretty impressive). But as Demis Hassabis, the CEO of DeepMind, pointed out, this is a marathon, not a sprint.
"In the near term ... this idea that [AI is] a silver bullet to everything in the next couple of years, I don’t see that happening just yet. I think we’re still quite a few years away from something like AGI happening," he said. "It’s going to change things ... over the next decade, so the medium to longer term. It’s one of those interesting moments in time."
Google kicked off Vertex AI back in 2021 as a spot for developers to build machine learning services in the cloud. That was way before AI, and especially generative AI, became the hot topic it is now, thanks to OpenAI’s GPT services.
Since then, Google's been pushing Vertex AI hard, trying to keep up with the likes of Microsoft and Amazon, who are also building generative AI tools for developers. With Vertex AI, developers can not only build on top of Gemini but also classify data, train models, and get them ready for production. It'll be interesting to see if Google decides to open up their garden to models from other creators.
Google's been at this "Chirp" voice thing for years, starting way back when they used it as a code name to take on Amazon's Alexa.



Chirp 3 is a game-changer for voice AI! The integration with Vertex AI is smooth, but the HD text-to-speech is where it shines. Only wish it was a bit faster. Still, a solid step forward for voice tech!




Chirp 3は音声AIのゲームチェンジャーです!Vertex AIとの統合はスムーズですが、HDテキスト読み上げが特に優れています。もう少し速ければいいのにと思います。それでも、音声技術にとって前進の一歩です!




Chirp 3는 음성 AI의 게임 체인저입니다! Vertex AI와의 통합이 부드럽고, HD 텍스트 음성 변환이 특히 뛰어납니다. 조금 더 빨랐으면 좋겠어요. 그래도 음성 기술에 있어 한 걸음 앞서 나갔습니다!




Chirp 3 é um divisor de águas para a IA de voz! A integração com o Vertex AI é suave, mas o texto para fala em HD é onde ele brilha. Só gostaria que fosse um pouco mais rápido. Ainda assim, um passo sólido para a tecnologia de voz!




¡Chirp 3 es un cambio de juego para la IA de voz! La integración con Vertex AI es suave, pero donde realmente brilla es en la conversión de texto a voz en HD. Solo desearía que fuera un poco más rápido. Aún así, un paso sólido hacia adelante para la tecnología de voz!




Google's move to integrate Chirp 3 into Vertex AI is exciting! Finally, we're getting more voice-based AI tools. I'm curious to see how well it handles different accents and languages. Hope it's not just another overhyped feature that fizzles out!












