option
Home News OpenAI Yet to Release Voice Cloning Tool a Year Later

OpenAI Yet to Release Voice Cloning Tool a Year Later

release date release date April 21, 2025
Author Author AnthonyHernández
views views 25

OpenAI's Voice Engine: A Long-Awaited Release?

Late last March, OpenAI introduced a "small-scale preview" of its AI service, Voice Engine, which promised to clone a person's voice using just 15 seconds of speech. Fast forward a year, and the tool is still in preview mode, with no clear timeline for a full launch—or even confirmation that it will ever see the light of day.

The hesitation to roll out Voice Engine widely could stem from concerns about misuse, or perhaps an attempt to sidestep regulatory scrutiny. OpenAI has faced criticism in the past for prioritizing flashy products over safety and for rushing to market ahead of competitors.

An OpenAI spokesperson told TechCrunch that the company is still testing Voice Engine with a select group of "trusted partners." "We're learning from how our partners are using the technology to enhance the model's utility and safety," the spokesperson explained. "It's been exciting to see its applications, ranging from speech therapy and language learning to customer support, video game characters, and AI avatars."

Voice Engine: The Journey So Far

Voice Engine, which drives the voices in OpenAI's text-to-speech API and ChatGPT's Voice Mode, creates remarkably natural-sounding speech that closely mimics the original speaker. It converts text into speech, constrained only by certain content guidelines. However, the rollout has been plagued by delays and shifting release dates from the start.

In a June 2024 blog post, OpenAI detailed how the Voice Engine model learns to predict the sounds a speaker would likely make for a given text, considering various voices, accents, and speaking styles. This allows the model not just to generate speech from text but also to produce "spoken utterances" that reflect how different speakers would voice the text aloud.

Originally, Voice Engine, then called Custom Voices, was set to join OpenAI's API on March 7, 2024, according to a draft blog post seen by TechCrunch. The plan was to initially offer access to up to 100 "trusted developers," prioritizing those developing apps with social benefits or showing innovative and responsible use of the technology. OpenAI had already trademarked the service and set pricing at $15 per million characters for "standard" voices and $30 per million characters for "HD quality" voices.

But at the last moment, the announcement was delayed. A few weeks later, OpenAI unveiled Voice Engine without a sign-up option, limiting access to a small group of developers they had been working with since late 2023.

"We hope to start a dialogue on the responsible deployment of synthetic voices and how society can adapt to these new capabilities," OpenAI stated in the late March 2024 announcement blog post. "Based on these conversations and the results of these small-scale tests, we will make a more informed decision about whether and how to deploy this technology at scale."

A Long Development Road

Voice Engine has been in development since 2022, with OpenAI showcasing its potential—and risks—to global policymakers in the summer of 2023. Today, several partners have access to Voice Engine, including startup Livox, which aims to help people with disabilities communicate more naturally. However, Livox CEO Carlos Pereira noted that they couldn't integrate Voice Engine into their products because it requires an internet connection, which many of their customers lack. "The quality of the voice and the ability to have the voices speak in different languages is unique—especially for our customers with disabilities," Pereira told TechCrunch via email. "It's really the most impressive and easy-to-use tool to create voices that I've seen... We hope that OpenAI develops an offline version soon."

Pereira has not received any indication from OpenAI about a potential launch date or plans to charge for the service, and so far, Livox has not had to pay for its usage.

In a June 2024 post, OpenAI suggested that one reason for delaying Voice Engine was the potential for abuse during the U.S. election cycle. The company has implemented safety measures, including watermarking to trace the origin of generated audio. Developers must obtain "explicit consent" from the original speaker and make "clear disclosures" to their audience that voices are AI-generated. However, OpenAI has not detailed how these policies will be enforced at scale, which could be a significant challenge.

OpenAI also hinted at building a "voice authentication experience" to verify speakers and a "no-go" list to prevent the creation of voices resembling prominent figures. These are ambitious projects, and any missteps could further damage OpenAI's reputation regarding safety initiatives.

Effective filtering and ID verification are becoming essential for responsibly releasing voice cloning technology. AI voice cloning was the third fastest-growing scam of 2024, leading to fraud and bypassing bank security checks as privacy and copyright laws struggle to keep pace. Malicious actors have used voice cloning to create deepfakes of celebrities and politicians, which have spread rapidly on social media.

OpenAI might release Voice Engine next week, or it might never happen. The company has mentioned considering keeping the service small in scope. But one thing is certain: whether for optics, safety, or both, Voice Engine's limited preview has become one of the longest in OpenAI's history.

Related article
Google Search présente le «mode AI» pour les requêtes complexes et multi-parties Google Search présente le «mode AI» pour les requêtes complexes et multi-parties Google dévoile le "mode AI" dans la recherche pour rivaliser avec perplexité AI et ChatgptGoogle intensifie son jeu dans l'arène AI avec le lancement d'une fonction expérimentale "Mode AI" dans son moteur de recherche. Visant à prendre des goûts de perplexity AI et de la recherche Chatgpt d'Openai, ce nouveau mode a été annoncé le mercredi
L'utilisation non sollicitée par Chatgpt des noms d'utilisateurs étimule les préoccupations «effrayantes» parmi certains L'utilisation non sollicitée par Chatgpt des noms d'utilisateurs étimule les préoccupations «effrayantes» parmi certains Certains utilisateurs de Chatgpt ont récemment rencontré une nouvelle fonctionnalité étrange: le chatbot utilise occasionnellement leur nom tout en travaillant sur des problèmes. Cela ne faisait pas partie de son comportement habituel auparavant, et de nombreux utilisateurs signalent que Chatgpt mentionne leurs noms sans jamais leur dire comment les appeler. Opinions sur
Openai améliore le chatppt pour rappeler les conversations précédentes Openai améliore le chatppt pour rappeler les conversations précédentes Openai a fait une grande annonce jeudi à propos de déployer une nouvelle fonctionnalité dans Chatgpt intitulée "Memory". Cet outil Nifty est conçu pour rendre vos conversations avec l'IA plus personnalisées en se souvenant de ce dont vous avez déjà parlé. Imaginez de ne pas avoir à vous répéter chaque fois que vous commencez un nouveau conve
Comments (5)
0/200
StephenScott
StephenScott April 21, 2025 at 11:54:47 PM GMT

It's been a year and OpenAI's Voice Engine is still in preview mode? Come on, I was so excited about cloning voices with just 15 seconds of speech! The wait is killing me, but I guess good things take time. Hopefully, it'll be worth it when it finally drops! 🤞

WillieHernández
WillieHernández April 21, 2025 at 11:54:47 PM GMT

オープンAIのVoice Engine、まだプレビュー版のままなんて信じられない!15秒の音声で声をクローンできるって聞いてすごく期待してたのに。待つのはつらいけど、良いものは時間がかかるってことかな。リリースが楽しみだよ!🤞

BillyWilson
BillyWilson April 21, 2025 at 11:54:47 PM GMT

오픈AI의 Voice Engine이 아직도 프리뷰 상태라니 믿기지 않아! 15초의 음성으로 목소리를 복제할 수 있다니 기대가 컸는데. 기다리는 게 힘들지만 좋은 건 시간이 걸리는 법이죠. 출시가 기대돼요! 🤞

KennethKing
KennethKing April 21, 2025 at 11:54:47 PM GMT

Já faz um ano e o Voice Engine da OpenAI ainda está em modo de pré-visualização? Sério? Estava tão animado para clonar vozes com apenas 15 segundos de fala! A espera está me matando, mas suponho que coisas boas levam tempo. Espero que valha a pena quando finalmente for lançado! 🤞

JeffreyThomas
JeffreyThomas April 21, 2025 at 11:54:47 PM GMT

¿Ha pasado un año y el Voice Engine de OpenAI sigue en modo de vista previa? ¡Vamos, estaba tan emocionado de clonar voces con solo 15 segundos de habla! La espera me está matando, pero supongo que las cosas buenas toman tiempo. Espero que valga la pena cuando finalmente se lance! 🤞

Back to Top
OR