FB pixel

OpenAI says its voice cloning tool is too effective for public release

OpenAI says its voice cloning tool is too effective for public release
 

In the original story of the genie in a bottle from One Thousand and One Nights, the genie threatens to kill the fisherman who freed him – a tale that seems to be resonating with OpenAI, as it continues to pursue advanced voice cloning and synthetic audio and video tools that it says come with major risks.

In a blog post, the company says results of testing show that its Voice Engine is so good at deepfake voice cloning and synthetic audio that it will almost certainly be misused on wide release, prompting the ChatGPT maker to hold back on setting the product loose until it establishes stronger rules and guidelines for deployment.

Developed in 2022, Voice Engine is an update on tech already used in Open AI’s text-to-speech API and the conversation mode of ChatGPT. The blog says Voice Engine “uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker. It is notable that a small model with a single 15-second sample can create emotive and realistic voices.” The company has not disclosed the source of the emotionally rich data used to train Voice Engine, but told TechCrunch that the model “was trained on a mix of licensed and publicly available data.”

Beginning, perhaps, to understand the full-scale implications of a free, easily accessible tool that can recreate the realistic voice of anyone from whom it has a 15-second sample, the company says it is now “taking a cautious and informed approach to a broader release due to the potential for synthetic voice misuse.”

“We hope to start a dialogue on the responsible deployment of synthetic voices, and how society can adapt to these new capabilities,” says the blog post. “Based on these conversations and the results of these small scale tests, we will make a more informed decision about whether and how to deploy this technology at scale.”

According to a report from ArsTechnica, the current terms and conditions for companies testing Voice Engine prohibit the impersonation of an individual or organization “without consent or legal right.” They mandate clear disclosure of the use of AI to clone voices, and informed consent from anyone whose voice is being cloned. Plus, Open AI uses watermarks to make it easier to identify audio produced using Voice Engine.

Nonetheless, the company makes clear its belief that stopping the generative AI speed train is not an option, and that it is up to society to change with the times. “We hope this preview of Voice Engine both underscores its potential and also motivates the need to bolster societal resilience against the challenges brought by ever more convincing generative models,” says its post. To start, it suggests phasing out voice authentication as a means of ID verification for banking and other sensitive use cases, increasing public education on AI, “exploring policies to protect the use of individuals’ voices in AI” and accelerating the development of liveness detection, watermarking and other tools to distinguish real voices from synthetic cloned audio.

It is worth noting that OpenAI has rung this particular bell before, likewise warning that its facial recognition software and its text-to-video API Sora are so astonishingly good as to be positioned to transform the world.

Related Posts

Article Topics

 |   |   |   |   |   | 

Latest Biometrics News

 

African nations making digital ID gains in the face of common challenges

Several countries in Africa are on the right track with their national digital ID projects, but this is not coming…

 

Digidentity enables remote onboarding registration for GMC doctors in UK

Healthcare continues to open up as a potentially major market for biometrics and reusable digital identity, as evidenced by the…

 

Explaining W3C Verifiable Credentials and biometrics a key mission for Dock Labs

All legitimate credentials can be verified – but not all credentials are Verifiable Credentials. It sounds a bit like a…

 

Ethiopia reveals strategy behind digital ID progress as ID4Africa 2025 opens

Ethiopia’s Fayda digital ID program successes were in the spotlight on the first day of ID4Africa 2025 in the country’s…

 

ID4Africa 2025 begins with record numbers, urgency and Ethiopian PM’s address

ID4Africa’s 2025 AGM kicked off today in Addis Ababa, Ethiopia, attended by Prime Minister Dr. Abiy Ahmed Ali and several…

 

DHS invites comments on new biometric sensor performance studies

The U.S. Department of Homeland Security (DHS) is inviting public comments on a proposed information collection initiative that is aimed…

Comments

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Biometric Market Analysis

Most Viewed This Week

Featured Company

Biometrics Insight, Opinion

Digital ID In-Depth

Biometrics White Papers

Biometrics Events