OpenAI is rolling out limited access to Voice Engine, its text-to-voice generation platform, as reported by The Verge. The platform can synthesize a realistic-sounding artificial voice from a 15-second audio clip. The generated voices can read text prompts in multiple languages, and OpenAI’s blog post says they have potential applications across various industries.

Among the companies granted access to Voice Engine are Age of Learning, HeyGen, Dimagi, Livox, and Lifespan. OpenAI has showcased samples demonstrating how Age of Learning uses the technology to produce pre-scripted voice-over content and to read out personalized, GPT-4-generated responses to students.

Voice Engine development commenced in late 2022 and has since powered preset voices for text-to-speech APIs and ChatGPT’s Read Aloud feature. Jeff Harris from OpenAI’s Voice Engine product team revealed to TechCrunch that the model was trained on a combination of licensed and publicly available data. The platform will be limited to approximately 10 developers, according to OpenAI’s disclosure to the publication.

AI text-to-audio generation continues to advance, but voice generation has received less attention, partly because of the concerns OpenAI itself has highlighted. Even so, companies such as Podcastle and ElevenLabs are working on AI voice cloning technologies, as previously discussed on The Vergecast.

Simultaneously, the US government is taking measures to regulate unethical applications of AI voice technology. The Federal Communications Commission recently prohibited robocalls utilizing AI voices after instances of spam calls impersonating President Joe Biden’s voice.

OpenAI’s partners have committed to usage policies that prohibit impersonation without consent, require explicit and informed consent from the original speakers, and require disclosing to listeners that the voices are AI-generated. To ensure accountability, OpenAI has implemented watermarking on audio clips and actively monitors their usage.

OpenAI suggests several measures to mitigate risks associated with such tools, including phasing out voice-based authentication for bank accounts, implementing policies safeguarding the use of individuals’ voices in AI, enhancing education on AI deepfakes, and developing AI content tracking systems.

Industry News

OpenAI’s Voice Cloning AI Model Requires Just a 15-Second Sample to Operate

April 8, 2024 (3 min read)



