Skip to main content

Digital Trends may earn a commission when you buy through links on our site. Why trust us?

The new AI tool that was deemed ‘too dangerous’ to release

Back in 2019, OpenAI refused to release its full research into the development of GPT2 over fears that it was “too dangerous” to release publicly. On Thursday, OpenAI’s biggest financial backer, Microsoft, made a similar pronouncement about its new VALL-E 2 voice synthesizer AI.

The VALL-E 2 system is a zero-shot text-to-speech synthesis (TTS) AI, meaning that it can recreate hyper-realistic speech based on just a few seconds of sample audio. Per the research team, VALL-E 2 “surpasses previous systems in speech robustness, naturalness, and speaker similarity. It is the first of its kind to reach human parity on these benchmarks.”

Recommended Videos

The system reportedly can even handle sentences that are difficult to pronounce because of their structural complexity or repetitive phrasing, such as tongue twisters.

There are a host of potential beneficial uses for such a system, like enabling people suffering from aphasia or Amyotrophic lateral sclerosis (commonly known as ALS or Lou Gehrig’s disease) to speak again, albeit through a computer, as well as use in education, entertainment, journalism, chatbots and translation, or as accessibility features and “interactive voice response systems,” like Siri. However, the team also recognizes numerous opportunities for the public to misuse its technology, “such as spoofing voice identification or impersonating a specific speaker.”

As such the AI will only be available for research purposes. “Currently, we have no plans to incorporate VALL-E 2 into a product or expand access to the public,” the team wrote. ” If you suspect that VALL-E 2 is being used in a manner that is abusive or illegal or infringes on your rights or the rights of other people, you can report it at the Report Abuse Portal.”

Microsoft is hardly alone in its efforts to train computers to speak as humans do. Google’s Chirp, ElevenLabs’ Iconic Voices, and Voicebox from Meta all aim to perform similar functions.

However, such systems have come under ethical scrutiny as they have repeatedly been used to scam unsuspecting victims by emulating the voice of a loved one or a well-known celebrity. And unlike generated images, there’s currently no way to effectively “watermark” AI generated audio.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
NASA tests new AI chatbot to make sense of complex data
An Earth image captured by NASA.

Using its Earth-observing satellites, NASA has collected huge amounts of highly complex data about our planet over the years to track climate change, monitor wildfires, and plenty more besides.

But making sense of it all, and bringing it to the masses, is a challenging endeavor. Until now, that is.

Read more
Is AI already plateauing? New reporting suggests GPT-5 may be in trouble
A person sits in front of a laptop. On the laptop screen is the home page for OpenAI's ChatGPT artificial intelligence chatbot.

OpenAI's next-generation Orion model of ChatGPT, which is both rumored and denied to be arriving by the end of the year, may not be all it's been hyped to be once it arrives, according to a new report from The Information.

Citing anonymous OpenAI employees, the report claims the Orion model has shown a "far smaller" improvement over its GPT-4 predecessor than GPT-4 showed over GPT-3. Those sources also note that Orion "isn’t reliably better than its predecessor [GPT-4] in handling certain tasks," specifically coding applications, though the new model is notably stronger at general language capabilities, such as summarizing documents or generating emails.

Read more
Elon Musk reportedly will blow $10 billion on AI this year
Elon Musk at Tesla Cyber Rodeo.

Between Tesla and xAI, Elon Musk's artificial intelligence aspirations have cost some $10 billion dollars in bringing training and inference compute capabilities online this year, according to a Thursday post on X (formerly Twitter) by Tesla investor Sawyer Merritt.

"Tesla already deployed and is training ahead of schedule on a 29,000 unit Nvidia H100 cluster at Giga Texas – and will have 50,000 H100 capacity by the end of October, and ~85,000 H100 equivalent capacity by December," Merritt noted.

Read more