AI speech generator 'reaches human parity' - Too dangerous to release.

imhotep · Jul 10, 2024

Microsoft's VALL-E 2 can convincingly recreate human voices using just a few seconds of audio, its creators claim.

Microsoft has developed a new artificial intelligence (AI) speech generator that is apparently so convincing it cannot be released to the public.

VALL-E 2 is a text-to-speech (TTS) generator that can reproduce the voice of a human speaker using just a few seconds of audio.

Microsoft researchers said VALL-E 2 was capable of generating "accurate, natural speech in the exact voice of the original speaker, comparable to human performance," in a paper that appeared June 17. In other words, the new AI voice generator is convincing enough to be mistaken for a real person — at least, according to its creators.

"VALL-E 2 is the latest advancement in neural codec language models that marks a milestone in zero-shot text-to-speech synthesis (TTS), achieving human parity for the first time," the researchers wrote in the paper. "Moreover, VALL-E 2 consistently synthesizes high-quality speech, even for sentences that are traditionally challenging due to their complexity or repetitive phrases."

Human parity in this context means that speech generated by VALL-E 2 matched or exceeded the quality of human speech in benchmarks used by Microsoft.
The AI engine is capable of this given the inclusion of two key features: "Repetition Aware Sampling" and "Grouped Code Modeling."

Ethics Statement
VALL-E 2 is purely a research project. Currently, we have no plans to incorporate VALL-E 2 into a product or expand access to the public. VALL-E 2 could synthesize speech that maintains speaker identity and could be used for educational learning, entertainment, journalistic, self-authored content, accessibility features, interactive voice response systems, translation, chatbot, and so on. While VALL-E 2 can speak in a voice like the voice talent, the similarity, and naturalness depend on the length and quality of the speech prompt, the background noise, as well as other factors. It may carry potential risks in the misuse of the model, such as spoofing voice identification or impersonating a specific speaker. We conducted the experiments under the assumption that the user agrees to be the target speaker in speech synthesis. If the model is generalized to unseen speakers in the real world, it should include a protocol to ensure that the speaker approves the use of their voice and a synthesized speech detection model. If you suspect that VALL-E 2 is being used in a manner that is abusive or illegal or infringes on your rights or the rights of other people, you can report it at the Report Abuse Portal.

NRTG · Jul 10, 2024

The whole world is going to get mixed up with AI and turn into one big mess :baffled:

LZP1992 · Jul 10, 2024

NRTG said:
The whole world is going to get mixed up with AI and turn into one big mess

If limitations aren't put in place, this looks set to turn into a disaster :yes:

poopoo · Jul 10, 2024

Not released == doesn't exist
Just another Microsoft PR bullshit

LZP1992 · Jul 10, 2024

poopoo said:
Not released == doesn't exist
Just another Microsoft PR bullshit

It was bullshit even before AI came along, mate
It's all fine now, right

dayt0na · Jul 10, 2024

poopoo said:
Not released == doesn't exist
Just another Microsoft PR bullshit

Alternatives already available.

Stimulus mind · Jul 10, 2024

I wonder if this is what was foretold in brother Arnold Subasinghe's voice? If so, we're done for. :dull:

poopoo · Jul 10, 2024

LZP1992 said:
It was bullshit even before AI came along, mate
It's all fine now, right

Yeah, but I'm talking about Microsoft.
Lately all of their AI products have gone to shit.

Copilot is useless in many ways; Gemini is better by now.
Also, the former Copilot head left Microsoft for a startup.
If the next VALL-E model is ready, they should release it without hyping it up too much.

imhotep said:
Currently, we have no plans to incorporate VALL-E 2 into a product or expand access to the public.

See, just a PR boost.
MS is under fire from EU regulations and the US Gov over Teams and weak security.

AnuradhaRa · Jul 10, 2024

Somebody set me up with an AI girl to sex chat with...
A properly lewd AI girl

LZP1992 · Jul 10, 2024

poopoo said:
Yeah, but I'm talking about Microsoft.
Lately all of their AI products have gone to shit.

Copilot is useless in many ways; Gemini is better by now.
Also, the former Copilot head left Microsoft for a startup.
If the next VALL-E model is ready, they should release it without hyping it up too much.

See, just a PR boost.
MS is under fire from EU regulations and the US Gov over Teams and weak security.

Ahaaa Microsoft :yes:

kasunkaru · Jul 10, 2024

NRTG said:
The whole world is going to get mixed up with AI and turn into one big mess

Get ready to go back to working the paddy fields, Sinhalese folks. IT jobs are getting wrecked too, boys

your_love · Jul 10, 2024

AnuradhaRa said:
Somebody set me up with an AI girl to sex chat with...
A properly lewd AI girl

Doesn't doing that give you mental issues? Is it a thrill for your brain?

NRTG · Jul 10, 2024

kasunkaru said:
Get ready to go back to working the paddy fields, Sinhalese folks. IT jobs are getting wrecked too, boys

That's true. Give it any complex SQL query now and it builds it for you in a flash. With some of them you can even link the database, so you don't have to keep supplying table names..... [Experience I picked up while learning about this stuff for fun]

kinkon · Jul 10, 2024

[GIF: Arnold Schwarzenegger film, by Tech Noir]

kasunkaru · Jul 10, 2024

NRTG said:
That's true. Give it any complex SQL query now and it builds it for you in a flash. With some of them you can even link the database, so you don't have to keep supplying table names..... [Experience I picked up while learning about this stuff for fun]

That's a good thing too, mate; some guys calling themselves software engineers were walking two or three feet off the ground. I'm in the software field as well, but I for one like that AI arrived. A lot of work got easier.

pasansnoop · Jul 10, 2024

imhotep said:
Microsoft's VALL-E 2

This straight-up copies the name of OpenAI's Dall-E 2

olu bakka · Jul 10, 2024

If they don't, someone else will

poopoo · Jul 10, 2024

pasansnoop said:
This straight-up copies the name of OpenAI's Dall-E 2

Yes, because MS owns OpenAI.
Dall-E is an image generation model.
Vall-E is for speech synthesis.

It makes sense to use rhyming brand names.

Sonique · Jul 30, 2024

මයික්‍රොසොෆ්ට් දැන් දොන්ත පතන් නයිත් අරින්න පටන් ගෙනද බන් ඒ පාර

poopoo said:
Yes, because MS owns OpenAI.
Dall-E is an image generation model.
Vall-E is for speech synthesis.

It makes sense to use rhyming brand names.

Who says OpenAI is owned by MS? OpenAI is a direct rival to MS, afaik.
------ Post added on Jul 30, 2024 at 11:24 AM

Asmodeus · Jul 30, 2024

NRTG said:
The whole world is going to get mixed up with AI and turn into one big mess

Yes, this is an age of transition. Old ideas are clashing with new ones, and that clash is nothing new: it happened thousands of years ago and is still happening. The future will be far different, and I can see that the next evolution of humans is on the way. Within less than two centuries, I think there will be androids that have the same rights we humans have. That new species will be our next step in evolution.
