AI speech generator 'reaches human parity' - Too dangerous to release.

imhotep

Well-known member
  • Mar 29, 2017
    14,833
    8
    35,357
    113
    Microsoft's VALL-E 2 can convincingly recreate human voices using just a few seconds of audio, its creators claim.

    Microsoft has developed a new artificial intelligence (AI) speech generator that is apparently so convincing it cannot be released to the public.

    VALL-E 2 is a text-to-speech (TTS) generator that can reproduce the voice of a human speaker using just a few seconds of audio.


    Microsoft researchers said VALL-E 2 was capable of generating "accurate, natural speech in the exact voice of the original speaker, comparable to human performance," in a paper that appeared June 17. In other words, the new AI voice generator is convincing enough to be mistaken for a real person — at least, according to its creators.

    "VALL-E 2 is the latest advancement in neural codec language models that marks a milestone in zero-shot text-to-speech synthesis (TTS), achieving human parity for the first time," the researchers wrote in the paper. "Moreover, VALL-E 2 consistently synthesizes high-quality speech, even for sentences that are traditionally challenging due to their complexity or repetitive phrases."

    Human parity in this context means that speech generated by VALL-E 2 matched or exceeded the quality of human speech in benchmarks used by Microsoft.
    The AI engine is capable of this given the inclusion of two key features: "Repetition Aware Sampling" and "Grouped Code Modeling."

    Ethics Statement..
    VALL-E 2 is purely a research project. Currently, we have no plans to incorporate VALL-E 2 into a product or expand access to the public. VALL-E 2 could synthesize speech that maintains speaker identity and could be used for educational learning, entertainment, journalistic, self-authored content, accessibility features, interactive voice response systems, translation, chatbot, and so on. While VALL-E 2 can speak in a voice like the voice talent, the similarity, and naturalness depend on the length and quality of the speech prompt, the background noise, as well as other factors. It may carry potential risks in the misuse of the model, such as spoofing voice identification or impersonating a specific speaker. We conducted the experiments under the assumption that the user agrees to be the target speaker in speech synthesis. If the model is generalized to unseen speakers in the real world, it should include a protocol to ensure that the speaker approves the use of their voice and a synthesized speech detection model. If you suspect that VALL-E 2 is being used in a manner that is abusive or illegal or infringes on your rights or the rights of other people, you can report it at the Report Abuse Portal.
     

    Stimulus mind

    Well-known member
  • Feb 27, 2021
    30,862
    152,602
    113
    ආර්නෝල්ඩ් සුබසිංහ අයියගෙ වොයිස් එකෙන් මේක ද දන්නෙ නෑ කියලා තියෙන්නෙ? එහෙනම් අපි කපෝතියි. :dull:😰🥶




    1*G7XNkE4VZ7KNyfatjLlSrg.jpeg
     

    poopoo

    Well-known member
  • Nov 18, 2021
    5,766
    11,420
    113
    Ai ennath kalin bullshit tamai machn
    Dn harine :D
    yeah, but im talking about Microsoft
    lately all of their AI products gone shit

    Copilot useless in many ways; Gemini better by now
    Also, former copilot head left Microsoft for a startup
    If the next VALL-E model is ready, they should release it without hyping it up too much

    Currently, we have no plans to incorporate VALL-E 2 into a product or expand access to the public.
    see :frown:, just some PR boost
    MS under fire from EU regulations and US Gov for Teams and weak security
     
    • Like
    Reactions: NRTG

    AnuradhaRa

    Well-known member
  • Dec 25, 2010
    61,707
    1
    42,859
    113
    සෙක්ස් චැට් කරන්න AI කෑල්ලක් සෙට් කරල දෙන්න මට...
    හොද වනචර AI කෑල්ලක්
     
    • Haha
    Reactions: Asmodeus

    LZP1992

    Well-known member
  • Feb 6, 2014
    5,636
    5,940
    113
    @ගෙදර
    yeah, but im talking about Microsoft
    lately all of their AI products gone shit

    Copilot useless in many ways; Gemini better by now
    Also, former copilot head left Microsoft for a startup
    If the next VALL-E model is ready, they should release it without hyping it up too much


    see :frown:, just some PR boost
    MS under fire from EU regulations and US Gov for Teams and weak security
    Ahaaa Microsoft :yes:
     
    • Like
    Reactions: NRTG

    kasunkaru

    Well-known member
  • Jan 25, 2018
    7,552
    5,815
    113
    මුළු ලෝකෙම AI එක්ක කවලම් කරලා දාලා ජංජාලයක් වෙන්නේ :baffled:
    ayee kumburu kotanna laasti weyalla sinhalu. IT jobut kela weegena yanne kolloneee
     
    • Sad
    Reactions: NRTG

    your_love

    Well-known member
  • Apr 7, 2012
    14,047
    1
    11,083
    113
    සෙක්ස් චැට් කරන්න AI කෑල්ලක් සෙට් කරල දෙන්න මට...
    හොද වනචර AI කෑල්ලක්
    ehema karaddi ubata metal issues nadda? Athal da moleta?
     

    NRTG

    Well-known member
  • Oct 19, 2019
    40,882
    198,128
    113
    Colombo, Sri Lanka
    ayee kumburu kotanna laasti weyalla sinhalu. IT jobut kela weegena yanne kolloneee
    එකනම් ඇත්ත දැන් ඕනෑම කොම්ප්ලෙක්ස් SQL QUERY එකක් උට දුන්නාව පට ගාලා හදලා දෙනවා. සමහර එව්වට database එකත් ලින්ක් කරන්න පුළුවන් එතකොට ටේබල් නේම් දදා ඉන්න ඕනෙත් නැහැ..... [ මම මෙව්වා ගැන විනෝදෙට ඉගෙන ගන්න කොට ලැබුණු අත්දැකීම් ]
     

    kasunkaru

    Well-known member
  • Jan 25, 2018
    7,552
    5,815
    113
    එකනම් ඇත්ත දැන් ඕනෑම කොම්ප්ලෙක්ස් SQL QUERY එකක් උට දුන්නාව පට ගාලා හදලා දෙනවා. සමහර එව්වට database එකත් ලින්ක් කරන්න පුළුවන් එතකොට ටේබල් නේම් දදා ඉන්න ඕනෙත් නැහැ..... [ මම මෙව්වා ගැන විනෝදෙට ඉගෙන ගන්න කොට ලැබුණු අත්දැකීම් ]
    ekat hondai bn software engineer kiyala samahru un adi 2 3k udin giyee. apith software field tamai eth mamanm kamati AI apu ekata godak wada lesi wunaa
     

    Sonique

    Well-known member
  • Oct 22, 2007
    25,165
    11,184
    113
    Forest
    මයික්‍රොසොෆ්ට් දැන් දොන්ත පතන් නයිත් අරින්න පටන් ගෙනද බන් ඒ පාර 😂

    yes, because MS own the OpenAI
    Dall-E is an image generative model
    Vall-E for speech synthesis

    It makes sense for using rhyming brand names
    Who says openAI is owned by MS? OpenAI is a direct rival to MS afaik
    ------ Post added on Jul 30, 2024 at 11:24 AM
     
    • Like
    Reactions: NRTG

    Asmodeus

    Well-known member
  • Feb 6, 2023
    6,846
    15,615
    113
    Ursa Major
    මුළු ලෝකෙම AI එක්ක කවලම් කරලා දාලා ජංජාලයක් වෙන්නේ :baffled:
    Yes, this is a transition age. the old ideas clashed with the new and this clash is nothing new. As it happened thousands of years ago and is still happening. The future will be far different, and I can see that the next evolution of humans is on the way. Within less than two centuries, I think there will be androids that have the same rights we humans have. That those new species will be our next step in the evolution.
     
    • Like
    • Sad
    Reactions: kinkon and NRTG