Microsoft's VALL-E can imitate any voice in just 3 seconds

11:00:00 AM, Wednesday 11th of January 2023 | in technology

Image Credit: firstpost

Three seconds: that is all it takes for Microsoft's newly developed text-to-speech AI model to mimic a person's voice. Dubbed VALL-E, it can generate audio of a person saying anything once it has learned that person's voice from a short sample. The AI's ability to mimic voices has taken many by surprise. It was trained on over 60,000 hours of English speech, far more than other text-to-speech models.


Shortpedia
