Meta announced a new AI model called Voicebox yesterday, one it says is the most versatile yet for speech generation, but it’s not releasing it yet: The model is still only a research project, but Meta says can generate speech in six languages from samples as short as two seconds and could be used for “natural, authentic” translation in the future, among other things.
I’ve personally messed around with ElevenLabs and their voice generation, and I was honestly amazed. I even did an experiment by running a fully AI YouTube channel for a couple weeks. I wouldn’t be surprised if a massive company like Facebook was able to pull off a realistic sounding voice.