Neural Voice Cloning with a Few Samples - Baidu Research

research.baidu.com

Baidu's Deep Voice project focuses on teaching machines to generate speech from text that sound more human-like.

Beyond single-speaker speech synthesis, this new research demonstrates that a single system could learn to reproduce thousands of speaker identities, with less than half an hour of training data for each speaker. This capability is enabled by learning shared and discriminative information from speakers.

Read more...
Linkedin

Want to receive more content like this in your inbox?