Neural Voice Cloning with a Few Samples - Baidu Research

Baidu's Deep Voice project focuses on teaching machines to generate speech from text that sound more human-like.

Beyond single-speaker speech synthesis, this new research demonstrates that a single system could learn to reproduce thousands of speaker identities, with less than half an hour of training data for each speaker. This capability is enabled by learning shared and discriminative information from speakers.


Want to receive more content like this in your inbox?