WaveNet: A generative model for raw audio
fetched at September 8, 2016

This post presents WaveNet, a deep generative model of raw audio waveforms. We show that WaveNets are able to generate speech which mimics any human voice and which sounds more natural than the best existing Text-to-Speech systems, reducing the gap with human performance by over 50%. We also demonstrate that the same network can be used to synthesize other audio signals such as music, and present some striking samples of automatically generated piano pieces.

Visit Link
🤖 CodeKN.com

Follow our Twitter bot for more updates.