Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks


Abstract in English

The state-of-the-art in text-to-speech synthesis has recently improved considerably due to novel neural waveform generation methods, such as WaveNet. However, these methods suffer from their slow sequential inference process, while their parall

Download