Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks


Abstract (English)

The state of the art in text-to-speech synthesis has recently improved considerably due to novel neural waveform generation methods, such as WaveNet. However, these methods suffer from their slow sequential inference process, while their parall
