CHiVE: Varying prosody in speech synthesis with a linguistically driven dynamic hierarchical conditional variational network