본문 바로가기

TTS

Pitchtron: Towards audiobook generation from ordinary people’s voices

Source code: https://github.com/hash2430/pitchtron/

 

 

 

 

 

Audio samples
Prosody Reference GST Soft pitchtron Hard pitchtron
Standard Korean neutral dialogue
Standard Korean neutral dialogue
Standard Korean emotive dialogue
Standard Korean emotive dialogue
Kyoungsang dialect
Kyoungsang dialect
Cheolla dialect
Cheolla dialect

 

 

Prosody transferability and vocal range scalability
Scale Reference GST Soft pitchtron Hard pitchtron
As is
Scale pitch by 0.5 -

 

 

Non-linear prosody transfer
Reference GST Soft pitchtron