Creating a Voice Conversion Model for Estonian
Laen...
Kuupäev
Autorid
Ajakirja pealkiri
Ajakirja ISSN
Köite pealkiri
Kirjastaja
Tartu Ülikool
Abstrakt
Voice conversion has a variety of uses, like enhancement of impaired speech or entertainment purposes. The main challenge in voice conversion is extracting speaker-independent linguistic features from speech. To date, one of the most promising solutions is the Cotatron model. Estonian has some high-quality speech synthesis models, but there are no voice conversion models for Estonian. This thesis aims to take the Cotatron model and train it using Estonian Text-to-Speech datasets to produce a voice conversion model for the Estonian language.
Kirjeldus
Märksõnad
neural networks, voice conversion, speech technology, Cotatron, speaker imitation