Creating a Voice Conversion Model for Estonian

Laen...
Pisipilt

Kuupäev

Ajakirja pealkiri

Ajakirja ISSN

Köite pealkiri

Kirjastaja

Tartu Ülikool

Abstrakt

Voice conversion has a variety of uses, like enhancement of impaired speech or entertainment purposes. The main challenge in voice conversion is extracting speaker-independent linguistic features from speech. To date, one of the most promising solutions is the Cotatron model. Estonian has some high-quality speech synthesis models, but there are no voice conversion models for Estonian. This thesis aims to take the Cotatron model and train it using Estonian Text-to-Speech datasets to produce a voice conversion model for the Estonian language.

Kirjeldus

Märksõnad

neural networks, voice conversion, speech technology, Cotatron, speaker imitation

Viide