Estonian isolated-word text-to-speech synthesiser

dc.contributor.authorKiissel, Indrek
dc.contributor.authorPiits, Liisi
dc.contributor.authorSahkai, Heete
dc.contributor.authorHein, Indrek
dc.contributor.authorErmus, Liis
dc.contributor.authorMihkla, Meelis
dc.contributor.editorJohansson, Richard
dc.contributor.editorStymne, Sara
dc.coverage.spatialTallinn, Estonia
dc.date.accessioned2025-02-18T09:18:46Z
dc.date.available2025-02-18T09:18:46Z
dc.date.issued2025-03
dc.description.abstractThis paper presents the development and evaluation of an Estonian isolated-word text-to-speech (TTS) synthesiser. Unlike conventional TTS systems that convert continuous text into speech, this system focuses on the synthesis of isolated words, which is crucial for applications such as pronunciation training, speech therapy, and (learners’) dictionaries. The system addresses two key challenges: generating natural prosody for isolated words and context-free disambiguation of homographs. We conducted a perception test to evaluate the performance of the TTS system in terms of pronunciation accuracy. We used 16 pairs of homographs that differ in palatalisation and 16 pairs of homographs that differ in quantity. Given that all the test items were correctly recognised by a majority of the evaluators, the performance of the synthesiser can be considered very good.
dc.identifier.urihttps://hdl.handle.net/10062/107223
dc.language.isoen
dc.publisherUniversity of Tartu Library
dc.relation.ispartofseriesNEALT Proceedings Series, No. 57
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleEstonian isolated-word text-to-speech synthesiser
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
2025_nodalida_1_32.pdf
Suurus:
385.96 KB
Formaat:
Adobe Portable Document Format