Browse by Author "Yankovskaya, Lisa"
Item: Machine Translation for Low-resource Finno-Ugric Languages (University of Tartu Library, 2023-05)
Yankovskaya, Lisa; Tars, Maali; Tätar, Andre; Fišhel, Mark

Item: Paragraph-Level Machine Translation for Low-Resource Finno-Ugric Languages (University of Tartu Library, 2025-03)
Pashchenko, Dmytro; Yankovskaya, Lisa; Fishel, Mark; Johansson, Richard; Stymne, Sara
We develop paragraph-level machine translation for four low-resource Finno-Ugric languages: Proper Karelian, Livvi, Ludian, and Veps. The approach is based on sentence-level pre-trained translation models, which are fine-tuned with paragraph-parallel data. This allows the resulting model to develop a native ability to handle discourse-level phenomena correctly, in particular when translating from grammatically gender-neutral input in Finno-Ugric languages. We collect monolingual and parallel paragraph-level corpora for these languages. Our experiments show that paragraph-level translation models can translate sentences no worse than sentence-level systems, while handling discourse-level phenomena better. For evaluation, we manually translate part of FLORES-200 into these four languages. All our results, data, and models are released openly.