Sirvi Autor "Kanerva, Jenna" järgi
Nüüd näidatakse 1 - 8 8
- Tulemused lehekülje kohta
- Sorteerimisvalikud
listelement.badge.dso-type Kirje , Dep_search: Efficient Search Tool for Large Dependency Parsebanks(Gothenburg, Sweden, Association for Computational Linguistics, pp. 255--258, 2017) Luotolahti, Juhani; Kanerva, Jenna; Ginter, Filip; Tiedemann, Jörg; Tahmasebi, Ninalistelement.badge.dso-type Kirje , Finnish Paraphrase Corpus(Reykjavik, Iceland (Online), Linköping University Electronic Press, Sweden, pp. 288--298, 2021) Kanerva, Jenna; Ginter, Filip; Chang, Li-Hsin; Rastas, Iiro; Skantsi, Valtteri; Kilpeläinen, Jemina; Kupari, Hanna-Mari; Saarni, Jenna; Sevón, Maija; Tarkka, Otto; Dobnik, Simon; Øvrelid, Liljalistelement.badge.dso-type Kirje , Is Multilingual BERT Fluent in Language Generation?(Turku, Finland, Linköping University Electronic Press, pp. 29--36, 2019) Rönnqvist, Samuel; Kanerva, Jenna; Salakoski, Tapio; Ginter, Filip; Nivre, Joakim and Derczynski, Leon and Ginter, Filip; Lindi, Bjørn; Oepen, Stephan; Søgaard, Anders; Tidemann, Jörglistelement.badge.dso-type Kirje , OCR Error Post-Correction with LLMs in Historical Documents: No Free Lunches(University of Tartu Library, 2025-03) Kanerva, Jenna; Ledins, Cassandra; Käpyaho, Siiri; Ginter, Filip; Tudor, Crina Madalina; Debess, Iben Nyholm; Bruton, Micaella; Scalvini, Barbara; Ilinykh, Nikolai; Holdt, Špela ArharOptical Character Recognition (OCR) systems often introduce errors when transcribing historical documents, leaving room for post-correction to improve text quality. This study evaluates the use of open-weight LLMs for OCR error correction in historical English and Finnish datasets. We explore various strategies, including parameter optimization, quantization, segment length effects, and text continuation methods. Our results demonstrate that while modern LLMs show promise in reducing character error rates (CER) in English, a practically useful performance for Finnish was not reached. Our findings highlight the potential and limitations of LLMs in scaling OCR post-correction for large historical corpora.listelement.badge.dso-type Kirje , Template-free Data-to-Text Generation of Finnish Sports News(Turku, Finland, Linköping University Electronic Press, pp. 242--252, 2019) Kanerva, Jenna; Rönnqvist, Samuel; Kekki, Riina; Salakoski, Tapio; Ginter, Filip; Hartmann, Mareike; Plank, Barbaralistelement.badge.dso-type Kirje , Towards the Classification of the Finnish Internet Parsebank: Detecting Translations and Informality(Vilnius, Lithuania, Linköping University Electronic Press, Sweden, pp. 107--116, 2015) Laippala, Veronika; Kanerva, Jenna; Missilä, Anna; Pyysalo, Sampo; Salakoski, Tapio; Ginter, Filip; Megyesi, Beátalistelement.badge.dso-type Kirje , Universal Dependencies for Finnish(Vilnius, Lithuania, Linköping University Electronic Press, Sweden, pp. 163--172, 2015) Pyysalo, Sampo; Kanerva, Jenna; Missilä, Anna; Laippala, Veronika; Ginter, Filip; Megyesi, Beátalistelement.badge.dso-type Kirje , WikiBERT Models: Deep Transfer Learning for Many Languages(Reykjavik, Iceland (Online), Linköping University Electronic Press, Sweden, pp. 1–10, 2021) Pyysalo, Sampo; Kanerva, Jenna; Virtanen, Antti; Ginter, Filip; Dobnik, Simon; Øvrelid, Lilja