First Steps in Benchmarking Latvian in Large Language Models
Date
2025-03
Publisher
University of Tartu Library
Abstract
The performance of multilingual large language models (LLMs) on low-resource languages such as Latvian has been under-explored. In this paper, we investigate the capabilities of several open and commercial LLMs on Latvian language understanding tasks. We evaluate these models on several well-known benchmarks, including Choice of Plausible Alternatives (COPA) and Measuring Massive Multitask Language Understanding (MMLU), adapted into Latvian using machine translation. Our results highlight significant variability in model performance, underscoring the challenges of extending LLMs to low-resource languages. We also analyze the effect of post-editing machine-translated datasets, observing notable improvements in model accuracy, particularly with BERT-based architectures. We further assess open-source LLMs on the Belebele dataset, where open-weight models achieve performance competitive with proprietary systems. This study offers key insights into the limitations of current LLMs in low-resource settings and provides datasets for future benchmarking efforts.
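
The abstract does not specify the evaluation protocol; a common approach for multiple-choice benchmarks such as COPA is to score each candidate continuation by its length-normalized log-likelihood under the model and pick the highest-scoring one. Below is a minimal Python sketch of that approach, not the paper's actual pipeline: the model name and the Latvian example item are illustrative placeholders.

    # Sketch: scoring a COPA-style multiple-choice item with a causal LM
    # via length-normalized log-likelihood. The model and the example
    # item are placeholder assumptions, not the paper's setup.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_name = "ai-forever/mGPT"  # placeholder multilingual model
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name)
    model.eval()

    def choice_logprob(premise: str, choice: str) -> float:
        """Average log-probability of the choice tokens given the premise."""
        prem_ids = tokenizer(premise, return_tensors="pt").input_ids
        full_ids = tokenizer(premise + " " + choice, return_tensors="pt").input_ids
        with torch.no_grad():
            logits = model(full_ids).logits
        # Log-prob of each token given its prefix, shifted by one position.
        log_probs = torch.log_softmax(logits[0, :-1], dim=-1)
        targets = full_ids[0, 1:]
        token_lp = log_probs[torch.arange(targets.size(0)), targets]
        # Keep only the choice tokens (approximate: assumes the premise
        # tokenizes identically alone and as a prefix of the full text).
        choice_lp = token_lp[prem_ids.size(1) - 1:]
        return choice_lp.mean().item()  # length normalization

    # Hypothetical machine-translated COPA-style item (Latvian).
    premise = "Vīrietis atvēra lietussargu, jo"
    choices = ["sāka līt.", "spīdēja saule."]
    scores = [choice_logprob(premise, c) for c in choices]
    print("Predicted choice:", choices[scores.index(max(scores))])

Length normalization (averaging rather than summing token log-probabilities) keeps longer choices from being penalized simply for containing more tokens, which matters when machine-translated alternatives differ in length.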