The Roles of English in Evaluating Multilingual Language Models

Poelman, Wessel; Lhoneux, Miryam de

Files

2025_nodalida_1_53.pdf (199.7 KB)

Date

2025-03

Authors

Poelman, Wessel

Lhoneux, Miryam de

Publisher

University of Tartu Library

Abstract

Multilingual natural language processing is receiving increasing attention, with numerous models, benchmarks, and methods being released for many languages. English is often used in multilingual evaluation to prompt language models (LMs), mainly to overcome the lack of instruction tuning data in other languages. In this position paper, we lay out two roles of English in multilingual LM evaluations: as an interface and as a natural language. We argue that these roles have different goals: task performance versus language understanding. This discrepancy is highlighted with examples from datasets and evaluation setups. Numerous works explicitly use English as an interface to boost task performance. We recommend moving away from these imprecise methods and instead focusing on language understanding.

URI

https://hdl.handle.net/10062/107245

Collections

Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
