The Roles of English in Evaluating Multilingual Language Models

dc.contributor.author	Poelman, Wessel
dc.contributor.author	Lhoneux, Miryam de
dc.contributor.editor	Johansson, Richard
dc.contributor.editor	Stymne, Sara
dc.coverage.spatial	Tallinn, Estonia
dc.date.accessioned	2025-02-18T14:18:47Z
dc.date.available	2025-02-18T14:18:47Z
dc.date.issued	2025-03
dc.description.abstract	Multilingual natural language processing is getting increased attention, with numerous models, benchmarks, and methods being released for many languages. English is often used in multilingual evaluation to prompt language models (LMs), mainly to overcome the lack of instruction tuning data in other languages. In this position paper, we lay out two roles of English in multilingual LM evaluations: as an interface and as a natural language. We argue that these roles have different goals: task performance versus language understanding. This discrepancy is highlighted with examples from datasets and evaluation setups. Numerous works explicitly use English as an interface to boost task performance. We recommend to move away from these imprecise methods and instead focus on language understanding.
dc.identifier.uri	https://hdl.handle.net/10062/107245
dc.language.iso	en
dc.publisher	University of Tartu Library
dc.relation.ispartofseries	NEALT Proceedings Series, No. 57
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.title	The Roles of English in Evaluating Multilingual Language Models
dc.type	Article

Failid

Nüüd näidatakse 1 - 1 1