Revisiting Projection-based Data Transfer for Cross-Lingual Named Entity Recognition in Low-Resource Languages
| dc.contributor.author | Politov, Andrei | |
| dc.contributor.author | Shkalikov, Oleh | |
| dc.contributor.author | Jäkel, Rene | |
| dc.contributor.author | Färber, Michael | |
| dc.contributor.editor | Johansson, Richard | |
| dc.contributor.editor | Stymne, Sara | |
| dc.coverage.spatial | Tallinn, Estonia | |
| dc.date.accessioned | 2025-02-18T14:24:13Z | |
| dc.date.available | 2025-02-18T14:24:13Z | |
| dc.date.issued | 2025-03 | |
| dc.description.abstract | Cross-lingual Named Entity Recognition (NER) leverages knowledge transfer between languages to identify and classify named entities, making it particularly useful for low-resource languages. We show that the data-based cross-lingual transfer method is an effective technique for cross-lingual NER and can outperform multi-lingual language models for low-resource languages. This paper introduces two key enhancements to the annotation projection step in cross-lingual NER for low-resource languages. First, we explore refining word alignments using back-translation to improve accuracy. Second, we present a novel formalized projection approach of matching source entities with extracted target candidates. Through extensive experiments on two datasets spanning 57 languages, we demonstrated that our approach surpasses existing projection-based methods in low-resource settings. These findings highlight the robustness of projection-based data transfer as an alternative to model-based methods for cross-lingual named entity recognition in low-resource languages. | |
| dc.identifier.uri | https://hdl.handle.net/10062/107246 | |
| dc.language.iso | en | |
| dc.publisher | University of Tartu Library | |
| dc.relation.ispartofseries | NEALT Proceedings Series, No. 57 | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.title | Revisiting Projection-based Data Transfer for Cross-Lingual Named Entity Recognition in Low-Resource Languages | |
| dc.type | Article |
Failid
Originaal pakett
1 - 1 1