Universal Dependencies Treebank for Uzbek
| dc.contributor.author | Akhundjanova, Arofat | |
| dc.contributor.author | Talamo, Luigi | |
| dc.contributor.editor | Holdt, Špela Arhar | |
| dc.contributor.editor | Ilinykh, Nikolai | |
| dc.contributor.editor | Scalvini, Barbara | |
| dc.contributor.editor | Bruton, Micaella | |
| dc.contributor.editor | Debess, Iben Nyholm | |
| dc.contributor.editor | Tudor, Crina Madalina | |
| dc.coverage.spatial | Tallinn, Estonia | |
| dc.date.accessioned | 2025-02-14T09:34:31Z | |
| dc.date.available | 2025-02-14T09:34:31Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | We present the first Universal Dependencies treebank for Uzbek, a low-resource language from the Turkic family. The treebank contains 500 sentences (5850 tokens) sourced from the news and fiction genres and it is annotated for lemmas, part-of-speech (POS) tags, morphological features, and dependency relations. We describe our methodology for building the treebank, which consists of a mix of manual and automatic annotation and discuss some constructions of the Uzbek language that pose challenges to the UD framework. | |
| dc.description.uri | https://aclanthology.org/2025.resourceful-1.0/ | |
| dc.identifier.uri | https://hdl.handle.net/10062/107109 | |
| dc.language.iso | en | |
| dc.publisher | University of Tartu Library | |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
| dc.title | Universal Dependencies Treebank for Uzbek | |
| dc.type | Article |
Failid
Originaal pakett
1 - 1 1