Universal Dependencies Treebank for Uzbek

dc.contributor.authorAkhundjanova, Arofat
dc.contributor.authorTalamo, Luigi
dc.contributor.editorHoldt, Špela Arhar
dc.contributor.editorIlinykh, Nikolai
dc.contributor.editorScalvini, Barbara
dc.contributor.editorBruton, Micaella
dc.contributor.editorDebess, Iben Nyholm
dc.contributor.editorTudor, Crina Madalina
dc.coverage.spatialTallinn, Estonia
dc.date.accessioned2025-02-14T09:34:31Z
dc.date.available2025-02-14T09:34:31Z
dc.date.issued2025
dc.description.abstractWe present the first Universal Dependencies treebank for Uzbek, a low-resource language from the Turkic family. The treebank contains 500 sentences (5850 tokens) sourced from the news and fiction genres and it is annotated for lemmas, part-of-speech (POS) tags, morphological features, and dependency relations. We describe our methodology for building the treebank, which consists of a mix of manual and automatic annotation and discuss some constructions of the Uzbek language that pose challenges to the UD framework.
dc.description.urihttps://aclanthology.org/2025.resourceful-1.0/
dc.identifier.urihttps://hdl.handle.net/10062/107109
dc.language.isoen
dc.publisherUniversity of Tartu Library
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleUniversal Dependencies Treebank for Uzbek
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
2025_resourceful_1_1.pdf
Suurus:
109.51 KB
Formaat:
Adobe Portable Document Format