Benchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles

dc.contributor.authorTouileb, Samia
dc.contributor.authorMikhailov, Vladislav
dc.contributor.authorKroka, Marie Ingeborg
dc.contributor.authorVelldal, Erik
dc.contributor.authorØvrelid, Lilja
dc.contributor.editorJohansson, Richard
dc.contributor.editorStymne, Sara
dc.coverage.spatialTallinn, Estonia
dc.date.accessioned2025-02-19T08:42:56Z
dc.date.available2025-02-19T08:42:56Z
dc.date.issued2025-03
dc.description.abstractWe introduce a dataset of high-quality human-authored summaries of news articles in Norwegian. The dataset is intended for benchmarking of the abstractive summarisation capabilities of generative language models. Each document in the dataset is provided with three different candidate gold-standard summaries written by native Norwegian speakers and all summaries are provided in both of the written variants of Norwegian – Bokmål and Nynorsk. The paper describes details on the data creation effort as well as an evaluation of existing open LLMs for Norwegian on the dataset. We also provide insights from a manual human evaluation, comparing human-authored to model generated summaries. Our results indicate that the dataset provides a challenging LLM benchmark for Norwegian summarisation capabilities.
dc.identifier.urihttps://hdl.handle.net/10062/107266
dc.language.isoen
dc.publisherUniversity of Tartu Library
dc.relation.ispartofseriesNEALT Proceedings Series, No. 57
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleBenchmarking Abstractive Summarisation: A Dataset of Human-authored Summaries of Norwegian News Articles
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
2025_nodalida_1_73.pdf
Suurus:
338.96 KB
Formaat:
Adobe Portable Document Format