Browsing by Author "Solberg, Per Erik"

Now showing 1 - 3 of 3
  • Adding Metadata to Existing Parliamentary Speech Corpus
    (University of Tartu Library, 2025-03) Parsons, Phoebe; Solberg, Per Erik; Kvale, Knut; Svendsen, Torbjørn; Salvi, Giampiero; Johansson, Richard; Stymne, Sara
    Parliamentary proceedings are convenient data sources for creating corpora for speech technology. Given their public nature, there is an abundance of extra information about the speakers that can be legally and ethically harvested to enrich this kind of corpus. This paper describes the methods we have used to add speaker metadata to the Stortinget Speech Corpus (SSC), which contains over 5,000 hours of Norwegian speech with non-verbatim transcripts but without speaker metadata. The additional metadata for each speech segment includes speaker ID, gender, date of birth, municipality of birth, and counties represented. We also infer each speaker's dialect from their municipality of birth using a manually designed mapping between municipalities and Norwegian dialects. We provide observations on the SSC data and give suggestions for how it may be used for tasks other than speech recognition. Finally, we demonstrate the utility of this new metadata through a dialect identification task. The described methods can be adapted to add metadata to parliamentary corpora in other languages.
  • Building Gold-Standard Treebanks for Norwegian
    (Oslo, Norway, Linköping University Electronic Press, Sweden, pp. 459-464, 2013) Solberg, Per Erik; Oepen, Stephan; Hagen, Kristin; Johannessen, Janne Bondi
  • Improving Generalization of Norwegian ASR with Limited Linguistic Resources
    (University of Tartu Library, 2023-05) Solberg, Per Erik; Ortiz, Pablo; Parsons, Phoebe; Svendsen, Torbjørn; Salvi, Giampiero
