Exploratory Swedish text analysis using notebooks – a smörgårdsbord of basic corpus linguistic insights

dc.contributor.authorKokkinakis, Dimitrios
dc.contributor.authorBouma, Gerlof
dc.contributor.editorBouma, Gerlof
dc.contributor.editorDannélls, Dana
dc.contributor.editorKokkinakis, Dimitrios
dc.contributor.editorVolodina, Elena
dc.date.accessioned2025-11-10T12:41:59Z
dc.date.available2025-11-10T12:41:59Z
dc.date.issued2025-11
dc.description.abstractThe computational notebook has established itself as a significant tool for conducting exploratory data analysis, which aims at investigating characteristics of a dataset without preformulated expectations. Computational notebooks are a type of interactive document, that supports mixing prose, executable code and its output, such as a calculated result, a table, or a graphic. Data, process, and narrative are effectively integrated into one environment, which makes notebooks ideal for documenting exploratory research. Notebooks also facilitate sharing research in a reproducible way for teaching, collaboration or dissemination. This chapter demonstrates basic exploratory techniques for Swedish text analysis implemented as Jupyter notebooks, a popular computational notebook implementation. Using a selection of documents from a Swedish corpus of COVID-19-related materials, we show some of the kinds of text analysis that can easily be performed using readily available software libraries. The examples in this chapter rely only on automatic annotation, requiring minimal manual processing.
dc.identifier.isbn9789908536125
dc.identifier.urihttps://hdl.handle.net/10062/117355
dc.identifier.urihttps://doi.org/10.58009/aere-perennius0185
dc.language.isoen
dc.publisherUniversity of Tartu Library
dc.relation.ispartofHuminfra handbook: Empowering digital and experimental humanities
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.titleExploratory Swedish text analysis using notebooks – a smörgårdsbord of basic corpus linguistic insights
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
Huminfra_Handbook_Chapter16.pdf
Suurus:
2.09 MB
Formaat:
Adobe Portable Document Format