Navigating Swedish Salafism: Large language model-augmented content detection and topic modeling using BERTopic with YouTube metadata
| dc.contributor.author | Svensson, Jonas | |
| dc.contributor.editor | Bouma, Gerlof | |
| dc.contributor.editor | Dannélls, Dana | |
| dc.contributor.editor | Kokkinakis, Dimitrios | |
| dc.contributor.editor | Volodina, Elena | |
| dc.date.accessioned | 2025-11-10T11:17:41Z | |
| dc.date.available | 2025-11-10T11:17:41Z | |
| dc.date.issued | 2025-11 | |
| dc.description.abstract | The chapter suggests and provides an example of a Large Language Model (LLM)-augmented method for gaining a quick overview of large sets of YouTube videos using metadata collected through the YouTube API. The case chosen is the Swedish Salafist YouTube channel islam.nu that houses 1 680 videos. An LLM (GPT-4o mini) is given a prompt to guess the content of videos based on information given in their titles and descriptions. These guesses are then used in an LLM-augmented topic modeling process utilizing the Python library BERTopic and the HUMINFRA resource, the Swedish Royal Library’s sentence-transformers model “sentence-bert-swedish-cased”. The videos thus placed under topics are then again subjected to processing by an LLM, to produce easy-to-read representations of the topics. This method provides a convenient way to quickly understand the content of YouTube video sets and can serve as a first step in a purposive sampling procedure. | |
| dc.identifier.isbn | 9789908536125 | |
| dc.identifier.uri | https://hdl.handle.net/10062/117346 | |
| dc.identifier.uri | https://doi.org/10.58009/aere-perennius0176 | |
| dc.language.iso | en | |
| dc.publisher | University of Tartu Library | |
| dc.relation.ispartof | Huminfra handbook: Empowering digital and experimental humanities | |
| dc.rights | Attribution 4.0 International | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.title | Navigating Swedish Salafism: Large language model-augmented content detection and topic modeling using BERTopic with YouTube metadata | |
| dc.type | Article |
Files
Original bundle
- Name: Huminfra_Handbook_Chapter7.pdf
- Size: 349.94 KB
- Format: Adobe Portable Document Format