Freedom House'i demokraatia indeksi ühtsuse analüüs BERT keelemudeliga [An Analysis of the Consistency of Freedom House's Democracy Index Using the BERT Language Model]

dc.contributor.advisor: Vits, Kristel, supervisor
dc.contributor.advisor: Kangur, Uku, supervisor
dc.contributor.author: Kostabi, Karl Hans
dc.contributor.other: Tartu Ülikool. Sotsiaalteaduste valdkond (et)
dc.contributor.other: Tartu Ülikool. Johan Skytte poliitikauuringute instituut (et)
dc.date.accessioned: 2024-07-04T08:56:15Z
dc.date.available: 2024-07-04T08:56:15Z
dc.date.issued: 2024
dc.description.abstract: This research examines Freedom House's democracy index, Freedom in the World (FITW). Academic literature has shown that democracy indices have many problems: they have been found to be biased and to have poor aggregation rules. The study investigates FITW using a machine learning-trained BERT language model, exploring how machine learning has been used in the social sciences and its potential for further applications. BERT is used to create embeddings, which are vector representations of texts. Embeddings allow for a qualitative examination of any text. Since the FITW index uses descriptive texts, these texts can be examined. Hence, this research examined how FITW descriptions are tied to their scores. The hypothesis was that the descriptive texts that are most similar to one another are also the ones with the highest scores. The empirical findings demonstrate that texts and scores are indeed linked. This was done by creating a score difference index for all questions. Next, the score difference among the top 1% most similar texts was calculated. This showed that scores and texts are most strongly linked when the scores are high, and most weakly linked when the scores are low. The t-SNE method was used to show the embeddings in 2-D projections. Through this, the scores and texts were also shown to be linked visually. Visualizations were also used to show that texts from the same regions of the world cluster together. Further geographical analysis revealed that countries generally cluster according to their geographical locations, especially in the Americas and Europe. Overall, the study concludes that FITW descriptions are strongly linked to scores when scores are high, and that the link weakens as scores decrease. Geographically, FITW descriptions are most consistent in the Americas and Europe, while the index is less uniform in Africa. (en)
dc.identifier.uri: https://hdl.handle.net/10062/101303
dc.language.iso: et
dc.publisher: Tartu Ülikool (et)
dc.rights: Attribution-NonCommercial-NoDerivs 3.0 Estonia (en)
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/3.0/ee/
dc.subject.other: bakalaureusetööd (et)
dc.title: Freedom House'i demokraatia indeksi ühtsuse analüüs BERT keelemudeliga (et)
dc.type: Thesis (en)
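
The abstract describes comparing the scores of the most similar descriptive texts. The following is a minimal sketch of that idea, not the thesis's actual pipeline, which this record does not include. It assumes cosine similarity as the similarity measure (the abstract does not name one), uses hand-written toy vectors in place of real BERT embeddings, and the function names and the `top_fraction` parameter are hypothetical.

```python
import math
from itertools import combinations


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_score_gap_of_most_similar(embeddings, scores, top_fraction=0.01):
    """Mean absolute score difference over the most similar text pairs.

    embeddings: one fixed-length vector per descriptive text (assumed to
                come from a BERT-style encoder; any vectors work here).
    scores:     numeric scores aligned with `embeddings`.
    top_fraction: share of all pairs to keep, ranked by similarity
                  (0.01 mirrors the "top 1%" mentioned in the abstract).
    """
    # Similarity and score gap for every unordered pair of texts.
    pairs = [
        (cosine_similarity(embeddings[i], embeddings[j]),
         abs(scores[i] - scores[j]))
        for i, j in combinations(range(len(embeddings)), 2)
    ]
    # Most similar pairs first; always keep at least one pair.
    pairs.sort(key=lambda p: p[0], reverse=True)
    keep = max(1, int(len(pairs) * top_fraction))
    top = pairs[:keep]
    return sum(gap for _, gap in top) / len(top)


# Toy data (hypothetical): four "texts" with 2-D stand-in embeddings.
toy_embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.2, 0.8]]
toy_scores = [4, 4, 0, 1]

# With 6 pairs and top_fraction=0.34, the 2 most similar pairs are kept.
result = mean_score_gap_of_most_similar(toy_embeddings, toy_scores, top_fraction=0.34)
print(result)
```

A small mean gap among the most similar pairs would suggest that similar descriptions receive similar scores; repeating the calculation separately for high-scoring and low-scoring texts is one way to compare how strong that link is across the score range.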

Files

Original bundle
Name: kostabi_karl_ba_2024.pdf
Size: 1.62 MB
Format: Adobe Portable Document Format