Eesti keele digitaalsete ressursside ja tehnoloogiate rakendamine teksti lihtsustamise programmis
Files
Date
2017
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Käesoleva bakalaureusetöö eesmärk oli uurida teksti lihtsustamise meetodeid ning luua veebipõhine rakendus, mis lihtsustaks eestikeelset teksti. Rakenduse loomiseks kasutati keeleressursse, nagu Eesti Wordnet, word2vec’i mudel, sagedusloend, võõrsõnade leksikon ja põhisõnavara sõnastik ning nendega leitakse sõnade keerukus ning sobivus teksti.
The purpose of this Bachelor’s thesis was to research text simplification methods and to create a web-based application to simplify Estonian texts. The web application uses language resources such as the Estonian Wordnet, word2vec model, frequency dictionary, foreign word dictionary and basic vocabulary dictionary, which are used to identify word complexity and suitability to the text.
The purpose of this Bachelor’s thesis was to research text simplification methods and to create a web-based application to simplify Estonian texts. The web application uses language resources such as the Estonian Wordnet, word2vec model, frequency dictionary, foreign word dictionary and basic vocabulary dictionary, which are used to identify word complexity and suitability to the text.