Lihtsate eesti keele lausete grammatika tuletamine korpusest geneetilise algoritmiga

dc.contributor.advisorKoit, Mareet
dc.contributor.authorRaudvere, Ukuet
dc.contributor.otherTartu Ülikool. Matemaatika-informaatikateaduskondet
dc.contributor.otherTartu Ülikool. Arvutiteaduse instituutet
dc.date.accessioned2013-09-09T08:45:05Z
dc.date.available2013-09-09T08:45:05Z
dc.date.issued2013et
dc.description.abstractTöö eesmärk on analüüsida geneetilisi algoritme kui tööriista grammatikate au- tomaatseks tuvastamiseks ning konkreetsemalt leida lihtsate eestikeelsete lausete grammatika. Grammatikate tuvastamine näitelausete baasil on huvitav loomuli- ke keelte analüüsi vahend juhul kui grammatika käsitsi koostamine ei ole prak- tiline, näiteks meditsiiniraportite kui valdkonnaspetsiifilise allkeele grammatika automaattuvastamine artiklis [Kat11]. Lisaks sellele on grammatikate tuletamine leidnud rakendusi andmete kadudeta pakkimise valdkonnas, näiteks [NMWM94].et
dc.description.abstractThis work concentrated on generating context-free grammars with genetic algo- rithms. Its main purpose was to generate a grammar for simple Estonian sentences. We gave an overview of some previous works in the field. These works discussed ways to represent grammars for genetic algorithms and potential problems, like staying in a local maximum or generating too liberal grammars. We also gave a brief overview of genetic algorithms and of context-free gram- mars. We evaluated our version of genetic algorithms by inducing a known grammar of three-variable algebraic expressions. We concluded that our approach is promising, but the implementation is problematic. Our experiment of inducing a grammar from a morphologically tagged corpus was not successful. We concluded that the results might be improved by using a more detailed evaluation function.et
dc.identifier.urihttp://hdl.handle.net/10062/32951
dc.language.isoetet
dc.publisherTartu Ülikoolet
dc.subject.otherbakalaureusetöödet
dc.subject.otherinformaatikaet
dc.subject.otherinfotehnoloogiaet
dc.subject.otherinformaticsen
dc.subject.otherinfotechnologyen
dc.titleLihtsate eesti keele lausete grammatika tuletamine korpusest geneetilise algoritmigaet
dc.title.alternativeInducing the grammar for simple Estonian languages using genetic algorithmset
dc.typeThesiset

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis.pdf
Size:
104.64 KB
Format:
Adobe Portable Document Format