Show simple item record

dc.contributor.advisorAnbarjafari, Gholamreza, supervisor
dc.contributor.authorHook, Joosep
dc.contributor.otherTartu Ülikool. Loodus- ja täppisteaduste valdkondet
dc.contributor.otherTartu Ülikool. Tehnoloogiainstituutet
dc.date.accessioned2018-06-04T08:39:14Z
dc.date.available2018-06-04T08:39:14Z
dc.date.issued2018
dc.identifier.urihttp://hdl.handle.net/10062/60295
dc.description.abstractThe main objectives of affective computing is the study and creation of computer systems which can detect human affects. For speech-based emotion recognition, universal features offering the best performance for all languages have not yet been found. In this thesis, a speech-based emotion recognition system using a novel set of features is created. Support vector machines are used as classifiers in the offline system on Surrey Audio-Visual Expressed Emotion database, Berlin Database of Emotional Speech, Polish Emotional Speech database and Serbian emotional speech database. Average emotion recognition rates of 80.21%, 88.6%, 75.42% and 93.41% are achieved, respectively, with a total number of 87 features. The online system, which uses Random Forests as it’s classifier, consists of two models trained on reduced versions of the first and second database, with the first model trained on only male samples and the second trained on both. The main purpose of the online system was to test the features’ usability in real-life scenarios and to explore the effects of gender in speech-based emotion recognition. To test the online system, two female and two male non-native English speakers recorded emotionally spoken sentences and used these as inputs to the trained model. Averaging over all emotions and speakers per model, it is seen that the features offer better performance than random guessing, achieving 28% emotion recognition in both models. The average recognition rate for female speakers was 19% in the first and 29% in the second model. For male speakers, the rates were 36% and 28%, respectively. These results show how having more samples for training for a particular gender affects emotion recognition rates in a trained model.en
dc.language.isoenget
dc.publisherTartu Ülikoolet
dc.rightsopenAccesset
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 Estonia*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/ee/*
dc.subjectkõnepõhine emotsioonituvastuset
dc.subjectsupport vector machinesen
dc.subjectspeech emotion recognitionen
dc.subjectrandom forestsen
dc.subjecttugivektorklassifitseerijaet
dc.subjectotsustusmetset
dc.subject.otherbakalaureusetöödet
dc.subject.othermasinõpeet
dc.subject.othermachine learningen
dc.titleAutomatic Speech-based Emotion Recognitionen
dc.title.alternativeautomaatne kõnepõhine emotsioonituvastuset
dc.typeThesisen


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record

openAccess
Except where otherwise noted, this item's license is described as openAccess