Andmepunktide panuse visualiseerimine binaarklassifitseerija kaos

Paal, Magnus

Andmepunktide panuse visualiseerimine binaarklassifitseerija kaos

Failid

Magnus_Paal_Ba.pdf (902.61 KB)

Kuupäev

2021

Autorid

Paal, Magnus

Kirjastaja

Tartu Ülikool

Abstrakt

The decrease of a classifier’s loss function’s output is one way to know if a classifier is improving. The output of a loss function which is also known as loss is just one value and doesn’t give a complete overview of the classifier and dataset as a whole. The aim of this thesis was to find a way how to interpret loss through datapoints and visualize it. The visualizations found can help to grasp how each datapoint contributes in the whole loss. These visualizations could be used to find out which sets of datapoints contribute the most in loss, the ones whose predicted value is farther from their actual value and which make up a smaller number of points, or those whose predicted value is closer to the actual value and which make up a bigger number of points. Secondly these visualizations could be used to compare the different losses of two classifiers and find out which datapoints are the ones that contribute most in that difference. Lastly the visualizations could be used to find out which datapoints with which features contribute the most in a loss.

Märksõnad

masinõpe, binaarne klassifitseerimine, kaofunktsioon, Brier’i skoor, ristentroopia

URI

https://hdl.handle.net/10062/92165

Kollektsioonid

MTAT bakalaureusetööd – Bachelor's theses

Kirje täielik lehekülg

Andmepunktide panuse visualiseerimine binaarklassifitseerija kaos

Failid

Kuupäev

Autorid

Ajakirja pealkiri

Ajakirja ISSN

Köite pealkiri

Kirjastaja

Abstrakt

Kirjeldus

Märksõnad

Viide

URI

Kollektsioonid