Augmenting public sector data-driven decision support systems with expert knowledge: case of OTT
dc.contributor.advisor | Solvak, Mihkel, supervisor | |
dc.contributor.advisor | Võrk, Andres, supervisor | |
dc.contributor.author | Leets, Peeter | |
dc.contributor.other | Tartu Ülikool. Sotsiaalteaduste valdkond | et |
dc.contributor.other | Tartu Ülikool. Johan Skytte poliitikauuringute instituut | et |
dc.date.accessioned | 2022-06-29T11:19:32Z | |
dc.date.available | 2022-06-29T11:19:32Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Public sector data-driven decision support systems are uniquely challenging to design because of their ramifications at the societal level. Accountability and ethical considerations require these systems to reach an equilibrium between accuracy and interpretability amid various implementation and data constraints. While these systems need to contribute to legitimate governance through reasoned and explainable decision-making, they also need to accurately model the policy outcomes they were designed to support. Unfortunately, inductive data-driven systems struggle to solve problems that rely on heuristic input. In this thesis, a particular knowledge engineering technique was adopted to augment a public sector Machine Learning decision support tool with domain expert knowledge. The case in question is OTT, a job-seeker profiling tool used by the Estonian Unemployment Insurance Fund to predict the long-term unemployment risks of its clients. Upon augmenting it with knowledge from caseworkers and data scientists associated with the project, some evidence was found that accounting for expert knowledge in probabilistic data-driven models can lead to a model that performs better on new out-of-sample data and is more in line with underlying domain rules. This has important implications for the future of Machine Learning in the public sector, as it opens up new potential use cases in areas where 1) labelled training data is hard to come by, 2) a more generalizable model is preferred due to frequent changes in the surrounding context, and 3) a model has to perfectly mimic domain logic for interpretability and explainability reasons. | en |
dc.description.uri | https://www.ester.ee/record=b5508371*est | |
dc.identifier.uri | http://hdl.handle.net/10062/82883 | |
dc.language.iso | eng | et |
dc.publisher | Tartu Ülikool | et |
dc.rights | openAccess | et |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject.other | master's theses | en |
dc.subject.other | public sector | en |
dc.subject.other | machine learning | en |
dc.subject.other | job search | en |
dc.title | Augmenting public sector data-driven decision support systems with expert knowledge: case of OTT | en |
dc.type | Thesis | et |