Continuous learning for multilingual neural machine translation
| dc.contributor.advisor | Tättar, Andre, supervisor | |
| dc.contributor.advisor | Fišel, Mark, supervisor | |
| dc.contributor.author | Kolesnykov, Dmytro | |
| dc.contributor.other | Tartu Ülikool. Loodus- ja täppisteaduste valdkond | et |
| dc.contributor.other | Tartu Ülikool. Arvutiteaduse instituut | et |
| dc.date.accessioned | 2023-09-08T07:33:27Z | |
| dc.date.available | 2023-09-08T07:33:27Z | |
| dc.date.issued | 2020 | |
| dc.description.abstract | As the volume of text data grows, so does the demand for automatic translation systems. Many large companies are developing their own translation engines to compete in this field. In particular, there is a need for universal multilingual models that can, ideally, translate between any pair of languages. This work aims to build a solid multilingual translation system that continues learning from monolingual in-domain input data. The goal is to improve the performance of the multilingual NMT system and to transfer knowledge to unseen language pairs without any additional models or parallel data sources. We describe our adaptation of back-translation, a practical data-augmentation approach, to continuous learning. Results are reported for English, Russian, and Estonian using only publicly available data. | et |
| dc.identifier.uri | https://hdl.handle.net/10062/92015 | |
| dc.language.iso | eng | et |
| dc.publisher | Tartu Ülikool | et |
| dc.rights | openAccess | et |
| dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
| dc.subject | natural language processing | et |
| dc.subject | neural machine translation | et |
| dc.subject | transfer-learning | et |
| dc.subject | back-translation | et |
| dc.subject.other | master's theses | et |
| dc.subject.other | informaatika | et |
| dc.subject.other | infotehnoloogia | et |
| dc.subject.other | informatics | et |
| dc.subject.other | infotechnology | et |
| dc.title | Continuous learning for multilingual neural machine translation | et |
| dc.type | Thesis | et |