English-Estonian Machine Translation: Evaluation Across Different Models and Architectures

Islam, Md Rezwanul

dc.contributor.advisor	Anbarjafari, Gholamreza
dc.contributor.advisor	Sait Arslan, Hasan
dc.contributor.author	Islam, Md Rezwanul
dc.contributor.other	Tartu Ülikool. Loodus- ja täppisteaduste valdkond	et
dc.contributor.other	Tartu Ülikool. Tehnoloogiainstituut	et
dc.date.accessioned	2021-05-31T08:42:19Z
dc.date.available	2021-05-31T08:42:19Z
dc.date.issued	2020
dc.description.abstract	This thesis has three main objectives. The first is to implement the RNMT+ architecture with a Relational-RNN model, combining that architecture with a newer recurrent model. The second is to train three translation models based on the RNMT+, Transformer, and sequence-to-sequence architectures. Earlier work has compared the performance of RNMT+ with LSTM against Transformer, seq2seq, and other models. The third is to evaluate the translation models based on the training data. When implementing RNMT+, the core idea was to use a newer type of Recurrent Neural Network (RNN) instead of the widely used LSTM or GRU. In addition, we evaluate the RNMT+ model against models based on the state-of-the-art Transformer and sequence-to-sequence with attention architectures. This evaluation (BLEU) shows that neural machine translation is domain-dependent: the Transformer-based model performs better than the other two in the OpenSubtitles v2018 domain, while the RNMT+ model performs better than the other two in a cross-domain evaluation. Additionally, we compare all of the above-mentioned architectures in terms of their encoder-decoder layers and attention mechanisms, and against other available neural and statistical machine translation architectures. In Estonian: See lõputöö põhineb kolmel põhieesmärgil: alguses RNMT + arhitektuuri rakendamine Relatsioon-RNN-mudeli abil. See on interaktsioon selle arhitektuuri ja RNN-mudeli vahel. Teiseks, koolitage kolme erinevat tõlkemudelit, mis põhinevad RNMT +, Trafo ja järjestusearhitektuuridel. Varem oleme olnud tunnistajaks RNMT + jõudluse võrdlusele LSTM, Transformeri, seq2seq jne abil. Lõpuks hinnake tõlkemudelit koolitusandmete põhjal. RNMT + rakendamisel oli peamine idee kasutada laialdaselt kasutatava LSTM või GRU asemel uuemat tüüpi korduvat närvivõrku (RNN). 
Lisaks hindame RNMT + mudelit koos teiste mudelitega, mis põhinevad tipptehnoloogial Transformer ja Sequence to Sequence koos tähelepanu arhitektuuridega. See hinnang (BLEU) näitab, et neuraalne masintõlge on domeenist sõltuv ja muunduril Transformer põhinev tõlge toimib paremini kui ülejäänud kaks OpenSubtitle v2018 domeenis, samal ajal kui RNMT + mudel toimib paremini kui ülejäänud kaks domeenidevahelist hindamist. Lisaks võrdleme kõiki ülalnimetatud arhitektuure nende vastavate kodeerija-dekoodri kihtide, tähelepanu mehhanismi ja muude saadaolevate närvi masintõlke ning statistiliste masintõlke arhitektuuride põhjal.	en
dc.identifier.uri	http://hdl.handle.net/10062/72121
dc.language.iso	eng	et
dc.publisher	Tartu Ülikool	et
dc.rights	openAccess	et
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International	*
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	*
dc.subject	Neural Machine Translation	en
dc.subject	Natural Language Processing	en
dc.subject	LSTM	en
dc.subject	Relational-RNN	en
dc.subject	RNMT+	en
dc.subject	Transformer	en
dc.subject	Sequence-to-Sequence	en
dc.subject	Py-Torch	en
dc.subject	Encoder-Decoder	en
dc.subject	Attention	en
dc.subject	Evaluation	en
dc.subject	Neuraalne masintõlge	et
dc.subject	loomuliku keele töötlemine	et
dc.subject	relatsiooniline-RNN	et
dc.subject	trafo	et
dc.subject	jada-järjestus	et
dc.subject	Py-taskulamp	et
dc.subject	kooder-dekooder	et
dc.subject	tähelepanu	et
dc.subject	hindamine	et
dc.subject.other	magistritööd	et
dc.title	English-Estonian Machine Translation: Evaluation Across Different Models and Architectures	en
dc.title.alternative	Inglise-eesti masintõlge: hindamine erinevate mudelite ja arhitektuuri	et
dc.type	info:eu-repo/semantics/masterThesis	et

Files

Original bundle

Name: Rezwanul_Islam_MSc2020.pdf
Size: 1.03 MB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 1.67 KB
Format: Item-specific license agreed upon to submission

Collections

Robotics and Computer Engineering - Master's theses