Modular Septilingual Neural Machine Translation
dc.contributor.advisor | Tättar, Andre, juhendaja | |
dc.contributor.advisor | Korotkova, Elizaveta, juhendaja | |
dc.contributor.author | Purason, Taido | |
dc.contributor.other | Tartu Ülikool. Loodus- ja täppisteaduste valdkond | et |
dc.contributor.other | Tartu Ülikool. Arvutiteaduse instituut | et |
dc.date.accessioned | 2023-09-13T11:12:33Z | |
dc.date.available | 2023-09-13T11:12:33Z | |
dc.date.issued | 2021 | |
dc.description.abstract | Currently, the majority of state-of-the-art multilingual neural machine translation systems use a single universal model which fully shares parameters between all language pairs. The University of Tartu Neural Machine Translation system uses the universal architecture as well, and thus also suffers from the problems associated with it, such as limited capacity per language pair. Previous research has shown that a modularized approach with language-specific encoders and decoders successfully addresses many of the universal model’s shortcomings. This thesis applies the modularized architecture and improves the University of Tartu translation system. Orders of magnitude larger dataset containing 7 languages is used to train the models compared to previous work. The modularized model achieves significantly higher BLEU scores than the University of Tartu model and the baseline universal model on all language pairs. | et |
dc.identifier.uri | https://hdl.handle.net/10062/92143 | |
dc.language.iso | eng | et |
dc.publisher | Tartu Ülikool | et |
dc.rights | openAccess | et |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | machine translation | et |
dc.subject | multilingual machine translation | et |
dc.subject | neural machine translation | et |
dc.subject | neural networks | et |
dc.subject | natural language processing | et |
dc.subject.other | bakalaureusetööd | et |
dc.subject.other | informaatika | et |
dc.subject.other | infotehnoloogia | et |
dc.subject.other | informatics | et |
dc.subject.other | infotechnology | et |
dc.title | Modular Septilingual Neural Machine Translation | et |
dc.type | Thesis | et |