How Well do LLMs know Finno-Ugric Languages? A Systematic Assessment
dc.contributor.author | Kuulmets, Hele-Andra | |
dc.contributor.author | Purason, Taido | |
dc.contributor.author | Fishel, Mark | |
dc.contributor.editor | Johansson, Richard | |
dc.contributor.editor | Stymne, Sara | |
dc.coverage.spatial | Tallinn, Estonia | |
dc.date.accessioned | 2025-02-18T09:35:07Z | |
dc.date.available | 2025-02-18T09:35:07Z | |
dc.date.issued | 2025-03 | |
dc.description.abstract | We present a systematic evaluation of multilingual capabilities of open large language models (LLMs), specifically focusing on five Finno-Ugric (FiU) languages. Our investigation covers multiple prompting strategies across several benchmarks and reveals that Llama-2 7B and Llama-2 13B perform weakly on most FiU languages. In contrast, Llama 3.1 models show impressive improvements, even for extremely low-resource languages such as Võro and Komi, indicating successful cross-lingual knowledge transfer inside the models. Finally, we show that stronger base models outperform weaker, language-adapted models, thus emphasizing the importance of base model in successful language adaptation. | |
dc.identifier.uri | https://hdl.handle.net/10062/107228 | |
dc.language.iso | en | |
dc.publisher | University of Tartu Library | |
dc.relation.ispartofseries | NEALT Proceedings Series, No. 57 | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.title | How Well do LLMs know Finno-Ugric Languages? A Systematic Assessment | |
dc.type | Article |
Failid
Originaal pakett
1 - 1 1