An Icelandic Linguistic Benchmark for Large Language Models
dc.contributor.author | Ármannsson, Bjarki | |
dc.contributor.author | Ingimundarson, Finnur Ágúst | |
dc.contributor.author | Sigurðsson, Einar Freyr | |
dc.contributor.editor | Johansson, Richard | |
dc.contributor.editor | Stymne, Sara | |
dc.coverage.spatial | Tallinn, Estonia | |
dc.date.accessioned | 2025-02-17T13:50:35Z | |
dc.date.available | 2025-02-17T13:50:35Z | |
dc.date.issued | 2025-03 | |
dc.description.abstract | This paper introduces a linguistic benchmark for Icelandic-language LLMs, the first of its kind manually constructed by native speakers. We report on the scores obtained by current state-of-the-art models, which indicate room for improvement, and discuss the theoretical problems involved in creating such a benchmark and scoring a model's performance. | |
dc.identifier.uri | https://hdl.handle.net/10062/107196 | |
dc.language.iso | en | |
dc.publisher | University of Tartu Library | |
dc.relation.ispartofseries | NEALT Proceedings Series, No. 57 | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.title | An Icelandic Linguistic Benchmark for Large Language Models | |
dc.type | Article |
Files
Original bundle