An Icelandic Linguistic Benchmark for Large Language Models

dc.contributor.authorÁrmannsson, Bjarki
dc.contributor.authorIngimundarson, Finnur Ágúst
dc.contributor.authorSigurðsson, Einar Freyr
dc.contributor.editorJohansson, Richard
dc.contributor.editorStymne, Sara
dc.coverage.spatialTallinn, Estonia
dc.date.accessioned2025-02-17T13:50:35Z
dc.date.available2025-02-17T13:50:35Z
dc.date.issued2025-03
dc.description.abstractThis paper introduces a linguistic benchmark for Icelandic-language LLMs, the first of its kind manually constructed by native speakers. We report on the scores obtained by current state-of-the-art models, which indicate room for improvement, and discuss the theoretical problems involved in creating such a benchmark and scoring a model's performance.
dc.identifier.urihttps://hdl.handle.net/10062/107196
dc.language.isoen
dc.publisherUniversity of Tartu Library
dc.relation.ispartofseriesNEALT Proceedings Series, No. 57
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.titleAn Icelandic Linguistic Benchmark for Large Language Models
dc.typeArticle

Failid

Originaal pakett

Nüüd näidatakse 1 - 1 1
Laen...
Pisipilt
Nimi:
2025_nodalida_1_5.pdf
Suurus:
128.02 KB
Formaat:
Adobe Portable Document Format