An Icelandic Linguistic Benchmark for Large Language Models

dc.contributor.author	Ármannsson, Bjarki
dc.contributor.author	Ingimundarson, Finnur Ágúst
dc.contributor.author	Sigurðsson, Einar Freyr
dc.contributor.editor	Johansson, Richard
dc.contributor.editor	Stymne, Sara
dc.coverage.spatial	Tallinn, Estonia
dc.date.accessioned	2025-02-17T13:50:35Z
dc.date.available	2025-02-17T13:50:35Z
dc.date.issued	2025-03
dc.description.abstract	This paper introduces a linguistic benchmark for Icelandic-language LLMs, the first of its kind manually constructed by native speakers. We report on the scores obtained by current state-of-the-art models, which indicate room for improvement, and discuss the theoretical problems involved in creating such a benchmark and scoring a model's performance.
dc.identifier.uri	https://hdl.handle.net/10062/107196
dc.language.iso	en
dc.publisher	University of Tartu Library
dc.relation.ispartofseries	NEALT Proceedings Series, No. 57
dc.rights	Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri	https://creativecommons.org/licenses/by-nc-nd/4.0/
dc.title	An Icelandic Linguistic Benchmark for Large Language Models
dc.type	Article
