UAM-CSI at MultiGEC-2025: Parameter-efficient LLM Fine-tuning for Multilingual Grammatical Error Correction

dc.contributor.author: Staruch, Ryszard
dc.contributor.editor: Muñoz Sánchez, Ricardo
dc.contributor.editor: Alfter, David
dc.contributor.editor: Volodina, Elena
dc.contributor.editor: Kallas, Jelena
dc.coverage.spatial: Tallinn, Estonia
dc.date.accessioned: 2025-02-17T10:37:13Z
dc.date.available: 2025-02-17T10:37:13Z
dc.date.issued: 2025-03
dc.description.abstract: This paper describes the UAM-CSI team's solution to the shared task on Multilingual Grammatical Error Correction (MultiGEC-2025), which is part of the workshop on Natural Language Processing for Computer-Assisted Language Learning (NLP4CALL). The shared task covers 12 languages: Czech, English, Estonian, German, Greek, Icelandic, Italian, Latvian, Russian, Slovene, Swedish and Ukrainian. The aim of the task is to correct errors in the provided texts. Our system is a google/gemma-2-9b-it model with two QLoRA adapters, one for the minimal-edit track and one for the fluency-edit track. Our solution achieves the best performance on the test sets on the GLEU and F0.5 metrics for all languages, and the best performance on the Scribendi Score metric for all languages except Greek in the minimal-edit track.
dc.identifier.uri: https://hdl.handle.net/10062/107168
dc.language.iso: en
dc.publisher: University of Tartu Library
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: UAM-CSI at MultiGEC-2025: Parameter-efficient LLM Fine-tuning for Multilingual Grammatical Error Correction
dc.type: Article

Files

Original bundle

Name: 2025_nlp4call_1_3.pdf
Size: 213.23 KB
Format: Adobe Portable Document Format