Federated Meta-Learning for Low-Resource Translation of Kirundi

Sang, Kyle Rui; Rabbani, Tahseen; Zhou, Tianyi

Files

2025_resourceful_1_34.pdf (246.35 KB)

Date

2025-03

Authors

Sang, Kyle Rui

Rabbani, Tahseen

Zhou, Tianyi

Publisher

University of Tartu Library

Abstract

In this work, we reframe multilingual neural machine translation (NMT) as a federated meta-learning problem and introduce a translation dataset for the low-resource Kirundi language. We aggregate machine translation models locally trained on varying (but related) source languages to produce a global meta-model that encodes abstract representations of key semantic structures relevant to the parent languages. We then use the Reptile algorithm and Optuna fine-tuning to fit the global model to a target language. The target language may lie outside the set of parent languages (for example, a closely related dialect or sibling language), which is particularly useful for languages with few available sentence pairs. We first develop a novel dataset of Kirundi-English sentence pairs curated from Biblical translations. We then demonstrate that a federated learning approach can produce a tiny 4.8M Kirundi translation model and a stronger NLLB-600M model that performs well on both our Biblical corpus and the FLORES-200 Kirundi corpus.
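
The two steps named in the abstract, aggregating locally trained models and then adapting the result with Reptile, can be illustrated with a toy sketch. This is not the authors' implementation: the unweighted averaging rule, the quadratic stand-in loss, the function names, and all hyperparameter values are assumptions chosen for illustration, and the Optuna tuning step is not shown.

```python
from typing import Callable, List

Vector = List[float]


def federated_average(client_weights: List[Vector]) -> Vector:
    """Unweighted mean of client parameter vectors (an assumed aggregation rule)."""
    n = len(client_weights)
    return [sum(ws) / n for ws in zip(*client_weights)]


def inner_sgd(theta: Vector, grad_fn: Callable[[Vector], Vector],
              lr: float, steps: int) -> Vector:
    """Run a few plain SGD steps on one task, starting from theta."""
    phi = list(theta)
    for _ in range(steps):
        grad = grad_fn(phi)
        phi = [p - lr * g for p, g in zip(phi, grad)]
    return phi


def reptile_step(theta: Vector, grad_fn: Callable[[Vector], Vector],
                 inner_lr: float = 0.1, inner_steps: int = 5,
                 meta_lr: float = 0.5) -> Vector:
    """Reptile outer update: move theta part of the way toward the adapted weights."""
    phi = inner_sgd(theta, grad_fn, inner_lr, inner_steps)
    return [t + meta_lr * (p - t) for t, p in zip(theta, phi)]


def make_quadratic_grad(target: Vector) -> Callable[[Vector], Vector]:
    """Gradient of the toy loss 0.5 * ||w - target||^2 (stand-in for a translation loss)."""
    return lambda w: [wi - ti for wi, ti in zip(w, target)]


# Two hypothetical parent-language models, reduced to 2-parameter vectors.
global_model = federated_average([[0.0, 2.0], [2.0, 4.0]])

# Adapt the global model toward a hypothetical target-language optimum.
adapted = reptile_step(global_model, make_quadratic_grad([3.0, 3.0]))
```

In this toy setting, each call to `reptile_step` moves the global parameters toward the target task's optimum without jumping all the way there, which is the interpolation behaviour that distinguishes Reptile from ordinary fine-tuning.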

URI

https://hdl.handle.net/10062/107131

Collections

Proceedings of the Third Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2025)
