Sirvi Autor "Charpentier, Lucas Georges Gabriel" järgi
Nüüd näidatakse 1 - 2 2
- Tulemused lehekülje kohta
- Sorteerimisvalikud
Kirje BRENT: Bidirectional Retrieval Enhanced Norwegian Transformer(University of Tartu Library, 2023-05) Charpentier, Lucas Georges Gabriel; Wold, Sondre; Samuel, David; Rønningstad, EgilKirje Small Languages, Big Models: A Study of Continual Training on Languages of Norway(University of Tartu Library, 2025-03) Samuel, David; Mikhailov, Vladislav; Velldal, Erik; Øvrelid, Lilja; Charpentier, Lucas Georges Gabriel; Kutuzov, Andrey; Oepen, Stephan; Johansson, Richard; Stymne, SaraTraining large language models requires vast amounts of data, posing a challenge for less widely spoken languages like Norwegian and even more so for truly low-resource languages like Northern Sámi. To address this issue, we present a novel three-stage continual training approach that substantially improves the downstream performance together with the inference efficiency for the target languages. Based on our findings, we train, evaluate, and openly release a new generative language model for Norwegian Bokmål, Nynorsk, and Northern Sámi with 11.4 billion parameters: NorMistral-11B.