EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models Paper • 2504.04155 • Published Apr 5 • 1
Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources Paper • 2504.04152 • Published Apr 5 • 1
Massively Multilingual Adaptation of Large Language Models Using Bilingual Translation Data Paper • 2506.00469 • Published May 31 • 4
A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives Paper • 2407.15489 • Published Jul 22, 2024
Scaling Low-Resource MT via Synthetic Data Generation with LLMs Paper • 2505.14423 • Published May 20 • 2