CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation Paper • 2505.24456 • Published May 30, 2025
AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text Paper • 2503.18247 • Published Mar 24, 2025
Afri-MCQA: Multimodal Cultural Question Answering for African Languages Paper • 2601.05699 • Published Jan 9 • 2
Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches Paper • 2508.21512 • Published Aug 29, 2025
Ethio-ASR: Joint Multilingual Speech Recognition and Language Identification for Ethiopian Languages Paper • 2603.23654 • Published 20 days ago
AfrIFact: Cultural Information Retrieval, Evidence Extraction and Fact Checking for African Languages Paper • 2604.00706 • Published 12 days ago
LLM2Vec-Gen: Generative Embeddings from Large Language Models Paper • 2603.10913 • Published Mar 11 • 44
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't) Paper • 2602.14696 • Published Feb 16
A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn't) Paper • 2602.14696 • Published Feb 16
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL Paper • 2602.03773 • Published Feb 3 • 13
BhashaKritika: Building Synthetic Pretraining Data at Scale for Indic Languages Paper • 2511.10338 • Published Nov 13, 2025
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior Paper • 2512.20757 • Published Dec 23, 2025 • 18