Text Datasets for Evaluation Collection Collection of datasets in Galician for LLM evaluation. It includes translations from already existing datasets as well as datasets created by us. • 18 items • Updated 3 days ago
CorpusNÓS: A massive Galician corpus for training LLM Collection CorpusNÓS is the largest collection of data in Galician language for training LLM. • 1 item • Updated 8 days ago
Domain Specific Corpora Collection Collection of corpora prepared from specific domains mainly in Galician language. • 4 items • Updated 8 days ago