Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 169
google/embeddinggemma-300m Sentence Similarity • 0.3B • Updated Sep 25, 2025 • 1.62M • 1.64k
MuCo Collection MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model [CVPR 2026] • 4 items • Updated 30 days ago • 2
MuCo: Multi-turn Contrastive Learning for Multimodal Embedding Model Paper • 2602.06393 • Published Feb 6 • 3