microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 375k • 1.6k
ibm-granite/granite-vision-3.2-2b Image-Text-to-Text • 3B • Updated 26 days ago • 3.78k • 123
nomic-ai/colnomic-embed-multimodal-7b Visual Document Retrieval • Updated Apr 15, 2025 • 5.49k • 105
Running Agents 206 Vidore Leaderboard 🥇 206 Browse and compare visual document retrieval model scores
nomic-ai/nomic-embed-multimodal-3b Visual Document Retrieval • Updated Apr 15, 2025 • 2.1k • 29
moondream/moondream-2b-2025-04-14-4bit Image-Text-to-Text • 1B • Updated May 22, 2025 • 18.3k • 68
Alibaba-NLP/gme-Qwen2-VL-2B-Instruct Sentence Similarity • 2B • Updated Jun 9, 2025 • 13.3k • 133