mistralai/Voxtral-Mini-4B-Realtime-2602 Automatic Speech Recognition β’ 4B β’ Updated Mar 11 β’ 869k β’ 808
Running on Zero Featured 1.85k Qwen3-TTS Demo π 1.85k Generate speech from text with custom voice, cloning, or presets
Configuration error Featured 131 Ministral WebGPU β‘ 131 Frontier multimodal AI, running entirely in your browser.
Running on CPU Upgrade 1.01k Open VLM Leaderboard π 1.01k VLMEvalKit Evaluation Results Collection
Running on Zero MCP 405 Multimodal OCR π 405 Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
docling-project/SmolDocling-256M-preview Image-Text-to-Text β’ Updated Sep 17, 2025 β’ 55.4k β’ 1.61k
Running on Zero Featured 1.76k Dia 1.6B π― 1.76k Generate realistic dialogue from a script, using Dia!