Running Featured 126 Voxtral Realtime WebGPU 💬 126 Real-time speech transcription, entirely in your browser.
Running on Zero Agents Featured 1.93k Qwen3-TTS Demo 🎙 1.93k Generate custom speech from text, voice descriptions, or samples
Running Agents 24 Audio To MIDI And Advanced Renderer 🎹 24 Audio to MIDI Transcription and Advanced render
MIT/ast-finetuned-audioset-10-10-0.4593 Audio Classification • 86.6M • Updated Sep 6, 2023 • 390k • 356
Running on Zero Agents Featured 2.51k Qwen Image Multiple Angles 3D Camera 🎥 2.51k Transform image viewpoint with adjustable camera angles