Running on A10G Featured 215 faster-qwen3-tts 🎙 215 Generate natural speech from text and voice samples
Running on T4 Agents Featured 467 Parakeet-TDT-0.6b-V2 467 Transcribe audio files with timestamps and download transcripts
Running on CPU Upgrade Featured 3.11k The Smol Training Playbook 📚 3.11k The secrets to building world-class LLMs
Running Featured 88 Parakeet STT Progressive Transcription 🎤 88 Transcribe speech to text instantly with WebGPU acceleration
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 6.6M • • 2.95k
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations Paper • 2108.01073 • Published Aug 2, 2021 • 9
Running on Zero Agents Featured 131 Qwen3-ASR Demo 🎙 131 Transcribe audio to text with multi-language timestamps
Running on CPU Upgrade Agents 1.46k Omni Image Editor 🖼 1.46k Image edit, text to image, image upscale, remove watermark
Running on Zero Agents Featured 1.88k Qwen3-TTS Demo 🎙 1.88k Generate speech audio from text with custom or cloned voices