daVinci-MagiHuman
Create a short video from an image and description
A cutting-edge speech generation model with stereo support
Controllable TTS via instruction prompting (JPN / Anime)
FireRed-Image-Edit ร Qwen-Image-Edit-Rapid (Transformers)
FireRed-OCR for Document Recognition
Generate spoken audio from text with custom or cloned voices