WavAlign: Enhancing Intelligence and Expressiveness in Spoken Dialogue Models via Adaptive Hybrid Post-Training Paper • 2604.14932 • Published 9 days ago • 10
Running on Zero Agents Featured 1.89k Qwen3-TTS Demo 🎙 1.89k Generate speech audio from text with custom or cloned voices
ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting Paper • 2504.20630 • Published Apr 29, 2025 • 9
ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting Paper • 2504.20630 • Published Apr 29, 2025 • 9
Versatile Framework for Song Generation with Prompt-based Control Paper • 2504.19062 • Published Apr 27, 2025 • 6