An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex
- MOSS Audio 8B Thinking
  26 • Generate answers to audio or video prompts
- OpenMOSS-Team/MOSS-Audio-4B-Instruct
  Audio-Text-to-Text • 5B • Updated • 3.76k • 72
- OpenMOSS-Team/MOSS-Audio-4B-Thinking
  Audio-Text-to-Text • 5B • Updated • 1.69k • 31
- OpenMOSS-Team/MOSS-Audio-8B-Instruct
  Audio-Text-to-Text • 9B • Updated • 1.34k • 42