MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence Paper • 2508.13992 • Published Aug 19, 2025 • 7
FLiP: Towards understanding and interpreting multimodal multilingual sentence embeddings Paper • 2604.18109 • Published 4 days ago
Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs Paper • 2506.08633 • Published Jun 10, 2025
DeCRED: Decoder-Centric Regularization for Encoder-Decoder Based Speech Recognition Paper • 2508.08938 • Published Aug 12, 2025 • 11