arxiv:2511.07253
Umberto Cappellazzo
hisoka94
AI & ML interests
Multimodal Large Language Models and audio-visual speech processing at @ Imperial College London.
Recent Activity
authored
a paper
26 days ago
Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large
Language Models
upvoted
a
paper
26 days ago
Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large
Language Models
Organizations
None yet