microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 4 days ago • 364k • 1.55k
microsoft/Phi-4-multimodal-instruct-onnx Automatic Speech Recognition • Updated 14 days ago • 138 • 83
huihui-ai/Phi-4-multimodal-instruct-abliterated Automatic Speech Recognition • 6B • Updated Mar 3 • 125 • 26
r-g2-2024/Llama-3.1-70B-Instruct-multimodal-JP-Graph-v0.1 Visual Question Answering • 71B • Updated Jul 30 • 621 • 18
google/pix2struct-widget-captioning-large Visual Question Answering • 1B • Updated Apr 10, 2024 • 65 • 20