Inference Providers
Active filters: modelopt
nvidia/Qwen3.6-35B-A3B-NVFP4
Text Generation
• 19B • Updated • 67k
• 33
Text Generation
• 382B • Updated • 10.4k
• 24
stepfun-ai/Step-3.7-Flash-NVFP4
Image-Text-to-Text
• 104B • Updated • 15.3k
• 20
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
Text Generation
• 67B • Updated • 1.39M
• 321
sakamakismile/Qwen3.6-27B-Text-NVFP4-MTP
Text Generation
• 17B • Updated • 812k
• 64
nvidia/Gemma-4-26B-A4B-NVFP4
Text Generation
• 14B • Updated • 1.23M
• 66
llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4-GGUF
Image-Text-to-Text
• 27B • Updated • 26.2k
• 19
nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4
Any-to-Any
• 18B • Updated • 1.22M
• 121
nvidia/MiniMax-M2.7-NVFP4
Text Generation
• 116B • Updated • 329k
• 50
sakamakismile/Huihui-Qwen3.6-27B-abliterated-NVFP4-MTP
Image-Text-to-Text
• 17B • Updated • 143k
• 51
nilayparikh/Qwen3.6-27B-Text-NVFP4-MTP-GGUF
Text Generation
• Updated • 3.15k
• 5
AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-Multimodal-NVFP4-MTP-XS
Text Generation
• 17B • Updated • 78.3k
• 37
Text Generation
• Updated • 693k
• 27
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8
Text Generation
• 124B • Updated • 430k
• 251
CISCai/gemma-4-31B-it-NVFP4-turbo-GGUF
Text Generation
• 31B • Updated • 14.6k
• 27
llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-Native-MTP-Preserved-NVFP4-Experts-Only
Image-Text-to-Text
• 35B • Updated • 16.8k
• 6
natfii/Qwen3.6-27B-VLM-NVFP4-MTP
Image-Text-to-Text
• 17B • Updated • 4.33k
• 2
crushleorey/Qwopus3.6-27B-v2-NVFP4
Image-Text-to-Text
• 15B • Updated • 630
• 2
llmfan46/Qwen3.5-35B-A3B-uncensored-heretic-v2-Native-MTP-Preserved-NVFP4
Image-Text-to-Text
• 35B • Updated • 301
• 2
sakamakismile/LFM2.5-8B-A1B-NVFP4
Text Generation
• 5B • Updated • 77
• 2
nvidia/Llama-4-Maverick-17B-128E-Instruct-FP8
402B • Updated • 603
• 15
nvidia/Llama-4-Scout-17B-16E-Instruct-FP8
109B • Updated • 333k
• 16
nvidia/Llama-4-Maverick-17B-128E-Eagle3
2B • Updated • 4
• 11
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
• 16B • Updated • 49.4k
• 31
nvidia/Phi-4-reasoning-plus-FP8
15B • Updated • 120
• 7
Text Generation
• 8B • Updated • 8.77k
• 6
Text Generation
• 8B • Updated • 53.5k
• 12
Text Generation
• 15B • Updated • 2.24k
• 6
Text Generation
• 17B • Updated • 52.6k
• 17
nvidia/Qwen2.5-VL-7B-Instruct-NVFP4
Text Generation
• 5B • Updated • 229k
• 16