-
-
-
-
-
-
Inference Providers
Active filters:
ModelOpt
Text Generation
•
Updated
•
36.7k
•
45
nvidia/Qwen3-Next-80B-A3B-Thinking-NVFP4
Text Generation
•
Updated
•
59.2k
•
44
nvidia/Kimi-K2-Thinking-NVFP4
Text Generation
•
Updated
•
112k
•
26
nvidia/Qwen3-Next-80B-A3B-Instruct-NVFP4
Text Generation
•
Updated
•
14.5k
•
25
nvidia/DeepSeek-R1-0528-NVFP4-v2
Text Generation
•
394B
•
Updated
•
118k
•
14
nvidia/Qwen3-235B-A22B-Thinking-2507-NVFP4
Text Generation
•
Updated
•
193
•
3
vincentzed-hf/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
•
1.73k
•
3
nvidia/DeepSeek-R1-0528-NVFP4
Text Generation
•
397B
•
Updated
•
15.2k
•
41
nvidia/Qwen3-235B-A22B-NVFP4
Text Generation
•
133B
•
Updated
•
5.03k
•
14
nvidia/gpt-oss-120b-Eagle3-long-context
Text Generation
•
0.2B
•
Updated
•
7.68k
•
58
nvidia/Phi-4-multimodal-instruct-NVFP4
4B
•
Updated
•
3.43k
•
7
Text Generation
•
15B
•
Updated
•
3.2k
•
3
nvidia/Llama-3.3-70B-Instruct-Eagle3
Text Generation
•
Updated
•
40
•
1
nvidia/DeepSeek-V3.2-NVFP4
Text Generation
•
394B
•
Updated
•
8.82k
•
5
nvidia/Qwen3-235B-A22B-Instruct-2507-NVFP4
Text Generation
•
120B
•
Updated
•
1.15k
•
2
nvidia/Qwen3-Coder-480B-A35B-Instruct-NVFP4
Text Generation
•
241B
•
Updated
•
209
•
1
vincentzed-hf/Kimi-K2.5-MXFP8
Image-Text-to-Text
•
1T
•
Updated
•
19
•
1
Cirrascale/Qwen3-Coder-Next-NVFP4
Text Generation
•
Updated
•
24
•
1
nvidia/DeepSeek-V3-0324-NVFP4
Text Generation
•
397B
•
Updated
•
83.7k
•
14
NVFP4/DeepSeek-Prover-V2-7B-FP4
4B
•
Updated
•
107
•
1
NVFP4/DeepSeek-R1-0528-Qwen3-8B-FP4
5B
•
Updated
•
156
•
1
Text Generation
•
19B
•
Updated
•
339
•
4
NVFP4/Polaris-4B-Preview-FP4
2B
•
Updated
•
3
NVFP4/Polaris-7B-Preview-FP4
5B
•
Updated
•
3
•
1
nvidia/Qwen3-235B-A22B-FP8
Text Generation
•
235B
•
Updated
•
2.37k
•
3
nvidia/Qwen3-30B-A3B-NVFP4
Text Generation
•
16B
•
Updated
•
34.5k
•
23
tachyphylaxis/DeepSeek-R1-0528-FP4
Text Generation
•
397B
•
Updated
•
4
nvidia/DeepSeek-R1-NVFP4-v2
Text Generation
•
394B
•
Updated
•
3.07k
•
5
NVFP4/Qwen3-235B-A22B-Instruct-2507-FP4
Text Generation
•
118B
•
Updated
•
1.57k
•
3
NVFP4/Qwen3-Coder-480B-A35B-Instruct-FP4
Text Generation
•
241B
•
Updated
•
560
•
2