RedHatAI/Qwen3-14B-speculator.eagle3 • Text Generation • 1B • 123
RedHatAI/Qwen3-8B-speculator.eagle3 • Text Generation • 1B • 60.8k • 1
RedHatAI/Llama-3.3-70B-Instruct-speculator.eagle3 • Text Generation • 2B • 549 • 1
RedHatAI/Llama-3.3-70B-Instruct-NVFP4 • Text Generation • 41B • 283 • 1
RedHatAI/Llama-3.1-70B-Instruct-NVFP4 • Text Generation • 41B • 160
RedHatAI/Llama-3.1-8B-Instruct-NVFP4 • Text Generation • 5B • 14.9k
Text Generation • 19B • 9.92k • 6
Text Generation • 9B • 284
Text Generation • 5B • 954 • 1
RedHatAI/Llama-4-Scout-17B-16E-Instruct-NVFP4 • Text Generation • 64B • 672
RedHatAI/Kimi-K2-Thinking-FP8-Block • 1T • 6
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-FP8-dynamic • Image-Text-to-Text • 24B • 147k • 9
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w8a8 • Image-Text-to-Text • 24B • 329 • 5
RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16 • Image-Text-to-Text • 5B • 21.8k • 10
RedHatAI/Mistral-Small-24B-Instruct-2501-FP8-dynamic • Text Generation • 24B • 822 • 13
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w8a8 • Text Generation • 24B • 14.7k • 1
RedHatAI/Mistral-Small-24B-Instruct-2501-quantized.w4a16 • Text Generation • 4B • 15 • 1
RedHatAI/Llama-3.1-8B-Instruct-FP8-block • Text Generation • 8B • 84
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-block • Text Generation • 236B • 85 • 3
RedHatAI/Qwen3-30B-A3B-FP8-block • Text Generation • 31B • 10.6k
RedHatAI/Llama-4-Scout-17B-16E-Instruct-FP8-block • Text Generation • 109B • 50 • 3
RedHatAI/Llama-4-Maverick-17B-128E-Instruct-FP8-block • Text Generation • 402B • 2 • 1
RedHatAI/Llama-3.3-70B-Instruct-FP8-block • Text Generation • 71B • 249
RedHatAI/Qwen3-32B-FP8-block • Text Generation • 33B • 17
RedHatAI/Qwen3-14B-FP8-block • Text Generation • 15B • 34
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic • Text Generation • 71B • 14.5k • 14
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF • Text Generation • 71B • 11 • 2
RedHatAI/Llama-3.2-1B-FP8 • 1B • 29k
Image-Text-to-Text • 12B • 15 • 1
RedHatAI/Qwen3-VL-235B-A22B-Instruct-FP8-dynamic • Text Generation • 236B • 1.3k • 4