nm-testing/Llama-2-7b-hf-gsm8k-quant_w4a16_sym-compressed
1B • Updated • 3
nm-testing/Llama-2-7b-hf-gsm8k-gptq_w4a16_sym-uncompressed
7B • Updated • 2
nm-testing/Llama-2-7b-hf-gsm8k-gptq_w4a16_sym-compressed
1B • Updated • 3
nm-testing/Llama-2-7b-hf-gsm8k-awq_w4a16_sym-uncompressed
7B • Updated • 5
nm-testing/Llama-2-7b-hf-gsm8k-awq_w4a16_sym-compressed
1B • Updated • 2
nm-testing/Llama-2-7b-hf-gsm8k-awq_gptq_sym-uncompressed
7B • Updated • 2
nm-testing/Llama-2-7b-hf-gsm8k-awq_gptq_sym-compressed
1B • Updated • 3
nm-testing/Mixtral-8x7B-Instruct-v0.1-FP8-Dynamic
47B • Updated • 6
nm-testing/Llama-3.1-8B-Instruct-W4A16-G128-shared-pipeline
2B • Updated • 4
nm-testing/Qwen2-VL-2B-Instruct-FP8-dynamic-cli
2B • Updated • 2
nm-testing/Qwen2-VL-2B-Instruct-FP8_DYNAMIC
Image-Text-to-Text
• 2B • Updated • 2
nm-testing/whisper-large-v3-quantized.w4a16
0.3B • Updated • 3
nm-testing/whisper-large-v3-quantized.w8a8_sq
2B • Updated • 2
nm-testing/whisper-large-v3-quantized.w8a8
2B • Updated • 3
nm-testing/llama2.c-stories110M-gsm8k-fp8_dynamic-compressed
0.1B • Updated • 3.53k
nm-testing/llama2.c-stories110M-gsm8k-recipe_w4a16_actorder_weight-compressed
60.5M • Updated • 3.63k
nm-testing/Llama-3.2-1B-Instruct-W4A16-uncompressed-mse-hadamard
5B • Updated • 2
nm-testing/llama2.c-stories15M
Text Generation
• 24.4M • Updated • 6.25k
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-kv_cache-qkv_proj
8B • Updated • 2
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation-q_proj
8B • Updated • 2
nm-testing/Meta-Llama-3-8B-Instruct-FP8-channel-output-activation
8B • Updated • 2
nm-testing/Llama-3.2-1B-W4A16-Transforms
4B • Updated • 4
nm-testing/Ministral-8B-Instruct-2410-FP8-dynamic
8B • Updated • 3
nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-asym
0.3B • Updated • 3
nm-testing/Phi-4-mini-instruct-quantized.w4a16.asymmetric
2B • Updated • 15
nm-testing/Qwen1.5-MoE-A2.7B-Chat-quantized.w4a16
14B • Updated • 101k
• 1
nm-testing/Moonlight-16B-A3B.w4a16
3B • Updated • 2
nm-testing/output_llama7b_2of4_w4a16_channel-main
Updated
nm-testing/output_llama7b_2of4_w4a16_channel-refac
Updated
nm-testing/quantization_2of4_sparse_w4a16
Updated