Model Card for genevera/Qwen3-VL-32B-Instruct-Heretic-FP8-DYNAMIC
This is an FP8 quant of coder3101/Qwen3-VL-32B-Instruct-Heretic.
vllm (pretrained=genevera/Qwen3-VL-32B-Instruct-Heretic-FP8-DYNAMIC,add_bos_token=True,max_model_len=161184,dtype=bfloat16), gen_kwargs: (None), limit: 250.0, num_fewshot: 5, batch_size: auto
|Tasks|Version| Filter |n-shot| Metric | |Value| |Stderr|
|-----|------:|----------------|-----:|-----------|---|----:|---|-----:|
|gsm8k| 3|flexible-extract| 5|exact_match|↑ |0.924|± |0.0168|
| | |strict-match | 5|exact_match|↑ |0.928|± |0.0164|
- Downloads last month
- 116