inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_heuristic 23B • Updated 16 days ago • 41
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_noise 23B • Updated 16 days ago • 32
inference-optimization/Qwen3-30B-A3B-Instruct-2507_6.0_bits_mode_hybrid 23B • Updated 16 days ago • 36
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_heuristic 22B • Updated 16 days ago • 34
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_noise 22B • Updated 16 days ago • 36
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.5_bits_mode_hybrid 22B • Updated 16 days ago • 36
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_heuristic 20B • Updated 16 days ago • 39
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_noise 20B • Updated 16 days ago • 35
inference-optimization/Qwen3-30B-A3B-Instruct-2507_5.0_bits_mode_hybrid 20B • Updated 16 days ago • 40
inference-optimization/gpt-oss-120b-from-self-ckpt5-speculator.eagle3 0.9B • Updated 18 days ago • 72
inference-optimization/gpt-oss-120b-from-self-ckpt3-speculator.eagle3 0.9B • Updated 18 days ago • 60
inference-optimization/gpt-oss-120b-from-self-ckpt4-speculator.eagle3 0.9B • Updated 18 days ago • 56
inference-optimization/gpt-oss-120b-from-self-ckpt2-speculator.eagle3 0.9B • Updated 18 days ago • 65
inference-optimization/gpt-oss-120b-from-self-ckpt1-speculator.eagle3 0.9B • Updated 18 days ago • 60
inference-optimization/gpt-oss-120b-from-self-ckpt0-speculator.eagle3 0.9B • Updated 18 days ago • 61
inference-optimization/Qwen3-Next-80B-A3B-Instruct-GSM8K-MTP-finetuned 81B • Updated 18 days ago • 54
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt3 0.5B • Updated 19 days ago • 24
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt3-speculator.eagle3 0.9B • Updated 19 days ago • 41
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt1-speculator.eagle3 0.9B • Updated 19 days ago • 26
inference-optimization/gpt-oss-120b-from-qwen235b-ckpt0-speculator.eagle3 0.9B • Updated 19 days ago • 25
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt2 0.5B • Updated 19 days ago • 24
inference-optimization/Qwen3-30B-from-Qwen3-235B_resps-speculators.eagle3-ckpt1 0.5B • Updated 19 days ago • 21