inference-optimization/Kimi-K2-Instruct-0905-BF16-FP8-BLOCK Text Generation • 1T • Updated 16 days ago • 25
inference-optimization/Meta-Llama-3.1-8B-Instruct-NVFP4-FP8-Dynamic_5.75-bits 6B • Updated Jan 26 • 1