Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated
a model
about 2 hours ago
inference-optimization/granite-4.0-h-tiny-quantized.w4a16
updated
a collection
4 days ago
Granite 4 Small and Tiny Quantized Models
published
a model
4 days ago
inference-optimization/granite-4.0-h-small-quantized.w8a8