Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
liked
a model 24 minutes ago
nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 updated
a model 1 day ago
inference-optimization/Llama-3.1-8B-Instruct-6-bits published
a model 1 day ago
inference-optimization/Llama-3.1-8B-Instruct-6-bits