ISTA-DASLab/switch-c-2048_qmoe
Updated
•
4
None defined yet.
MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization
DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers