ISTA-DASLab 's Collections
Extreme Compression of Large Language Models via Additive Quantization
Paper
• 2401.06118
• Published
• 14
ISTA-DASLab/Meta-Llama-3-70B-Instruct-AQLM-2Bit-1x16
Text Generation
• 11B • Updated
• 51
• 20
ISTA-DASLab/Meta-Llama-3-70B-AQLM-2Bit-1x16
Text Generation
• Updated
• 9
• 14
ISTA-DASLab/Meta-Llama-3-8B-Instruct-AQLM-2Bit-1x16
Text Generation
• 2B • Updated
• 107
• 12
ISTA-DASLab/Meta-Llama-3-8B-AQLM-2Bit-1x16
Text Generation
• 2B • Updated
• 11
• 7
ISTA-DASLab/c4ai-command-r-v01-AQLM-2Bit-1x16
Text Generation
• 6B • Updated
• 1
• 10
ISTA-DASLab/c4ai-command-r-plus-AQLM-2Bit-1x16
Text Generation
• 16B • Updated
• 1
• 10
ISTA-DASLab/Mixtral-8x7B-Instruct-v0_1-AQLM-2Bit-1x16-hf
Text Generation
• 7B • Updated
• 7
• 18
ISTA-DASLab/Mixtral-8x7b-AQLM-2Bit-1x16-hf
Text Generation
• 7B • Updated
• 47
• 23
ISTA-DASLab/Mistral-7B-Instruct-v0.2-AQLM-2Bit-2x8
Text Generation
• 2B • Updated
• 93
• 3
ISTA-DASLab/Mistral-7B-v0.1-AQLM-2Bit-1x16-hf
Text Generation
• 1B • Updated
• 10
• 2
ISTA-DASLab/gemma-2b-AQLM-2Bit-1x16-hf
Text Generation
• 0.8B • Updated
• 2
• 6
ISTA-DASLab/gemma-2b-AQLM-2Bit-2x8-hf
Text Generation
• 1B • Updated
• 4
• 4
ISTA-DASLab/Llama-2-7b-AQLM-2Bit-1x16-hf
Text Generation
• 1B • Updated
• 59
• 5
ISTA-DASLab/Llama-2-7b-AQLM-2Bit-2x8-hf
Text Generation
• 2B • Updated
• 87
• 2
ISTA-DASLab/Llama-2-7b-AQLM-2Bit-8x8-hf
Text Generation
• 2B • Updated
• 5
ISTA-DASLab/Llama-2-13b-AQLM-2Bit-1x16-hf
Text Generation
• 2B • Updated
• 15
ISTA-DASLab/Llama-2-13b-AQLM-4Bit-2x16-hf
Text Generation
• Updated
• 1
ISTA-DASLab/Llama-2-70b-AQLM-2Bit-1x16-hf
Text Generation
• 9B • Updated
• 9
• 6
ISTA-DASLab/Llama-2-70b-AQLM-2Bit-2x8-hf
Text Generation
• 18B • Updated
• 4
• 1
ISTA-DASLab/Llama-2-70b-AQLM-4Bit-2x16-hf
Text Generation
• 18B • Updated
• 4