Load vLLM from local snapshot to support default subfolders 0e2f6c4 Alikestocode commited on Nov 11, 2025
Fix AWQ model loading: point to default/ subfolder and fix tokenizer loading a76dbfd Alikestocode commited on Nov 10, 2025
Remove processor arg from oneshot to avoid tokenizer conflict 4b47dea Alikestocode commited on Nov 10, 2025
Fix processor error: pass tokenizer explicitly for text-only models 5bf2e9f Alikestocode commited on Nov 10, 2025
Fix oneshot() API: use correct parameter names from documentation 35d8225 Alikestocode commited on Nov 10, 2025
Remove duplicate build_awq_modifier_config - keep existing correct version 5bf02e9 Alikestocode commited on Nov 10, 2025
Add build_awq_modifier_config helper using QuantizationScheme objects cf9ed91 Alikestocode commited on Nov 10, 2025
Fix quantization_config structure: use correct AWQ format 3f08592 Alikestocode commited on Nov 10, 2025
Fix modifiers initialization: ensure it's always defined f3114ba Alikestocode commited on Nov 10, 2025
Fix BaseQuantizationConfig import: add fallback approaches a49281c Alikestocode commited on Nov 10, 2025
Add local test script for quantization notebook validation 011c926 Alikestocode commited on Nov 10, 2025
Fix QuantizationConfig: use config_groups with BaseQuantizationConfig ecf6a69 Alikestocode commited on Nov 10, 2025
Add note about restarting kernel if AWQModifier errors occur 33a1d2e Alikestocode commited on Nov 10, 2025
Fix delete_revisions import with fallback cache cleanup 7a2a590 Alikestocode commited on Nov 10, 2025
Fix delete_revisions import - use fallback cache cleanup method 4be72e0 Alikestocode commited on Nov 10, 2025
Fix AWQModifier import path: use modifiers.awq instead of modifiers.quantization f0033ab Alikestocode commited on Nov 10, 2025
Fix LLM Compressor package name: llmcompressor (no hyphen) 2326498 Alikestocode commited on Nov 10, 2025
Remove duplicate LLM Compressor section - now primary method d4bc333 Alikestocode commited on Nov 10, 2025
Replace AutoAWQ with LLM Compressor (vLLM native) in Colab notebook ae07f77 Alikestocode commited on Nov 10, 2025
Add disk space cleanup after quantization in Colab notebook 24107f3 Alikestocode commited on Nov 10, 2025
Fix linter error: use %pip instead of !pip in Colab notebook 2dff966 Alikestocode commited on Nov 10, 2025
Add Colab notebook for AWQ quantization of router models a79bc8f Alikestocode commited on Nov 10, 2025
Clarify LLM Compressor optional status - vLLM has native AWQ support b2bf767 Alikestocode commited on Nov 10, 2025