view article Article Exploring Quantization Backends in Diffusers +1 derekl35, marcsun13, sayakpaul • May 21, 2025 • 45
view article Article Diffusers welcomes FLUX-2 +6 YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart • Nov 25, 2025 • 190
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 378
view article Article SmolLM - blazingly fast and remarkably powerful +1 loubnabnl, anton-l, eliebak • Jul 16, 2024 • 455
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face dvgodoy • Feb 11, 2025 • 123
view article Article KV Cache from scratch in nanoVLM +3 ariG23498, kashif, lusxvr, andito, pcuenq • Jun 4, 2025 • 119
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 326
view article Article Get your VLM running in 3 simple steps on Intel CPUs +3 ezelanza, helenai, nikita-savelyev-intel, echarlaix, IlyasMoutawwakil • Oct 15, 2025 • 22
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 495
view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
view article Article Introducing Würstchen: Fast Diffusion for Image Generation +3 dome272, babbleberns, kashif, sayakpaul, pcuenq • Sep 13, 2023 • 21
view article Article How 🤗 Accelerate runs very large models thanks to PyTorch sgugger • Sep 27, 2022 • 18