Collections
Discover the best community collections!
Collections including paper arxiv:2501.05441
-
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper • 2502.02737 • Published • 249 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 429 -
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper • 2501.08313 • Published • 301
-
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper • 2501.05441 • Published • 95 -
Transformers without Normalization
Paper • 2503.10622 • Published • 171 -
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction
Paper • 2505.21473 • Published • 16 -
Representing Speech Through Autoregressive Prediction of Cochlear Tokens
Paper • 2508.11598 • Published • 17
-
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper • 2502.02737 • Published • 249 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259 -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 429 -
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper • 2501.08313 • Published • 301
-
The GAN is dead; long live the GAN! A Modern GAN Baseline
Paper • 2501.05441 • Published • 95 -
Transformers without Normalization
Paper • 2503.10622 • Published • 171 -
DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction
Paper • 2505.21473 • Published • 16 -
Representing Speech Through Autoregressive Prediction of Cochlear Tokens
Paper • 2508.11598 • Published • 17