DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22, 2025 • 452
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 29 days ago • 166
δ-mem: Efficient Online Memory for Large Language Models Paper • 2605.12357 • Published 20 days ago • 124
MolmoAct2: Action Reasoning Models for Real-world Deployment Paper • 2605.02881 • Published 28 days ago • 347
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling Paper • 2605.13301 • Published 19 days ago • 159
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 18 days ago • 84
bartowski/nvidia_Nemotron-H-47B-Reasoning-128K-GGUF Text Generation • 47B • Updated Aug 29, 2025 • 816 • 4