From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 295
Runtime error Featured 2.95k The Smol Training Playbook • The secrets to building world-class LLMs
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper • 2509.26507 • Published Sep 30, 2025 • 547
Running Featured 626 The Tokenizer Playground • Experiment with and compare different tokenizers
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published Sep 2, 2025 • 229
FreedomIntelligence/medical-o1-reasoning-SFT Viewer • Updated Apr 22, 2025 • 90.1k • 4.71k • 1.06k
Nemotron-Pre-Training-Datasets Collection Large-scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated about 8 hours ago • 96
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 181 • 21
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights Paper • 2506.16406 • Published Jun 19, 2025 • 130