RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments Paper • 2511.07317 • Published Nov 10, 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29, 2025
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees Paper • 2503.08893 • Published Mar 11, 2025
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning Paper • 2310.06694 • Published Oct 10, 2023
Evaluating Large Language Models at Evaluating Instruction Following Paper • 2310.07641 • Published Oct 11, 2023
Plug-and-Play Knowledge Injection for Pre-trained Language Models Paper • 2305.17691 • Published May 28, 2023