Teaching language models to think efficiently with Adaptive Length Penalty (ALP)
AI & ML interests
Scaling up good synthetic reasoning. Post-training and synthetic data research lab.
Organization Card
SynthLabs
Advancing and Scaling Synthetic Reasoning through Post-Training AI Research
This collection contains assets associated with the Big-Math dataset, a high-quality collection of over 250,000 math questions with verifiable answers.
- SynthLabsAI/Big-Math-RL-Verified
  Viewer • Updated • 251k • 5.5k • 222
- SynthLabsAI/Big-Math-RL-UNVERIFIED
  Viewer • Updated • 34.9k • 7 • 1
- nlile/NuminaMath-1.5-RL-Verifiable
  Viewer • Updated • 131k • 5.69k • 9
- Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models
  Paper • 2502.17387 • Published • 7
Datasets (6)
- SynthLabsAI/Big-Math-RL-Verified
  Viewer • Updated • 251k • 5.5k • 222
- SynthLabsAI/Big-Math-RL-UNVERIFIED
  Viewer • Updated • 34.9k • 7 • 1
- SynthLabsAI/PERSONA
  Viewer • Updated • 200k • 3.34k • 18
- SynthLabsAI/PERSONA_subset
  Viewer • Updated • 5k • 3.32k • 3
- SynthLabsAI/PRISM-Filter
  Viewer • Updated • 3.87k • 5
- SynthLabsAI/Synthetic-Personas
  Viewer • Updated • 1k • 6 • 3