-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • 33B • Updated • 48 • 32 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 82 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 14 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 18
AI & ML interests
Scale up the Reasoner-Zero Training
Recent Activity
Organization Card
Welcome to Open-Reasoner-Zero!
Please check our GitHub!
-
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning • 33B • Updated • 48 • 32 -
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning • 8B • Updated • 82 • 33 -
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning • 2B • Updated • 14 -
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning • 0.5B • Updated • 18
models
10
Open-Reasoner-Zero/PaCoRe-8B
8B
•
Updated
•
5
Open-Reasoner-Zero/ORZ-R1-Distill-Qwen-14B
15B
•
Updated
•
5
•
2
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-32B
Reinforcement Learning
•
32B
•
Updated
•
22
•
6
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-7B
Reinforcement Learning
•
7B
•
Updated
•
15
•
1
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-0.5B
Reinforcement Learning
•
0.5B
•
Updated
•
19
Open-Reasoner-Zero/Open-Reasoner-Zero-7B
Reinforcement Learning
•
8B
•
Updated
•
82
•
33
Open-Reasoner-Zero/Open-Reasoner-Zero-32B
Reinforcement Learning
•
33B
•
Updated
•
48
•
32
Open-Reasoner-Zero/Open-Reasoner-Zero-0.5B
Reinforcement Learning
•
0.5B
•
Updated
•
18
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-1.5B
Reinforcement Learning
•
2B
•
Updated
•
14
•
1
Open-Reasoner-Zero/Open-Reasoner-Zero-1.5B
Reinforcement Learning
•
2B
•
Updated
•
14