Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_42_rule Updated about 6 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_42_rule Updated about 6 hours ago
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split11 Viewer • Updated 19 days ago • 5.6k • 14
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split6 Viewer • Updated 19 days ago • 5.59k • 14
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split9 Viewer • Updated 19 days ago • 5.59k • 13
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split8 Viewer • Updated 20 days ago • 5.59k • 14
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split10 Viewer • Updated 20 days ago • 5.59k • 17
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split7 Viewer • Updated 20 days ago • 5.59k • 17
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split5 Viewer • Updated 20 days ago • 5.59k • 12
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split4 Viewer • Updated 20 days ago • 5.59k • 16
Kazuki1450/Light-R1-SFTData-Extended-With-Difficulty-split3 Viewer • Updated 20 days ago • 5.59k • 15