Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_tok_python_1p0_0p0_1p0_grpo_42_rule Updated about 9 hours ago
Kazuki1450/Qwen2.5-1.5B-Instruct_dsum_3_6_tok_Certainly_1p0_0p0_1p0_grpo_42_rule Updated about 9 hours ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_dr_grpo_42_rule Updated 2 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_dr_grpo_42_rule Updated 2 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_sapo_42_rule Updated 2 days ago
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_no_bracket_0p0_0p0_1p0_grpo_dr_grpo_42_rule Updated 2 days ago
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_0p5_0p0_1p0_grpo_dr_grpo_42_rule Text Generation • 7B • Updated 2 days ago • 418
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_sapo_42_rule Text Generation • 2B • Updated 2 days ago • 460
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_dr_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 257
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p8_0p0_1p0_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 398
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_0p5_0p0_1p0_grpo_42_rule Text Generation • 7B • Updated 2 days ago • 384
Kazuki1450/Qwen3-1.7B-Base_dsum_3_6_0p8_0p0_1p0_grpo_dr_grpo_42_rule Text Generation • 2B • Updated 2 days ago • 364
Kazuki1450/Olmo-3-1025-7B_dsum_3_6_fnr_eng_1p0_0p0_1p0_grpo_42_rule Text Generation • 7B • Updated 2 days ago • 769