wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step50_2026-01-27_21-36-45_nvidia_balanced 8B • Updated about 7 hours ago
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step100_2026-01-27_21-36-45_nvidia_balanced 8B • Updated about 7 hours ago
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step350_2026-01-27_03-19-15_nvidia_balanced 8B • Updated about 14 hours ago
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step300_2026-01-27_03-19-15_nvidia_balanced 8B • Updated about 16 hours ago
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step100_2026-01-27_03-19-15_nvidia_balanced 8B • Updated about 20 hours ago
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step150_2026-01-27_03-19-15_nvidia_balanced 8B • Updated about 21 hours ago
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step200_2026-01-27_03-19-15_nvidia_balanced 8B • Updated about 21 hours ago
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step50_2026-01-27_03-19-15_nvidia_balanced 8B • Updated 1 day ago
wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-25_06-29-13_nvidia_balanced 4B • Updated 3 days ago • 10
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_step280_2026-01-25_06-28-54_nvidia_balanced 4B • Updated 3 days ago • 11
wenwenD/qwen3-4b-codeexp_grpo_with_prior_think_step280_2026-01-24_07-19-57_nvidia 4B • Updated 4 days ago • 14
wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-24_07-21-36_nvdia 4B • Updated 4 days ago • 14
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_discount_always1_step175_2026_01_23_21_40_33 4B • Updated 4 days ago • 8
wenwenD/qwen7B-instruct-repo_sft_3epcs_w_context-synthetic_multiturn_sft_3epcs 8B • Updated Jun 16, 2025 • 1