penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__qwen3base-GLM-4_7-sw-70 8B • Updated about 5 hours ago
penfever/rl__24GPU_base__exp_rpt_pymethods2test-large__Qwen3-8B-Base-65 8B • Updated about 6 hours ago
penfever/rl__24GPU_base__exp_rpt_curriculum-hard__r2egym-nl2bash-stack-15 8B • Updated about 6 hours ago
penfever/rl_rl-conf_24GP_base-yaml_mode-path_r2eg-nl2b-stac-bugs-fixt_trai-data_exp_rpt_soft-v2-45 8B • Updated 16 days ago • 17
penfever/rl_rl-conf_24GP_base_noth-yaml_mode-path_r2eg-nl2b-stac-bugs_trai-data_exp_rpt_stac-bash-110 8B • Updated 16 days ago • 17
penfever/rl_rl-conf_20GP_base-yaml_mode-path_r2eg-nl2b-stac-bugs-fixt_trai-data_exp_rpt_stac-pyte-v2-25 8B • Updated 16 days ago • 25
penfever/rl_rl-conf_20GP_base-yaml_mode-path_r2eg-nl2b-stac-bugs-fixt_trai-data_exp_rpt_code-v2-25 8B • Updated 16 days ago • 25
penfever/glm46-ling-coder-sft-sandboxes-1-maxeps-131k Text Generation • 308k • Updated Dec 20, 2025 • 1
penfever/kimi-k2-swesmith_with_plain_docker-sandboxes-maxeps-32k Text Generation • 308k • Updated Dec 18, 2025 • 15
penfever/GLM-4_6-gemini25flash-stackexchange-overflow-32ep-512k-fixeps Text Generation • 308k • Updated Nov 24, 2025 • 43
penfever/nl2bash-verified-GLM-4_6-traces-32ep-32k-dft Text Generation • 308k • Updated Nov 23, 2025 • 6
penfever/nl2bash-verified-GLM-4_6-traces-32ep-32k-restore-hp Text Generation • 308k • Updated Nov 20, 2025 • 2
penfever/nl2bash_verified_gpt-5-nano-traces-restore-hp Text Generation • 308k • Updated Nov 20, 2025 • 1
penfever/selfinstruct-naive-sandboxes-2-traces-restore-hp Text Generation • 308k • Updated Nov 20, 2025 • 2