agentic-moral-alignment/gemma2-2b-q4__ipd_str_tft__deont__none_notool__r1__core Updated about 1 hour ago
agentic-moral-alignment/Qwen3.5-9B__grpo_unsloth__ipd_structured_actionab_native_tool__deont__tft__1000ep_run1 Text Generation • Updated 3 days ago • 19
agentic-moral-alignment/Qwen3.5-9B__grpo_unsloth__ipd_structured_actionab_native_bare__deont__tft__1000ep_run20 Text Generation • Updated 3 days ago • 13
agentic-moral-alignment/qwen35-9b-grpo-unsloth-ut-tft-1000ep Viewer • Updated 2 days ago • 7.44k • 15
agentic-moral-alignment/qwen35-9b-grpo-unsloth-game-tft-1000ep Viewer • Updated 2 days ago • 6.48k • 16