rediska0123/ue_manager_trip_uhead_token_Qwen3-8B_fixed_prm_feature_linear_hs_20e_best_at_epoch7 Updated Jan 3
rediska0123/train_prm800k_qwen3.5-122B_annotated_k2_think_thinking_extracted_v1_hs Viewer • Updated about 5 hours ago • 10
rediska0123/train_prm800k_qwen3.5-122B_annotated_k2_think_thinking_extracted_v1 Viewer • Updated about 8 hours ago • 8.8k