·
AI & ML interests
LLMs
Organizations
None yet
ZHLiu627/warm_start_sft_v2
Preview
• Updated • 7
ZHLiu627/sciworld_dataset
Preview
• Updated • 6
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1
Viewer
• Updated • 29.3k • 5
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1_v1
Viewer
• Updated • 29.3k • 5
• 1
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filtered_v1
Viewer
• Updated • 29.3k • 4
ZHLiu627/updated-code-qwen7-edufiltered
Viewer
• Updated • 43k • 5
ZHLiu627/updated-code-qwen7-edu
Viewer
• Updated • 75.6k • 7
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2filtered
Viewer
• Updated • 28.9k • 5
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2
Viewer
• Updated • 29.3k • 5
ZHLiu627/dataset_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212_2_global_step_70filteredd
Viewer
• Updated • 29.3k • 6
ZHLiu627/updated_qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1filtered
Viewer
• Updated • 29.1k • 5
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v2
Viewer
• Updated • 29.3k • 6
ZHLiu627/qwen2.5_code_1.5b_grpo_iter0_full_data_miao_0212__self_correction_iter1_v1
Viewer
• Updated • 29.3k • 6
Viewer
• Updated • 118k • 5
ZHLiu627/ultrafeedback_binarized_with_response_full
Viewer
• Updated • 61.1k • 5
ZHLiu627/ultrafeedback_binarized_with_response_full_part2
Viewer
• Updated • 21.1k • 6
ZHLiu627/ultrafeedback_binarized_with_response_full_part1
Viewer
• Updated • 20k • 4
• 1
ZHLiu627/ultrafeedback_binarized_with_response_full_part0
Viewer
• Updated • 20k • 6