-
oceanpty/TOA-Ultrafeedback-SFT-Rand-lla3.1-8b-inst
Viewer • Updated • 59.9k • 6 -
oceanpty/TOA-Ultrafeedback-SFT-Rand-qwen2-7b-inst
Viewer • Updated • 59.9k • 11 -
oceanpty/TOA-Ultrafeedback-SFT-PRS-lla3.1-8b-inst
Viewer • Updated • 59.9k • 7 -
oceanpty/TOA-Ultrafeedback-SFT-PRS-qwen2-7b-inst
Viewer • Updated • 59.9k • 8
Hai Ye
oceanpty
AI & ML interests
None yet
Recent Activity
upvoted a collection about 19 hours ago
MiroThinker-1.7 upvoted a paper about 2 months ago
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Organizations
None yet