SATQuest Dataset Collections
Yanxiao Zhao
sdpkjc
AI & ML interests
Reinforcement Learning
models 95
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed5 • Reinforcement Learning
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed4 • Reinforcement Learning
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed3 • Reinforcement Learning
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed2 • Reinforcement Learning
sdpkjc/Ant-v4-ppo_fix_continuous_action-seed1 • Reinforcement Learning
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed5 • Reinforcement Learning
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed4 • Reinforcement Learning
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed3 • Reinforcement Learning
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed2 • Reinforcement Learning
sdpkjc/Humanoid-v4-ppo_fix_continuous_action-seed1 • Reinforcement Learning
datasets 17
sdpkjc/SATQuest • 140 • 77
sdpkjc/SATQuest-RFT-3k • 3k • 6
sdpkjc/24problems_quiz-eval-n4-1-10-24 • 55.5k • 6
sdpkjc/24problems_quiz-eval-5 • 100k • 5
sdpkjc/24problems_quiz • 85.6k • 5
sdpkjc/SATQuest-RFT-1k • 1k • 3
sdpkjc/SATQuest-Tiny • 10 • 4
sdpkjc/SATQuest-G • 963 • 4
sdpkjc/NumBase-N01-S2g-B2g • 983k • 3
sdpkjc/NumBase-N01-S2g-B28 • 459k • 5