[ICLR'24 Spotlight] Tool-Augmented Reward Modeling
AI & ML interests
Large Language Models
models 12
ernie-research/Themis-7b
Updated • 4 • 4
ernie-research/APPS-Gemma-7B-MA-PPO-Fixed10
9B • Updated • 4
ernie-research/APPS-Gemma-2B-MA-PPO-Fixed10
3B • Updated • 17
ernie-research/HH-RLHF-Gemma-2B-MA-PPO-Fixed5
3B • Updated • 10
ernie-research/HH-RLHF-Gemma-7B-MA-PPO-Fixed5
9B • Updated • 3
ernie-research/TLDR-Gemma-7B-MA-PPO-Fixed5
9B • Updated • 2
ernie-research/TLDR-Gemma-2B-MA-PPO-Fixed5
3B • Updated • 4 • 1
ernie-research/TLDR-Gemma-2-27B-MA-PPO-Fixed5
27B • Updated • 14
ernie-research/ernie-code-560m
Updated • 101 • 10
ernie-research/MonoGPT
Text Generation • 0.4B • Updated • 5 • 2
datasets 7
ernie-research/MEnvData-SWE-Trajectory
Viewer • Updated • 3.92k • 163 • 26
ernie-research/MEnvData-SWE
Preview • Updated • 1.26k • 3
ernie-research/MEnvBench
Viewer • Updated • 1k • 17 • 2
ernie-research/TARA
Preview • Updated • 21 • 1
ernie-research/GPTDynamics
Preview • Updated • 53 • 1
ernie-research/rendered_xnli
Updated • 9 • 1
ernie-research/rendered_GLUE
Updated • 18 • 1