
Muhammad Khalifa

mkhalifa

https://mukhal.github.io/

AI & ML interests

natural language generation, reinforcement learning

Recent Activity

upvoted a paper about 1 month ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

liked a dataset about 1 month ago

nvidia/Nemotron-Personas-Korea

updated a dataset about 1 month ago

launch/thinkprm-1K-verification-cots


Organizations

Papers 9

models 21

datasets 18

mkhalifa/agent

Updated Nov 26, 2025 • 3

mkhalifa/gpqa-diamond-physics

Viewer • Updated Mar 15, 2025 • 86 • 362

mkhalifa/short-to-long-5K

Viewer • Updated Feb 26, 2025 • 5k • 13

mkhalifa/CoGEX

Viewer • Updated Feb 13, 2025 • 51.8k • 375

mkhalifa/llama-3.1-8b-instruct-math-trajectories-64-sample-per-problem

Viewer • Updated Jan 29, 2025 • 736k • 27

mkhalifa/llama-3.1-8b-instruct-math-trajectories-48-sample-per-problem

Viewer • Updated Jan 29, 2025 • 552k • 19

mkhalifa/llama-3.1-8b-instruct-math-trajectories-32-sample-per-problem

Viewer • Updated Jan 29, 2025 • 368k • 16

mkhalifa/llama-3.1-8b-instruct-math-trajectories-16-sample-per-problem

Viewer • Updated Jan 29, 2025 • 184k • 9

mkhalifa/llama-3.1-8b-instruct-math-trajectories-8-sample-per-problem

Viewer • Updated Jan 29, 2025 • 92k • 8

mkhalifa/llama-3.1-70b-instruct-math-trajectories-8-sample-per-problem

Viewer • Updated Jan 29, 2025 • 92k • 6

