Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
GOVINDFROM
/
MindGamesCodeNames
like
0
Reinforcement Learning
Safetensors
game-theory
codenames
neurips-2025
graph-neural-networks
preference-learning
llm-distillation
License:
mit
Model card
Files
Files and versions
xet
Community
1
Copy to bucket
new
main
MindGamesCodeNames
Commit History
Update README.md
5c80055
verified
GOVINDFROM
commited on
Dec 30, 2025
Update README.md
e6db4f9
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload model card
2890d84
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload battleground_eval.json
e91ffab
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload master_config.json
1f81885
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload SFT model
43b7674
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload policy_after_ppo.pt
f0ef1c3
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload policy_after_distill.pt
cd470a3
verified
GOVINDFROM
commited on
Dec 29, 2025
Upload policy_final.pt
edb9110
verified
GOVINDFROM
commited on
Dec 29, 2025
initial commit
12f043b
verified
GOVINDFROM
commited on
Dec 29, 2025