Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
2
3
Yurun Yuan
RyanYr
Follow
John6666's profile picture
KenCao2007's profile picture
ziadrone's profile picture
6 followers
·
2 following
yurun-yuan
AI & ML interests
None yet
Recent Activity
updated
a dataset
26 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_matheval
updated
a model
26 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
published
a model
26 days ago
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
View all activity
Organizations
None yet
RyanYr
's models
30
Sort: Recently updated
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0
Updated
26 days ago
•
56
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_kl_bl0_200
Updated
27 days ago
•
7
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
27 days ago
•
30
RyanYr/pg_sais-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
27 days ago
•
55
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
27 days ago
•
57
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
27 days ago
•
57
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
27 days ago
•
57
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
27 days ago
•
60
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
27 days ago
•
59
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_piref_kl_behavior
Updated
27 days ago
•
59
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B_piref
Updated
27 days ago
•
58
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
27 days ago
•
57
RyanYr/pg_sais-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
27 days ago
•
57
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_nokl
Updated
27 days ago
•
7
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_piref_kl
Updated
27 days ago
•
8
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
28 days ago
•
45
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl
Updated
28 days ago
•
40
RyanYr/pg-dapo_shuffled-10_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
28 days ago
•
39
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
28 days ago
•
43
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_nokl
Updated
28 days ago
•
36
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl
Updated
28 days ago
•
37
RyanYr/pg-dapo_shuffled-0_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
28 days ago
•
34
RyanYr/pg_trajis-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
28 days ago
•
40
RyanYr/pg_sais-dapo_shuffled-offline-grpo_qwen2.5-math-1.5B
Updated
28 days ago
•
42
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl_behavior
Updated
28 days ago
•
49
RyanYr/pg-dapo_shuffled-01_offline-grpo_qwen2.5-math-1.5B_kl
Updated
29 days ago
•
29
RyanYr/grpo-dapo-qwen2.5-math-1.5B-n4
Updated
29 days ago
RyanYr/grpo-dapo-qwen3-1.7B-Base-mbs128-n4
Updated
Apr 20
RyanYr/grpo-dapo_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25
•
6
RyanYr/grpo-dapo-01_offline-qwen2.5math-1.5B-base-mbs256-n8_actor
Updated
Feb 25