Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
s
august66
Follow
Kyleyee's profile picture
callmespring's profile picture
mamba413's profile picture
3 followers
·
2 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 13 hours ago
august66/hh_qwen1.5_drpo
updated
a model
4 months ago
august66/hh_qwen1.5_drpo
updated
a dataset
4 months ago
august66/drpo_hh_qwen2.5_1.5b_with_ref_btpref
View all activity
Organizations
models
5
Sort: Recently updated
august66/hh_qwen1.5_drpo
2B
•
Updated
about 13 hours ago
august66/hh_qwen_1.5b_dpo_model_2
Text Generation
•
2B
•
Updated
Sep 9, 2025
•
51
august66/ultrafeedback_qwen_1.5b_drpo_model
Updated
Jul 9, 2025
august66/qwen2-sft-dpo-imdb-beta-1.0
Updated
Jun 2, 2025
august66/qwen2-sft-final
Text Generation
•
0.5B
•
Updated
Jun 1, 2025
datasets
26
Sort: Recently updated
august66/drpo_hh_qwen2.5_1.5b_with_ref_btpref
Viewer
•
Updated
Oct 8, 2025
•
48.8k
•
133
august66/hh_qwen2.5_1.5b_with_bias_bt_pref
Viewer
•
Updated
Oct 2, 2025
•
18k
•
3
august66/hh_qwen2.5_1.5b_with_bias
Viewer
•
Updated
Sep 27, 2025
•
18k
•
28
august66/drpo_hh_qwen2.5_1.5b
Viewer
•
Updated
Sep 8, 2025
•
43.8k
•
3
august66/dpo_reward_dist_pi_theta_prompt_3
Viewer
•
Updated
Sep 3, 2025
•
5k
•
1
august66/dpo_reward_dist_pi_theta_prompt_2
Viewer
•
Updated
Sep 3, 2025
•
5k
•
1
august66/dpo_reward_dist_pi_theta
Viewer
•
Updated
Aug 23, 2025
•
5k
•
1
august66/reward_distribution_2_tldr_openassist_pi_ref
Viewer
•
Updated
Aug 4, 2025
•
5k
•
4
august66/reward_distribution_2_tldr_openassist_pi_theta
Viewer
•
Updated
Aug 4, 2025
•
5k
•
16
august66/reward_distribution_tldr_openassist_pi_theta
Viewer
•
Updated
Jul 30, 2025
•
5k
•
21
View 26 datasets