This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
LLM Post-Training
Recent Activity
published a model about 12 hours ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct_thinking-parquet_qwen3-1.7b_epoch_3_mask_k4096 updated a model about 12 hours ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct_thinking-parquet_qwen3-1.7b_epoch_3_mask_k4096 published a model 1 day ago
SeanWang0027/rl_warm_up_mixed_minesweeper_correct_thinking-parquet_qwen3-1.7b_epoch_3_mask