arxiv:2603.10178
Huanxin Sheng
HuanxinSheng
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 days ago
iGRPO: Self-Feedback-Driven LLM Reasoning commentedon a paper about 1 month ago
To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models upvoted a paper about 1 month ago
To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models