arxiv:2603.14769
Xuanfei Ren
xuanfeiren
AI & ML interests
RL and LLM
Recent Activity
upvoted a paper 3 days ago
SkillGrad: Optimizing Agent Skills Like Gradient Descent upvoted a paper 2 months ago
Provably Learning from Language Feedback upvoted a paper 2 months ago
Understanding the Challenges in Iterative Generative Optimization with LLMsOrganizations
None yet