arxiv:2602.05843
Jinyang Wu
Jinyang23
AI & ML interests
large language models, reasoning, agentic rl
Recent Activity
upvoted a paper about 1 month ago
CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models upvoted a paper about 1 month ago
Query as Anchor: Scenario-Adaptive User Representation via Large Language ModelOrganizations
None yet