Song Jiang's picture

Song Jiang

songjiang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

upvoted a paper 5 months ago

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

upvoted a paper 5 months ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

View all activity

Organizations

None yet

Papers 2

arxiv:2503.15478

arxiv:2307.15780

models 0

None public yet

datasets 0

None public yet