arxiv:2601.22108
Shuqi Ke
shuqike
·
AI & ML interests
I work to close the generalization and sample efficiency gap between AI and human.
Recent Activity
upvoted a paper 21 days ago
Value-Based Pre-Training with Downstream Feedback submitted
a paper
21 days ago
Value-Based Pre-Training with Downstream Feedback authored
a paper
21 days ago
Reason for Future, Act for Now: A Principled Framework for Autonomous
LLM Agents with Provable Sample Efficiency