Muapi/sierra-on-line-sci0-character-ega-sprite-sheets Text-to-Image • Updated about 8 hours ago • 1
aHiroakiIshikawa/act_OpenArm_pick_and_place_20260529_1 Robotics • 51.7M • Updated 2 days ago • 34 • 1
Efficient Agentic Reinforcement Learning with On-Policy Intrinsic Knowledge Boundary Enhancement Paper • 2605.26952 • Published 8 days ago • 16
Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models Paper • 2605.17672 • Published 17 days ago • 22
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published 22 days ago • 195
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 21 days ago • 269
PACEvolve++: Improving Test-time Learning for Evolutionary Search Agents Paper • 2605.07039 • Published 27 days ago • 4
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents Paper • 2605.05185 • Published 28 days ago • 101
DCAgent2/swebench_verified_random_100_folders_R2EGym_32B_Agent_20260424_010913 Viewer • Updated Apr 24 • 300 • 15 • 1
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 504