CocoaBench: Evaluating Unified Digital Agents in the Wild Paper • 2604.11201 • Published 17 days ago • 36