OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent Paper • 2601.07779 • Published 1 day ago • 22
MMLongCite: A Benchmark for Evaluating Fidelity of Long-Context Vision-Language Models Paper • 2510.13276 • Published Oct 15, 2025
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 211
MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark Paper • 2512.01603 • Published Dec 1, 2025
Agent2World: Learning to Generate Symbolic World Models via Adaptive Multi-Agent Feedback Paper • 2512.22336 • Published 18 days ago • 2
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 4 days ago • 46
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 4 days ago • 46
The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning Paper • 2601.06002 • Published 4 days ago • 46
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published 30 days ago • 63
ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models Paper • 2510.06014 • Published Oct 7, 2025 • 10
Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm Paper • 2511.04570 • Published Nov 6, 2025 • 211
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published Oct 27, 2025 • 96