From Model Scaling to System Scaling: Scaling the Harness in Agentic AI Paper • 2605.26112 • Published 8 days ago • 1
A Review of Safe Reinforcement Learning: Methods, Theory and Applications Paper • 2205.10330 • Published May 20, 2022
TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning Paper • 2403.08694 • Published Mar 13, 2024
AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond Paper • 2509.26636 • Published Sep 30, 2025 • 1
AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions Paper • 2602.06008 • Published Feb 5 • 5
Understanding Agent Scaling in LLM-Based Multi-Agent Systems via Diversity Paper • 2602.03794 • Published Feb 3
Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization Paper • 2602.15028 • Published Feb 16 • 1
Ref-NeuS: Ambiguity-Reduced Neural Implicit Surface Learning for Multi-View Reconstruction with Reflection Paper • 2303.10840 • Published Mar 20, 2023 • 1
GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs Paper • 2412.11258 • Published Dec 15, 2024 • 13
AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond Paper • 2509.26636 • Published Sep 30, 2025 • 1
See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm Paper • 2512.08629 • Published Dec 9, 2025 • 1
LLM-Optic: Unveiling the Capabilities of Large Language Models for Universal Visual Grounding Paper • 2405.17104 • Published May 27, 2024
SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models Paper • 2510.08559 • Published Oct 9, 2025 • 9
Temporal Preference Optimization for Long-Form Video Understanding Paper • 2501.13919 • Published Jan 23, 2025 • 23
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published Jan 13, 2025 • 55