Drive Like a Human: Rethinking Autonomous Driving with Large Language Models Paper • 2307.07162 • Published Jul 14, 2023
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Paper • 2406.08418 • Published Jun 12, 2024 • 31
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes Paper • 2409.04003 • Published Sep 6, 2024 • 1
O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering Paper • 2505.16582 • Published May 22, 2025
KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision Paper • 2506.00783 • Published Jun 1, 2025 • 1
IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video? Paper • 2509.24709 • Published Sep 29, 2025 • 6
RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection Paper • 2509.26048 • Published Sep 30, 2025 • 7
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Paper • 2510.08002 • Published Oct 9, 2025 • 23
RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection Paper • 2509.26048 • Published Sep 30, 2025 • 7
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Paper • 2510.08002 • Published Oct 9, 2025 • 23
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Paper • 2510.08002 • Published Oct 9, 2025 • 23 • 2
NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints Paper • 2510.08565 • Published Oct 9, 2025 • 19
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 195
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper • 2412.05271 • Published Dec 6, 2024 • 159
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20, 2025 • 100