Lewei Lu's picture

Lewei Lu

luotto

·

ottolu

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Scaling Spatial Intelligence with Multimodal Foundation Models

upvoted a collection 10 days ago

upvoted a collection 18 days ago

View all activity

Organizations

upvoted a paper 6 days ago

Scaling Spatial Intelligence with Multimodal Foundation Models

Paper • 2511.13719 • Published 20 days ago • 44

upvoted a collection 10 days ago

NEO1_0

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale • 7 items • Updated Oct 17 • 4

upvoted a collection 18 days ago

SenseNova-SI

Scaling Spatial Intelligence with Multimodal Foundation Models • 8 items • Updated 1 day ago • 10

upvoted a paper 19 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published 23 days ago • 158

upvoted a paper about 1 month ago

Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning

Paper • 2510.11027 • Published Oct 13 • 21

upvoted 7 papers about 2 months ago

VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning

Paper • 2510.10518 • Published Oct 12 • 18

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13 • 165

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 65

CVD-STORM: Cross-View Video Diffusion with Spatial-Temporal Reconstruction Model for Autonomous Driving

Paper • 2510.07944 • Published Oct 9 • 24

InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue

Paper • 2510.13747 • Published Oct 15 • 29

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

Paper • 2510.08565 • Published Oct 9 • 19

upvoted 3 papers 2 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6 • 116

BaseReward: A Strong Baseline for Multimodal Reward Model

Paper • 2509.16127 • Published Sep 19 • 21

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 102

upvoted an article 2 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+7

Sep 22

•

120

upvoted 4 papers 3 months ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 189

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18 • 111

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published Sep 4 • 92

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 208