29 2

Suhwan Kim

drrobot333

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

RAGEN-2: Reasoning Collapse in Agentic RL

upvoted a paper 2 days ago

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

upvoted a paper 3 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

View all activity

Organizations

None yet

upvoted a paper 1 day ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 4 days ago • 49

upvoted a paper 2 days ago

Vanast: Virtual Try-On with Human Image Animation via Synthetic Triplet Supervision

Paper • 2604.04934 • Published 5 days ago • 35

upvoted a paper 3 days ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 5 days ago • 222

liked a model 9 days ago

KRAFTON/Raon-Speech-9B

Any-to-Any • 9B • Updated 3 days ago • 5.77k • 33

upvoted a paper 14 days ago

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs

Paper • 2603.16932 • Published 27 days ago • 86

liked a model about 1 month ago

nyu-visionx/solaris

Updated Mar 4 • 9

upvoted 4 papers 3 months ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published Jan 11 • 215

K-EXAONE Technical Report

Paper • 2601.01739 • Published Jan 5 • 92

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 64

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published Dec 19, 2025 • 99

upvoted 2 papers 4 months ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published Dec 18, 2025 • 222

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published Dec 9, 2025 • 122

upvoted 4 papers 5 months ago

upvoted 4 papers 7 months ago

Video models are zero-shot learners and reasoners

Paper • 2509.20328 • Published Sep 24, 2025 • 100

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 254

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Paper • 2509.15212 • Published Sep 18, 2025 • 22

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18, 2025 • 33

Suhwan Kim

AI & ML interests

Recent Activity

Organizations

drrobot333's activity