Wenxuan Huang's picture

Wenxuan Huang

Osilly

·

Osilly

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

VimRAG: Navigating Massive Visual Context in Retrieval-Augmented Generation via Multimodal Memory Graph

authored a paper 4 days ago

GroupGPT: A Token-efficient and Privacy-preserving Agentic Framework for Multi-User Chat Assistant

authored a paper 4 days ago

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

View all activity

Organizations

commented a paper 6 months ago

Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models

Paper • 2511.01618 • Published Nov 3, 2025 • 11 •

commented a paper 8 months ago

Interleaving Reasoning for Better Text-to-Image Generation

Paper • 2509.06945 • Published Sep 8, 2025 • 16 •

New activity in Osilly/Vision-R1-cold about 1 year ago

Why not upload images?

#1 opened about 1 year ago by