Yushuo Guan
UnnamedWatcher
AI & ML interests
None yet
Recent Activity
upvoted a paper about 3 hours ago
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning
authored a paper 4 months ago
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation
authored a paper 4 months ago
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models
Organizations
None yet