In a Training Loop 🔄

Nuo Xu

Norm

https://normxu.github.io/

AI & ML interests

Video Diffusion; Large Language Model; Object Detection; OCR

Recent Activity

authored a paper about 12 hours ago

MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning

liked a dataset 5 days ago

facebook/wearable-ai

liked a model 12 days ago

nvidia/LocateAnything-3B

View all activity

Organizations

authored a paper about 12 hours ago

MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning

Paper • 2512.06810 • Published Dec 7, 2025

liked a dataset 5 days ago

facebook/wearable-ai

Viewer • Updated 26 days ago • 2.1k • 1.61k • 11

liked a model 12 days ago

nvidia/LocateAnything-3B

Image-Text-to-Text • 4B • Updated 4 days ago • 98.7k • 2.09k

liked a model about 1 month ago

hao9610/X2SAM

Updated May 5 • 4

liked a model about 2 months ago

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 9 days ago • 2.83M • • 4.89k

liked a model 2 months ago

tiiuae/Falcon-Perception

Mask Generation • 0.6B • Updated May 11 • 4.07k • 127

liked a model 3 months ago

Logics-MLLM/Logics-Parsing-Omni

32B • Updated Apr 8 • 16 • 10

upvoted a paper 3 months ago

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 187

liked a dataset 4 months ago

NoobEngineere/NSFW_Manga

Viewer • Updated Feb 23 • 596k • 232 • 15

upvoted a paper 5 months ago

LongCat-Flash-Thinking-2601 Technical Report

Paper • 2601.16725 • Published Jan 23 • 181

liked a model 5 months ago

meituan-longcat/LongCat-Flash-Thinking-2601

Text Generation • 562B • Updated Jan 23 • 4.98k • 114

liked 2 datasets 6 months ago

wsdwJohn1231/DreamLIP_capion_csv_w_key

Viewer • Updated Dec 2, 2025 • 13M • 14 • 1

Jyuhamdik/RealSyn15M

Viewer • Updated Dec 18, 2025 • 15.2M • 134 • 1

upvoted 2 papers 7 months ago

Revisiting Multimodal Positional Encoding in Vision-Language Models

Paper • 2510.23095 • Published Oct 27, 2025 • 23

LongCat-Flash-Omni Technical Report

Paper • 2511.00279 • Published Oct 31, 2025 • 27

liked a model 8 months ago

meituan-longcat/LongCat-Flash-Omni

Any-to-Any • 561B • Updated Nov 11, 2025 • 44 • 113

upvoted a paper 8 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 517

liked a model 9 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 170k • 1.31k

liked a model 10 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 58.4k • 535

upvoted a paper 10 months ago

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26, 2025 • 172

Nuo Xu

AI & ML interests

Recent Activity

Organizations

Norm's activity