9 29 19

Xinchen Zhang

comin

https://cominclip.github.io/

Cominclip

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

upvoted a paper 8 days ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

upvoted a paper 13 days ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models

Paper • 2605.21573 • Published 7 days ago • 96

upvoted a paper 8 days ago

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Paper • 2605.18739 • Published 9 days ago • 109

upvoted a paper 13 days ago

Training Long-Context Vision-Language Models Effectively with Generalization Beyond 128K Context

Paper • 2605.13831 • Published 14 days ago • 86

upvoted a paper 2 months ago

Mixture-of-Depths Attention

Paper • 2603.15619 • Published Mar 16 • 80

upvoted 2 papers 4 months ago

Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis

Paper • 2602.03139 • Published Feb 3 • 45

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Paper • 2602.01538 • Published Feb 2 • 15

submitted a paper to Daily Papers 4 months ago

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars

Paper • 2602.01538 • Published Feb 2 • 15

upvoted a paper 4 months ago

Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models

Paper • 2601.19834 • Published Jan 27 • 25

liked a model 4 months ago

moonshotai/Kimi-K2.5

Image-Text-to-Text • 1.1T • Updated 27 days ago • 1.47M • • 2.8k

upvoted a paper 4 months ago

X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests

Paper • 2601.06953 • Published Jan 11 • 46

upvoted a paper 5 months ago

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

Paper • 2512.22120 • Published Dec 26, 2025 • 15

upvoted a paper 6 months ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 134

upvoted a paper 7 months ago

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 30

updated a model 7 months ago

comin/OmniVerifier-7B

8B • Updated Oct 23, 2025 • 49 • 4

New activity in comin/ViVerBench 7 months ago

Enhance ViVerBench dataset card: Add metadata, links, and sample usage

#2 opened 7 months ago by

nielsr

liked a dataset 7 months ago

comin/ViVerBench

Viewer • Updated Oct 17, 2025 • 3.59k • 38 • 2

liked a model 7 months ago

comin/OmniVerifier-7B

8B • Updated Oct 23, 2025 • 49 • 4

authored a paper 7 months ago

Generative Universal Verifier as Multimodal Meta-Reasoner

Paper • 2510.13804 • Published Oct 15, 2025 • 28

upvoted a paper 7 months ago

Generative Universal Verifier as Multimodal Meta-Reasoner

Paper • 2510.13804 • Published Oct 15, 2025 • 28

published a model 7 months ago

comin/OmniVerifier-7B

8B • Updated Oct 23, 2025 • 49 • 4

Xinchen Zhang

AI & ML interests

Recent Activity

Organizations

comin's activity

Enhance ViVerBench dataset card: Add metadata, links, and sample usage