David Leon

DavidLeon

https://www.linkedin.com/in/daweileng/

AI & ML interests

AIGC & LMM

Recent Activity

upvoted a paper about 2 months ago

FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer

Paper • 2508.05069 • Published Aug 7, 2025 • 1

updated a model about 2 months ago

qihoo360/FLUX-Makeup

Updated Jan 29

upvoted a paper about 2 months ago

Generation-Augmented Generation: A Plug-and-Play Framework for Private Knowledge Injection in Large Language Models

Paper • 2601.08209 • Published Jan 13 • 1

upvoted 2 papers 4 months ago

RzenEmbed: Towards Comprehensive Multimodal Retrieval

Paper • 2510.27350 • Published Oct 31, 2025 • 1

EVTAR: End-to-End Try on with Additional Unpaired Visual Reference

Paper • 2511.00956 • Published Nov 2, 2025 • 5

New activity in qihoo360/RzenEmbed 4 months ago

Update README.md

#1 opened 4 months ago by DavidLeon

commented on a paper 5 months ago

FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

Paper • 2510.10921 • Published Oct 13, 2025 • 11

liked a model 5 months ago

qihoo360/fg-clip2-base

Zero-Shot Image Classification • 0.4B • Updated Nov 6, 2025 • 1.76k • 24

upvoted a collection 5 months ago

FG-CLIP 2

Collection

FG-CLIP 2 is the foundation model for fine-grained vision-language understanding in both English and Chinese. • 10 items • Updated Nov 6, 2025 • 5

upvoted a paper 5 months ago

FG-CLIP 2: A Bilingual Fine-grained Vision-Language Alignment Model

Paper • 2510.10921 • Published Oct 13, 2025 • 11

liked a Space 6 months ago

MMEB Leaderboard

📊

104

The massive multimodal embedding benchmark

authored a paper 8 months ago

Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection

Paper • 2502.16223 • Published Feb 22, 2025

liked a model 8 months ago

qihoo360/fg-clip-base

Zero-Shot Image Classification • 0.2B • Updated Oct 9, 2025 • 1.47k • 10

upvoted a collection 8 months ago

FG-CLIP

Collection

New generation of CLIP with strong fine grained discrimination capability • 6 items • Updated Oct 15, 2025 • 4

commented on a paper 11 months ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

Paper • 2505.05071 • Published May 8, 2025 • 18

upvoted a paper 11 months ago

FG-CLIP: Fine-Grained Visual and Textual Alignment

Paper • 2505.05071 • Published May 8, 2025 • 18

authored a paper about 1 year ago

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

Paper • 2502.14377 • Published Feb 20, 2025 • 12

authored 3 papers over 1 year ago

Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities

Paper • 2309.00952 • Published Sep 2, 2023

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

Paper • 2408.08189 • Published Aug 15, 2024 • 17

Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task

Paper • 2409.04005 • Published Sep 6, 2024 • 19
