Papers
WebWorld: A Large-Scale World Model for Web Agent Training
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
- Qwen/Qwen3-ASR-1.7B
  Automatic Speech Recognition • Updated • 348k • 495
- Qwen/Qwen3-ASR-0.6B
  Automatic Speech Recognition • Updated • 80.3k • 208
- Qwen/Qwen3-ForcedAligner-0.6B
  Automatic Speech Recognition • Updated • 40.6k • 84
- Qwen3-ASR Demo
  🎙 101 • Transcribe audio to text with multi-language timestamps
- Qwen3 VL Demo
  😻 381 • Chat with an AI using images and text
- Qwen/Qwen3-VL-235B-A22B-Thinking
  Image-Text-to-Text • 236B • Updated • 2.44M • 378
- Qwen/Qwen3-VL-235B-A22B-Instruct
  Image-Text-to-Text • 236B • Updated • 404k • 369
- Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
  Image-Text-to-Text • 236B • Updated • 24.3k • 26
- Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
  Text Generation • 235B • Updated • 69.9k • 80
- Qwen/Qwen3-235B-A22B-Thinking-2507
  Text Generation • Updated • 46.6k • 398
- Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
  Text Generation • 235B • Updated • 738k • 145
- Qwen/Qwen3-235B-A22B-Instruct-2507
  Text Generation • Updated • 139k • 762
End-to-end omni model (text, audio, image, video, and natural speech interaction) based on Qwen2.5
The long-context version of Qwen2.5, supporting 1M-token context lengths
Qwen with Questions
Math-specific model series based on Qwen2.5
Vision-language model series based on Qwen2
Math-specific model series based on Qwen2
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
- Qwen/Qwen3-Next-80B-A3B-Instruct
  Text Generation • Updated • 905k • 937
- Qwen/Qwen3-Next-80B-A3B-Thinking
  Text Generation • 81B • Updated • 81.7k • 480
- Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
  Text Generation • 81B • Updated • 107k • 77
- Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
  Text Generation • Updated • 74.2k • 49
- Qwen3 Coder WebDev
  🌍 986 • Generate HTML/React code from a web app description
- Qwen/Qwen3-Coder-480B-A35B-Instruct
  Text Generation • Updated • 42.6k • 1.3k
- Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
  Text Generation • Updated • 197k • 148
- Qwen/Qwen3-Coder-30B-A3B-Instruct
  Text Generation • 31B • Updated • 681k • 945
Vision-language model series based on Qwen2.5
- Qwen2.5 VL 32B Instruct Demo
  🏃 164 • Chat with a multimodal AI using text, images, or video
- Qwen2.5-VL Technical Report
  Paper • 2502.13923 • Published • 213
- Qwen/Qwen2.5-VL-32B-Instruct
  Image-Text-to-Text • Updated • 521k • 476
- Qwen/Qwen2.5-VL-72B-Instruct
  Image-Text-to-Text • Updated • 160k • 593
QVQ: Qwen models for visual reasoning
Code-specific model series based on Qwen2.5
- Qwen2.5 Coder Artifacts
  🐢 1.72k • Generate and preview web app code from a description
- Qwen/Qwen2.5-Coder-32B-Instruct
  Text Generation • 33B • Updated • 725k • 1.99k
- Qwen/Qwen2.5-Coder-32B
  Text Generation • 33B • Updated • 34.9k • 138
- Qwen2.5-Coder Technical Report
  Paper • 2409.12186 • Published • 153
Qwen2.5 language models: pretrained and instruction-tuned models in 7 sizes (0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B).
Audio-language model series based on Qwen2
Qwen2 language models: pretrained and instruction-tuned models in 5 sizes (0.5B, 1.5B, 7B, 57B-A14B, and 72B).
Qwen