AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments
CUA-Gym: Scaling Verifiable Training Environments and Tasks for Computer-Use Agents
-
QwenScope
🔥31Explore and steer Qwen3 model features with interactive heatmaps
-
Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models
Paper • 2605.11887 • Published • 11 -
Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_50
Updated • 104 • 37 -
Qwen/SAE-Res-Qwen3.5-2B-Base-W32K-L0_50
Updated • 189 • 10
-
Qwen/Qwen3.5-397B-A17B
Image-Text-to-Text • 403B • Updated • 1.13M • • 1.49k -
Qwen/Qwen3.5-397B-A17B-FP8
Image-Text-to-Text • 403B • Updated • 1.01M • 173 -
Qwen/Qwen3.5-122B-A10B
Image-Text-to-Text • 125B • Updated • 870k • • 557 -
Qwen/Qwen3.5-122B-A10B-FP8
Image-Text-to-Text • 125B • Updated • 800k • 100
-
Qwen/Qwen3-ASR-1.7B
Automatic Speech Recognition • 2B • Updated • 1.93M • 839 -
Qwen/Qwen3-ASR-0.6B
Automatic Speech Recognition • 0.9B • Updated • 863k • 295 -
Qwen/Qwen3-ForcedAligner-0.6B
Automatic Speech Recognition • 0.9B • Updated • 416k • 136 -
Qwen3-ASR Demo
🎙139Transcribe audio to text with timestamps and visualization
-
Qwen3 VL Demo
😻440Chat with an AI that sees images and videos
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 8.58k • • 396 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 1.76M • • 389 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 7.22k • 28
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 31.5k • 84 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 47k • • 405 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 493k • 147 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 135k • • 782
End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5
The long-context version of Qwen2.5, supporting 1M-token context lengths
Qwen with Questions
Math-specific model series based on Qwen2.5
Vision-language model series based on Qwen2
Math-specific model series based on Qwen2
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 374k • • 1.02k -
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • 81B • Updated • 33.8k • • 489 -
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • 81B • Updated • 182k • 90 -
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • 81B • Updated • 3.99k • 55
-
Qwen3 Coder WebDev
🌍1.08kGenerate web app code from a simple description
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 27.3k • • 1.34k -
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • 480B • Updated • 172k • • 153 -
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • 31B • Updated • 1.92M • • 1.08k
Vision-language model series based on Qwen2.5
QVQ: Qwen models for visual reasoning
Code-specific model series based on Qwen2.5
-
Qwen2.5 Coder Artifacts
🐢1.74kGenerate and preview app code from a text description
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.15M • • 2.03k -
Qwen/Qwen2.5-Coder-32B
Text Generation • 33B • Updated • 15.4k • • 155 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 156
Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.
Audio-language model series based on Qwen2
Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B.
Qwen
-
QwenScope
🔥31Explore and steer Qwen3 model features with interactive heatmaps
-
Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models
Paper • 2605.11887 • Published • 11 -
Qwen/SAE-Res-Qwen3.5-27B-W80K-L0_50
Updated • 104 • 37 -
Qwen/SAE-Res-Qwen3.5-2B-Base-W32K-L0_50
Updated • 189 • 10
-
Qwen/Qwen3.5-397B-A17B
Image-Text-to-Text • 403B • Updated • 1.13M • • 1.49k -
Qwen/Qwen3.5-397B-A17B-FP8
Image-Text-to-Text • 403B • Updated • 1.01M • 173 -
Qwen/Qwen3.5-122B-A10B
Image-Text-to-Text • 125B • Updated • 870k • • 557 -
Qwen/Qwen3.5-122B-A10B-FP8
Image-Text-to-Text • 125B • Updated • 800k • 100
-
Qwen/Qwen3-ASR-1.7B
Automatic Speech Recognition • 2B • Updated • 1.93M • 839 -
Qwen/Qwen3-ASR-0.6B
Automatic Speech Recognition • 0.9B • Updated • 863k • 295 -
Qwen/Qwen3-ForcedAligner-0.6B
Automatic Speech Recognition • 0.9B • Updated • 416k • 136 -
Qwen3-ASR Demo
🎙139Transcribe audio to text with timestamps and visualization
-
Qwen3 VL Demo
😻440Chat with an AI that sees images and videos
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 8.58k • • 396 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 1.76M • • 389 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 7.22k • 28
-
Qwen/Qwen3-Next-80B-A3B-Instruct
Text Generation • 81B • Updated • 374k • • 1.02k -
Qwen/Qwen3-Next-80B-A3B-Thinking
Text Generation • 81B • Updated • 33.8k • • 489 -
Qwen/Qwen3-Next-80B-A3B-Instruct-FP8
Text Generation • 81B • Updated • 182k • 90 -
Qwen/Qwen3-Next-80B-A3B-Thinking-FP8
Text Generation • 81B • Updated • 3.99k • 55
-
Qwen3 Coder WebDev
🌍1.08kGenerate web app code from a simple description
-
Qwen/Qwen3-Coder-480B-A35B-Instruct
Text Generation • 480B • Updated • 27.3k • • 1.34k -
Qwen/Qwen3-Coder-480B-A35B-Instruct-FP8
Text Generation • 480B • Updated • 172k • • 153 -
Qwen/Qwen3-Coder-30B-A3B-Instruct
Text Generation • 31B • Updated • 1.92M • • 1.08k
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 31.5k • 84 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 47k • • 405 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 493k • 147 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 135k • • 782
End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5
Vision-language model series based on Qwen2.5
The long-context version of Qwen2.5, supporting 1M-token context lengths
QVQ: Qwen models for visual reasoning
Qwen with Questions
Code-specific model series based on Qwen2.5
-
Qwen2.5 Coder Artifacts
🐢1.74kGenerate and preview app code from a text description
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 1.15M • • 2.03k -
Qwen/Qwen2.5-Coder-32B
Text Generation • 33B • Updated • 15.4k • • 155 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 156
Math-specific model series based on Qwen2.5
Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B.
Vision-language model series based on Qwen2
Audio-language model series based on Qwen2
Math-specific model series based on Qwen2
Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B.
Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud.
Qwen