Recent Activity
Papers
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data
A collection of TwinFlow-accelerated diffusion models
GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute.
- inclusionAI/Ling-lite-1.5-2507
  Text Generation • 17B • Updated • 35 • 77
- inclusionAI/Ling-lite-1.5-2506
  Text Generation • 17B • Updated • 67 • 53
- inclusionAI/Ling-lite-1.5
  Text Generation • 17B • Updated • 24.8k • 58
- inclusionAI/Ling-lite-base-1.5
  Text Generation • 17B • Updated • 30 • 34
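A minimal sketch of how one of the Ling-lite text-generation checkpoints above could be loaded with Hugging Face transformers. It assumes the repo follows the standard causal-LM interface, ships a chat template, and may require trust_remote_code; the model card is authoritative, and the prompt is purely illustrative.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "inclusionAI/Ling-lite-1.5"  # any of the Ling-lite repos listed above

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # assumption: bf16 weights fit the target GPU
    device_map="auto",
    trust_remote_code=True,       # MoE architectures often ship custom modeling code
)

# Illustrative prompt; the repo's own chat template formats it for the model.
messages = [{"role": "user", "content": "Summarize mixture-of-experts LLMs in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```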
AReaL-boba-2
The newest flagship non-reasoning model series.
Ming is the multi-modal series of any-to-any models developed by the Ant Ling team.
- inclusionAI/Ming-flash-omni-2.0
  Any-to-Any • Updated • 7.21k • 260
- inclusionAI/Ming-omni-tts-16.8B-A3B
  Text-to-Speech • 18B • Updated • 453 • 32
- inclusionAI/Ming-omni-tts-0.5B
  Text-to-Speech • 2B • Updated • 5.9k • 35
- inclusionAI/Ming-omni-tts-tokenizer-12Hz
  Audio-to-Audio • 0.8B • Updated • 18 • 8
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception
Paper • 2602.11858 • Published • 63
- inclusionAI/ZwZ-4B
  Image-Text-to-Text • 5B • Updated • 255 • 32
- inclusionAI/ZwZ-8B
  Image-Text-to-Text • 9B • Updated • 9.2k • 45
- inclusionAI/ZwZ-RL-VQA
  Viewer • Updated • 74k • 1.09k • 12
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model
Paper • 2604.20796 • Published • 226
- inclusionAI/LLaDA2.0-Uni
  Any-to-Any • 16B • Updated • 281 • 166

LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 88
- inclusionAI/LLaDA2.0-mini-CAP
  Text Generation • 16B • Updated • 6.79k • 10
Ring is a reasoning MoE LLM open-sourced by InclusionAI and derived from Ling.
The Agent Runtime for Self-Improvement
UI-Venus-1.5 Technical Report
Paper • 2602.09082 • Published • 157
- inclusionAI/UI-Venus-1.5-30B-A3B
  Image-Text-to-Text • 31B • Updated • 2.65k • 26
- inclusionAI/UI-Venus-1.5-8B
  Image-Text-to-Text • 9B • Updated • 5.46k • 27
- inclusionAI/UI-Venus-1.5-2B
  Image-Text-to-Text • 2B • Updated • 1.13k • 36
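A minimal sketch of querying one of the UI-Venus-1.5 image-text-to-text checkpoints above via the generic transformers pipeline. Whether these repos work with this pipeline, and which prompt format they expect for GUI grounding, is an assumption; the screenshot URL is a placeholder, and the model card should be treated as authoritative.

```python
from transformers import pipeline

# Smallest variant in the collection; swap in the 8B or 30B-A3B repo as needed.
pipe = pipeline(
    "image-text-to-text",
    model="inclusionAI/UI-Venus-1.5-2B",
    trust_remote_code=True,  # assumption: the repo may ship custom modeling code
)

# Chat-style request combining a screenshot (placeholder URL) with an instruction.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "https://example.com/screenshot.png"},  # placeholder
            {"type": "text", "text": "Locate the search button on this screen."},
        ],
    }
]

print(pipe(text=messages, max_new_tokens=64, return_full_text=False))
```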
Ming-Omni: A Unified Multimodal Model for Perception and Generation
Paper • 2506.09344 • Published • 32
- inclusionAI/Ming-Lite-Omni
  Any-to-Any • 19B • Updated • 32 • 199
- inclusionAI/Ming-Lite-Omni-1.5
  Any-to-Any • Updated • 263 • 86
- inclusionAI/Ming-UniAudio-16B-A3B
  Any-to-Any • 18B • Updated • 64 • 79