3 115 52

Maozhou Ge

Gmc2

GHGmc2

AI & ML interests

None yet

Recent Activity

liked a model 11 days ago

deepseek-ai/DeepSeek-V4-Pro-Base

liked a model 11 days ago

deepseek-ai/DeepSeek-V4-Pro

upvoted a collection 11 days ago

DeepSeek-V4

View all activity

Organizations

None yet

liked 2 models 11 days ago

deepseek-ai/DeepSeek-V4-Pro-Base

1.6T • Updated 8 days ago • 3.18k • 255

deepseek-ai/DeepSeek-V4-Pro

Text Generation • 862B • Updated 8 days ago • 535k • • 3.52k

upvoted a collection 11 days ago

DeepSeek-V4

Collection

4 items • Updated 11 days ago • 611

upvoted a paper 13 days ago

OneFlow: Redesign the Distributed Deep Learning Framework from Scratch

Paper • 2110.15032 • Published Oct 28, 2021 • 2

liked a Space 26 days ago

Model Explorer

👓

Explore and visualize machine learning models

liked 2 models 28 days ago

ggml-org/gemma-4-E2B-it-GGUF

5B • Updated 22 days ago • 127k • 68

google/gemma-4-E2B-it

Any-to-Any • 5B • Updated 6 days ago • 3.35M • 565

upvoted an article about 2 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Mar 10

•

142

upvoted a paper about 2 months ago

The Art of Scaling Reinforcement Learning Compute for LLMs

Paper • 2510.13786 • Published Oct 15, 2025 • 33

upvoted a collection 2 months ago

Deepseek Papers

Collection

Deepseek papers collection • 31 items • Updated about 18 hours ago • 340

liked 2 models 2 months ago

zai-org/GLM-5

Text Generation • 754B • Updated 30 days ago • 410k • • 2.08k

Qwen/Qwen3.5-397B-A17B

Image-Text-to-Text • 403B • Updated 11 days ago • 563k • • 1.47k

liked a Space 3 months ago

Sparsity Viz

📉

Explore MoE model sparsity across many LLMs

upvoted an article 3 months ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

•

269

liked a Space 4 months ago

Megatron Memory Estimator

👁

Estimate GPU memory usage for Megatron models

upvoted an article 4 months ago

Article

Introduction to ggml

Aug 13, 2024

•

280

upvoted a paper 4 months ago

Hyper-Connections

Paper • 2409.19606 • Published Sep 29, 2024 • 27

upvoted a paper 5 months ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 106

liked a model 5 months ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 11.5M • • 1.43k

upvoted a collection 6 months ago

LLaDA2.0

Collection

9 items • Updated 12 days ago • 44

Maozhou Ge

AI & ML interests

Recent Activity

Organizations

Gmc2's activity

Model Explorer

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Sparsity Viz

Visualize and understand GPU memory in PyTorch

Megatron Memory Estimator

Introduction to ggml