RaBiT: Residual-Aware Binarization Training for Accurate and Efficient LLMs Paper • 2602.05367 • Published 10 days ago • 7 upvotes
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 10 days ago • 41 upvotes
POP: Prefill-Only Pruning for Efficient Large Model Inference Paper • 2602.03295 • Published 12 days ago • 4 upvotes
Fairy2i: Training Complex LLMs from Real LLMs with All Parameters in {pm 1, pm i} Paper • 2512.02901 • Published Dec 2, 2025 • 6 upvotes
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models Paper • 2511.23319 • Published Nov 28, 2025 • 24 upvotes
Metis: Training Large Language Models with Advanced Low-Bit Quantization Paper • 2509.00404 • Published Aug 30, 2025 • 7 upvotes