Echo2334

smy111

AI & ML interests

None yet

Recent Activity

upvoted a paper about 12 hours ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

Paper • 2605.16928 • Published 7 days ago • 70

New activity in RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo 5 months ago

DuoAttention(ICLR 2025)

#1 opened 5 months ago by

updated a model 5 months ago

RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo

31B • Updated Dec 29, 2025 • 1 • 2

published a model 5 months ago

RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo

31B • Updated Dec 29, 2025 • 1 • 2