Echo2334's picture

Echo2334

smy111

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 11 hours ago

Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps

new activity 5 months ago

RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo:DuoAttention(ICLR 2025)

updated a model 5 months ago

RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo

View all activity

Organizations

smy111 's datasets

None public yet