Echo2334
smy111
ยท
AI & ML interests
None yet
Recent Activity
upvoted a paper about 11 hours ago
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps new activity 5 months ago
RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo:DuoAttention(ICLR 2025) updated a model 5 months ago
RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo