arxiv:2605.16928
Richard ZHou
zykRichard
AI & ML interests
None yet
Recent Activity
authored a paper 1 day ago
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps
updated a model 5 months ago
RTP-LLM/Qwen3-Coder-30B-A3B-Instruct-RTPurbo