GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment Paper • 2605.19577 • Published 14 days ago • 58
Full Attention Strikes Back: Transferring Full Attention into Sparse within Hundred Training Steps Paper • 2605.16928 • Published 17 days ago • 93
Orthrus: Memory-Efficient Parallel Token Generation via Dual-View Diffusion Paper • 2605.12825 • Published 21 days ago • 12
Refusal in Language Models Is Mediated by a Single Direction Paper • 2406.11717 • Published Jun 17, 2024 • 13
dragonkue/snowflake-arctic-embed-l-v2.0-ko Sentence Similarity • 0.6B • Updated Oct 16, 2025 • 22k • • 47