Growing Through Experience: Scaling Episodic Grounding in Language Models Paper • 2506.01312 • Published Jun 2, 2025
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Paper • 2411.10606 • Published Nov 15, 2024 • 1
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement Paper • 2504.16053 • Published Apr 22, 2025
SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs Paper • 2510.05069 • Published Oct 6, 2025 • 13
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models Paper • 2507.14204 • Published Jul 14, 2025
Superficial Self-Improved Reasoners Benefit from Model Merging Paper • 2503.02103 • Published Mar 3, 2025
Mitigating Forgetting Between Supervised and Reinforcement Learning Yields Stronger Reasoners Paper • 2510.04454 • Published Oct 6, 2025