Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29, 2025 • 94
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper • 2404.06654 • Published Apr 9, 2024 • 39