view article Article Saving Memory Using Padding-Free Transformer Layers during Finetuning Jun 11, 2024 • 20
Granite Code Models: A Family of Open Foundation Models for Code Intelligence Paper • 2405.04324 • Published May 7, 2024 • 25