Data and models for the paper "How Much Is One Recurrence Worth? Iso-Depth Scaling Laws for Looped Language Models"
Kristian Schwethelm
KristianS7
AI & ML interests
Large Language Models
Recent Activity
updated a bucket 1 day ago
gemma-challenge/gemma-zmaj published a bucket 1 day ago
gemma-challenge/gemma-zmaj new activity 10 days ago
KristianS7/Ouro-1.4B:Add assistant generation tags to chat template