Ministral 3 Collection A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 9 days ago • 122
Using the Output Embedding to Improve Language Models Paper • 1608.05859 • Published Aug 20, 2016 • 1
WhiteRabbitNeo-V3 Collection The latest and most capable cybersecurity model we've ever created • 1 item • Updated Jun 25 • 12
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated Jul 21 • 160
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 49 items • Updated 17 days ago • 134
Code Llama Family Collection This collection hosts the transformers repos of the Code Llama release • 12 items • Updated Dec 6, 2024 • 61
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 647
Llama 2 Family Collection This collection hosts the transformers and original repos of the Llama 2 and Llama Guard releases • 13 items • Updated Dec 6, 2024 • 92