Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning Aug 9 • 12
SauerkrautLM-Multilingual-(Reason)-ColBERT Collection SauerkrautLM ColBERT is a suite of Late-Interaction retrieval models built with PyLate’s ColBERT architecture and tuned for seven European languages. • 7 items • Updated Aug 3 • 18
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 158
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116
Open LLM Leaderboard best models ❤️🔥 Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 655
Article 🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦⬛ Oct 21, 2024 • 20
AnglE📐-based Embeddings Collection This collection consists of popular embeddings trained with AnglE: https://github.com/SeanLee97/AnglE • 9 items • Updated Aug 1, 2024 • 3
Article How we leveraged distilabel to create an Argilla 2.0 Chatbot +3 Jul 16, 2024 • 33
A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 12
🇮🇹 Italian NLP Resources Collection Collection of models, datasets and demos relevant to Italian NLP 🇮🇹 • 303 items • Updated Oct 2 • 31
LLM ITA Collection Open-Source Language Models Finetuned for Italian • 4 items • Updated Oct 19, 2024 • 7