These are the flagship POTION models. You can load and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers.
Minish
non-profit
AI & ML interests
small models
Organization Card
Hello, we're Minish!
About us
We're a two-person open-source lab (pringled and stephantul) focused on Natural Language Processing.
We believe that if you make models fast enough, you unlock new possibilities.
Using our models and packages, you can:
- Embed the entire English Wikipedia in 5 minutes
- Classify tens of thousands of documents per second on a CPU
- Approximately deduplicate extremely large datasets in minutes
- Build the fastest RAG application in the world
- Easily evaluate which ANN algorithm works best for your data
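To make the deduplication item above more concrete, here is a minimal sketch of threshold-based near-duplicate removal in plain Python. It is not how semhash is implemented: the toy `embed` function (a normalized bag of words), the `deduplicate` helper, and the 0.8 threshold are all assumptions chosen for illustration. A real pipeline would presumably use a static embedding model and an approximate nearest neighbor index instead of the brute-force comparison shown here.

```python
import math
from collections import Counter


def embed(text: str) -> dict[str, float]:
    """Toy stand-in for an embedding model: L2-normalized bag of words."""
    counts = Counter(text.lower().split())
    norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
    return {token: c / norm for token, c in counts.items()}


def cosine(a: dict[str, float], b: dict[str, float]) -> float:
    """Cosine similarity of two sparse vectors that are already normalized."""
    return sum(weight * b.get(token, 0.0) for token, weight in a.items())


def deduplicate(texts: list[str], threshold: float = 0.8) -> list[str]:
    """Keep a text only if it is not too similar to any text kept so far."""
    kept: list[str] = []
    kept_vectors: list[dict[str, float]] = []
    for text in texts:
        vector = embed(text)
        # Brute-force scan over kept items; an ANN index would replace this
        # loop when the dataset is large.
        if all(cosine(vector, other) < threshold for other in kept_vectors):
            kept.append(text)
            kept_vectors.append(vector)
    return kept
```

Calling `deduplicate` on a list of strings returns the first occurrence of each group of near-duplicates, in input order.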
Our projects:
- model2vec: tiny static embedding models with state-of-the-art performance.
- potion: the best small models in the world. 100-500x faster than a sentence-transformer, and almost as good.
- vicinity: consistent interfaces to many approximate nearest neighbor algorithms.
- semhash: lightning-fast, super accurate semantic deduplication and filtering for your text datasets.
- model2vec-rs: a Rust port of model2vec.
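As a short illustration of how a potion model can be used through model2vec, the sketch below loads a model from the Hugging Face Hub and embeds a list of texts. The helper name `embed_with_potion` is hypothetical, and `minishlab/potion-base-8M` is just one of the models from this organization picked as a default. It assumes model2vec is installed (`pip install model2vec`) and that the Hub is reachable on first use.

```python
def embed_with_potion(texts, model_name="minishlab/potion-base-8M"):
    """Embed a list of strings with a POTION static embedding model.

    Hypothetical helper for illustration; model_name is an assumed default.
    """
    # Imported lazily so the helper can be defined without model2vec installed.
    from model2vec import StaticModel

    # Downloads the model from the Hugging Face Hub on first use.
    model = StaticModel.from_pretrained(model_name)

    # encode returns one embedding vector per input text.
    return model.encode(texts)
```

For example, `embed_with_potion(["small models are fast", "so are static embeddings"])` should return two embedding vectors, one per input sentence.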
You can also find us on: 🔬 GitHub 👽 LinkedIn 💬 Discord
models 14
minishlab/potion-code-16M
Updated • 195 • 9
minishlab/potion-multilingual-128M
Updated • 101k • 50
minishlab/potion-base-32M
Updated • 80.4k • 25
minishlab/potion-base-8M
Updated • 1.07M • 77
minishlab/potion-base-4M
Updated • 725k • 9
minishlab/potion-base-2M
Updated • 14.5k • 17
minishlab/potion-retrieval-32M
Updated • 267k • 28
minishlab/M2V_base_output
Updated • 38k • 10
minishlab/potion-8m-edu-classifier
Updated • 5 • 2
minishlab/potion-science-8M
Updated • 14 • 2
datasets 6
minishlab/tokenlearn-cornstack-queries-coderankembed
Viewer • Updated • 300k • 29 • 1
minishlab/tokenlearn-cornstack-docs-coderankembed
Viewer • Updated • 300k • 28 • 2
minishlab/tokenlearn-c4-multilingual-bge-m3
Viewer • Updated • 12M • 720 • 2
minishlab/tokenlearn-c4-en-bge-base-en-v1.5
Viewer • Updated • 10M • 421 • 2
minishlab/my-vicinity-repo
Viewer • Updated • 5 • 12 • 2
minishlab/tokenlearn_C4
Updated • 11 • 2