A collection of items telated the the MMTEB release
AI & ML interests
Massive Text Embeddings Benchmark
Recent Activity
Papers
MAEB: Massive Audio Embedding Benchmark
HUME: Measuring the Human-Model Performance Gap in Text Embedding Task
Organization Card
MTEB is a Python framework for evaluating embeddings and retrieval systems for both text and image. MTEB covers more than 1000 languages and diverse tasks, from classics like classification and clustering to use-case specialized tasks such as legal, code, or healthcare retrieval.
You can get started using mteb, check out our documentation.
| Overview | |
|---|---|
| π Leaderboard | The interactive leaderboard of the benchmark |
| Get Started. | |
| π Get Started | Overview of how to use mteb |
| π€ Defining Models | How to use existing model and define custom ones |
| π Selecting tasks | How to select tasks, benchmarks, splits etc. |
| π Running Evaluation | How to run the evaluations, including cache management, speeding up evaluations etc. |
| π Loading Results | How to load and work with existing model results |
| Overview. | |
| π Tasks | Overview of available tasks |
| π Benchmarks | Overview of available benchmarks |
| π€ Models | Overview of available Models |
| Contributing | |
| π€ Adding a model | How to submit a model to MTEB and to the leaderboard |
| π©βπ» Adding a dataset | How to add a new task/dataset to MTEB |
| π©βπ» Adding a benchmark | How to add a new benchmark to MTEB and to the leaderboard |
| π€ Contributing | How to contribute to MTEB and set it up for development |
spaces 5
pinned
Running on CPU Upgrade
7.11k
MTEB Leaderboard
π₯
Embedding Leaderboard
Running
37
MTEB Legacy Leaderboard
π₯
Explore and filter MTEB model benchmark results
Running
Featured
11
Leaderboard Dev
π’
Dedicated display for RTEB benchmark results
Running
116
MTEB Arena
β
Display MTEB Arena interface
datasets 1,542
mteb/llm-eval-amazon_reviews
Viewer
β’ Updated
β’ 1.2M β’ 28
mteb/llm-eval-banking77
Viewer
β’ Updated
β’ 13.1k β’ 13
mteb/llm-eval-dbpedia_14
Viewer
β’ Updated
β’ 2.55k β’ 12
mteb/llm-eval-emotion
Viewer
β’ Updated
β’ 16.5k β’ 15
mteb/llm-eval-news_classification
Viewer
β’ Updated
β’ 121k β’ 15
mteb/llm-eval-financial_phrasebank
Viewer
β’ Updated
β’ 2.76k β’ 14
mteb/llm-eval-imdb
Viewer
β’ Updated
β’ 25.4k β’ 41
mteb/results
Updated
β’ 324k β’ 1
mteb/rag-quest-128k
Viewer
β’ Updated
β’ 666 β’ 11
mteb/rag-quest-32k
Viewer
β’ Updated
β’ 121 β’ 13