Training datasets for the ChEmbed family of text embedding models
models 7
BASF-AI/ChEmbed-prog
Feature Extraction • 0.1B • 901
BASF-AI/ChEmbed-vanilla
Feature Extraction • 0.1B • 658
BASF-AI/ChEmbed-plug
Feature Extraction • 0.1B • 646
BASF-AI/ChEmbed-full
Feature Extraction • 0.1B • 676 • 1
BASF-AI/ChemVocab
BASF-AI/nomic-bert-2048
0.1B
BASF-AI/nomic-embed-text-v1.5
Sentence Similarity • 0.1B • 131
datasets 76
BASF-AI/ChemRxivRetrieval
79.5k • 86 • 1
BASF-AI/uspto-title-abs-chem
75.8k • 13
BASF-AI/uspto-synth-query-abs-chem
75.8k • 7
BASF-AI/PlantCAD2_virtual_hackathon
9 • 8
BASF-AI/dolma-pes2o-chemistry
361k • 810 • 1
BASF-AI/ChemRxiv-Papers
30.4k • 79 • 1
BASF-AI/ChemRxiv-Paragraphs
209k • 27 • 2
BASF-AI/ChemRxiv-Train-CC-BY
139k • 25 • 1
BASF-AI/dolma-chem-only-query-generated
1.17M • 18
BASF-AI/ChemRxiv-Train-CC-BY-v2
138k • 6 • 2