view article Article DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models about 19 hours ago β’ 23
view article Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 6 days ago β’ 60
Gemma 4 Collection Gemma 4 is Google's new model family including including E2B, E4B, 26B-A4B, and 31B. β’ 28 items β’ Updated 6 days ago β’ 154
ByT5: Towards a token-free future with pre-trained byte-to-byte models Paper β’ 2105.13626 β’ Published May 28, 2021 β’ 5
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings β’ 7 items β’ Updated Feb 26 β’ 96
LateOn-Code π» Collection State-of-the-art late interaction code retrieval models β’ 6 items β’ Updated 15 days ago β’ 18
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 β’ 53
view article Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 β’ 92
PyLate π Collection State-of-the-art late interaction models trained using PyLate β’ 5 items β’ Updated 15 days ago β’ 4