Gecko: Versatile Text Embeddings Distilled from Large Language Models Paper • 2403.20327 • Published Mar 29, 2024 • 48
Transforming LLMs into Cross-modal and Cross-lingual Retrieval Systems Paper • 2404.01616 • Published Apr 2, 2024
EmbeddingGemma: Powerful and Lightweight Text Representations Paper • 2509.20354 • Published Sep 24, 2025 • 41
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query Language Paper • 2011.03836 • Published Nov 7, 2020
Multilingual Universal Sentence Encoder for Semantic Retrieval Paper • 1907.04307 • Published Jul 9, 2019
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation Paper • 2205.12647 • Published May 25, 2022
Sentence-T5: Scalable Sentence Encoders from Pre-trained Text-to-Text Models Paper • 2108.08877 • Published Aug 19, 2021 • 2
MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models Paper • 2005.02507 • Published May 5, 2020
SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused Evaluation Paper • 1708.00055 • Published Jul 31, 2017
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval Paper • 2311.05800 • Published Nov 10, 2023 • 4