GUI-based workflow
Papers
Learning from Language Feedback via Variational Policy Distillation
The Illusion of Certainty: Decoupling Capability and Calibration in On-Policy Distillation
A collection of Salesforce's finance-specific models
CoDA is Salesforce AI Research's open, lightweight and diffusion-based language model.
SweRank is a framework for software issue localization, combining an embedding-based retriever (SweRankEmbed) with an LLM-based reranker (SweRankLLM).
A collection of all XGen-MM (Foundation LMM) models!
- Salesforce/xgen-mm-phi3-mini-instruct-interleave-r-v1.5
  Image-Text-to-Text • 4B • Updated • 401 • 59
- Salesforce/blip3-kale
  Viewer • Updated • 235M • 4.31k • 46
- Salesforce/xgen-mm-vid-phi3-mini-r-v1.5-128tokens-8frames
  Image-Text-to-Text • 4B • Updated • 52 • 11
- Salesforce/xgen-mm-phi3-mini-instruct-r-v1
  Image-Text-to-Text • 5B • Updated • 156 • 185
This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets.
A collection of contextual QA datasets for evaluating the contextual faithfulness of LLMs
A collection of all BLIP2 models!
- Salesforce/blip2-opt-2.7b
  Image-Text-to-Text • 4B • Updated • 588k • 447
- Salesforce/blip2-flan-t5-xxl
  Image-Text-to-Text • 12B • Updated • 774 • 94
- Salesforce/blip2-opt-6.7b-coco
  Image-Text-to-Text • 8B • Updated • 333 • 35
- Salesforce/blip2-opt-6.7b
  Image-Text-to-Text • 8B • Updated • 77.5k • 80
- Salesforce/moirai-2.0-R-small
  Time Series Forecasting • Updated • 421k • 43
- Salesforce/moirai-moe-1.0-R-base
  Time Series Forecasting • 0.9B • Updated • 154k • 19
- Salesforce/moirai-1.1-R-small
  Time Series Forecasting • 13.8M • Updated • 68.4k • 7
- Salesforce/moirai-1.1-R-base
  Time Series Forecasting • 91.4M • Updated • 17.3k • 8
xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM
FARE are Salesforce AI Research's open multi-task evaluator models.
A collection of GUI grounding models trained with GRPO.
This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP.
A collection of all BLIP models
- Salesforce/blip-image-captioning-large
  Image-to-Text • 0.5B • Updated • 690k • 1.48k
- Salesforce/blip-image-captioning-base
  Image-to-Text • Updated • 2.06M • 860
- Salesforce/blip-vqa-base
  Visual Question Answering • 0.4B • Updated • 365k • 194
- Salesforce/blip-vqa-capfilt-large
  Visual Question Answering • Updated • 16.8k • 54
A collection that contains all InstructBLIP models!
- Salesforce/instructblip-vicuna-7b
  Image-Text-to-Text • 8B • Updated • 8.83k • 102
- Salesforce/instructblip-vicuna-13b
  Image-Text-to-Text • 14B • Updated • 102 • 43
- Salesforce/instructblip-flan-t5-xxl
  Image-Text-to-Text • 12B • Updated • 154 • 21
- Salesforce/instructblip-flan-t5-xl
  Image-Text-to-Text • 4B • Updated • 6.07k • 30
A collection of embedding models
- Salesforce/SFR-Embedding-2_R
  Feature Extraction • 7B • Updated • 601k • 94
- Salesforce/SFR-Embedding-Mistral
  Feature Extraction • 7B • Updated • 7.42k • 299
- Salesforce/SFR-Embedding-Code-2B_R
  Feature Extraction • 3B • Updated • 3.61k • 49
- Salesforce/SFR-Embedding-Code-400M_R
  Feature Extraction • 0.4B • Updated • 17.8k • 35