Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs Paper • 2401.06209 • Published Jan 11, 2024
Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition Paper • 2309.01262 • Published Sep 3, 2023
LXMERT: Learning Cross-Modality Encoder Representations from Transformers Paper • 1908.07490 • Published Aug 20, 2019
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts Paper • 2111.02358 • Published Nov 3, 2021
BERT Loses Patience: Fast and Robust Inference with Early Exit Paper • 2006.04152 • Published Jun 7, 2020
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference Paper • 1902.01007 • Published Feb 4, 2019
Selecting Informative Contexts Improves Language Model Finetuning Paper • 2005.00175 • Published May 1, 2020
Plug and Play Language Models: A Simple Approach to Controlled Text Generation Paper • 1912.02164 • Published Dec 4, 2019