Gen3DEval: Using vLLMs for Automatic Evaluation of Generated 3D Objects Paper • 2504.08125 • Published Apr 10 • 1
Unsupervised 2D-3D lifting of non-rigid objects using local constraints Paper • 2504.19227 • Published Apr 27 • 1
VGRP-Bench: Visual Grid Reasoning Puzzle Benchmark for Large Vision-Language Models Paper • 2503.23064 • Published Mar 29 • 1
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 20 days ago • 134
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 63
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference Paper • 2410.00215 • Published Sep 30, 2024
Altogether: Image Captioning via Re-aligning Alt-text Paper • 2410.17251 • Published Oct 22, 2024
CWM: An Open-Weights LLM for Research on Code Generation with World Models Paper • 2510.02387 • Published Sep 30 • 8
Towards Empathetic Open-domain Conversation Models: a New Benchmark and Dataset Paper • 1811.00207 • Published Nov 1, 2018 • 1
Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills Paper • 2004.08449 • Published Apr 17, 2020 • 1
ROBBIE: Robust Bias Evaluation of Large Generative Language Models Paper • 2311.18140 • Published Nov 29, 2023 • 1
Improving Open Language Models by Learning from Organic Interactions Paper • 2306.04707 • Published Jun 7, 2023 • 3