SAGE: Steerable Agentic Data Generation for Deep Search with Execution Feedback Paper • 2601.18202 • Published Jan 26 • 9
Understanding the Impact of Negative Prompts: When and How Do They Take Effect? Paper • 2406.02965 • Published Jun 5, 2024
RouteLLM: Learning to Route LLMs with Preference Data Paper • 2406.18665 • Published Jun 26, 2024 • 7
Multi-hop Evidence Retrieval for Cross-document Relation Extraction Paper • 2212.10786 • Published Dec 21, 2022 • 1
GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles Paper • 2205.12505 • Published May 25, 2022
AMPERE: AMR-Aware Prefix for Generation-Based Event Argument Extraction Model Paper • 2305.16734 • Published May 26, 2023
TAGPRIME: A Unified Framework for Relational Structure Extraction Paper • 2205.12585 • Published May 25, 2022
Summarization as Indirect Supervision for Relation Extraction Paper • 2205.09837 • Published May 19, 2022 • 1
DEGREE: A Data-Efficient Generation-Based Event Extraction Model Paper • 2108.12724 • Published Aug 29, 2021
Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument Extraction Paper • 2203.08308 • Published Mar 15, 2022
CaLM: Contrasting Large and Small Language Models to Verify Grounded Generation Paper • 2406.05365 • Published Jun 8, 2024 • 1
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline Paper • 2406.11939 • Published Jun 17, 2024 • 8
From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline Paper • 2406.11939 • Published Jun 17, 2024 • 8
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference Paper • 2403.04132 • Published Mar 7, 2024 • 40
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 40
LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset Paper • 2309.11998 • Published Sep 21, 2023 • 27