Yang

jacklanda

AI & ML interests

Reasoning, Mech Interp, Semantics

Recent Activity

authored a paper 9 days ago

Xetrieval: Mechanistically Explaining Dense Retrieval

updated a collection 11 days ago

Semantics

upvoted a paper 12 days ago

Xetrieval: Mechanistically Explaining Dense Retrieval

Organizations

Collections 3

Papers 14

Spaces 2

Croissant Checker - Dev

🔎

Validate Croissant JSON‑LD for NeurIPS submissions

Distinct

👀

Create a static web page by editing HTML

Models 1

jacklanda/Qwen-2.5-1.5B-Simple-RL

Updated Feb 17, 2025

Datasets 2

jacklanda/SemanticQA

Updated Apr 24 • 131 • 1

jacklanda/LexBench

Preview • Updated May 21, 2024 • 8

Collections 3

Evaluating Generalization Capabilities of LLM-Based Agents in Mixed-Motive Scenarios Using Concordia

$OneMillion-Bench: How Far are Language Agents from Human Experts?

Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

LM-Lexicon: Improving Definition Modeling via Harmonizing Semantic Experts

Revisiting a Pain in the Neck: Semantic Phrase Processing Benchmark for Language Models

Understanding and Leveraging the Expert Specialization of Context Faithfulness in Mixture-of-Experts LLMs

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models
