Woof woof's picture

11 5

Woof woof

WoofWoof

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

MMGR: Multi-Modal Generative Reasoning

upvoted a paper about 1 month ago

Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

upvoted a paper 2 months ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

View all activity

Organizations

None yet

upvoted a paper 22 days ago

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 22 days ago • 114

upvoted a paper about 1 month ago

Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

Paper • 2511.09586 • Published Nov 12, 2025 • 2

upvoted 2 papers 2 months ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published Nov 4, 2025 • 20

CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation

Paper • 2510.17853 • Published Oct 15, 2025 • 7

upvoted 2 papers 3 months ago

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting

Paper • 2505.18822 • Published May 24, 2025 • 15

UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios

Paper • 2509.21766 • Published Sep 26, 2025 • 23

upvoted a paper 5 months ago

Diversity-Enhanced Reasoning for Subjective Questions

Paper • 2507.20187 • Published Jul 27, 2025 • 25

upvoted a paper 6 months ago

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 89

upvoted a paper 8 months ago

MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly

Paper • 2505.10610 • Published May 15, 2025 • 54

upvoted a paper 11 months ago

VLM^2-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues

Paper • 2502.12084 • Published Feb 17, 2025 • 32

upvoted a paper about 1 year ago

MentalArena: Self-play Training of Language Models for Diagnosis and Treatment of Mental Health Disorders

Paper • 2410.06845 • Published Oct 9, 2024 • 5