BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers? Paper • 2510.18003 • Published Oct 20, 2025
Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? Paper • 2605.12684 • Published 7 days ago • 11
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? zhangchenxu • Feb 25 • 14
view article Article Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty? zhangchenxu • Feb 25 • 14