Ebisu: Benchmarking Large Language Models in Japanese Finance Paper • 2602.01479 • Published 7 days ago • 17
From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models Paper • 2508.13491 • Published Aug 19, 2025 • 59