When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation Paper • 2510.07238 • Published Oct 8 • 14 • 2
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses Paper • 2510.00232 • Published Sep 30 • 15 • 2
BiasEdit: Debiasing Stereotyped Language Models via Model Editing Paper • 2503.08588 • Published Mar 11 • 7 • 2