Imperceptible Jailbreaking against Large Language Models Paper • 2510.05025 • Published Oct 6 • 33 • 2
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26 • 70 • 3