arxiv:2503.18991
Cheng
RosyCheng
AI & ML interests
LLM Alignment & Security
Recent Activity
upvoted a paper 1 day ago
OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens
authored a paper 8 months ago
Inverse Reinforcement Learning with Dynamic Reward Scaling for LLM Alignment
authored a paper 8 months ago
PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization