Lipeng (Tony) He
ttttonyhe
ยท
AI & ML interests
Trustworthy Machine Learning
Recent Activity
authored
a paper
2 days ago
Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
submitted
a paper
2 days ago
Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
updated
a collection
4 days ago
Red-Teaming Models & Datasets