Filippo Tonini

filo362
  • pippot
  • filippo-tonini-35b8a6283

AI & ML interests

LLM safety in multi-agent environments

Recent Activity

upvoted a paper 1 day ago
PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models
upvoted a paper 1 day ago
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling
upvoted a paper 16 days ago
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals

Organizations

None yet

filo362's models

None public yet