Patronus AI

Team
company
Verified
https://patronus.ai
patronusai

AI & ML interests

LLM Evaluation

Recent Activity

vgtomahawk  published a model 23 days ago
PatronusAI/Qwen3-4B-Instruct-2507-CE-152T-GPT41Tea-notR-L4-M-Ep1-1e-5-Q32-65536-2026Feb21
patronus-bartek  updated a model 23 days ago
PatronusAI/Qwen3-4B-Instruct-2507-CE-152T-GPT41Tea-notR-L4-M-Ep1-1e-5-Q32-65536-2026Feb21
vgtomahawk  published a model 23 days ago
PatronusAI/Qwen3-4B-Instruct-2507-CE-152T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1923Feb21

Papers

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments


Team members: Rebecca Qian, Anand Kannappan, Bartosz Mielczarek, Varun Joshi, Arek, Darshan Deshpande, Maciej Gełdon, Shivani Jain, Varun Gangal, Edgar Colque, Jedrzej, Chinmayee Kulkarni, Devanshu Bansal, Bartlomiej Olechno, Josh W, Tobi Akomolede, Yoshinari Fujinuma
PatronusAI's Papers (2)

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
Submitted by Darshan Deshpande

MEMTRACK: Evaluating Long-Term Memory and State Tracking in Multi-Platform Dynamic Agent Environments
Submitted by Darshan Deshpande