| | --- |
| | tags: |
| | - text-classification |
| | - recruitment |
| | - forensics |
| | - security |
| | license: mit |
| | datasets: |
| | - dcata004/recruiter-harvesting-dataset-v1 |
| | pipeline_tag: text-classification |
| | --- |
| | |
| | # 🐍 V.I.P.E.R. Classification Engine (v1.0) |
| | **Maintainer:** [Cata Risk Lab](https://huggingface.co/Cata-Risk-Lab) |
| |
|
| | ## 🧠 Model Overview |
| | This repository contains the configuration and architecture definitions for the **V.I.P.E.R.** recruitment auditing system. It defines the risk thresholds and vectorization parameters used to detect "Resume Harvesting" attacks. |
| |
|
| | ## 🛠️ Configuration |
| | The model operates on a `TfidfVectorizer` pipeline optimized for short-text classification of email subjects and bodies. |
| |
|
| | - **Risk Threshold:** 0.75 (Confidence score required to flag as SPAM) |
| | - **Labels:** `['harvesting', 'legitimate']` |
| | - **Dataset:** Trained on forensic recruitment data (Swiss/US/UK). |
| |
|
| | ## ⚖️ Sovereign AI |
| | Designed for local inference to protect user data privacy. |