Repository navigation
text-anonymization
- Website
- Wikipedia
Context aware, pluggable and customizable data protection and de-identification SDK for text, images and structured data.
Examples scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.
Deidentify people's names and gender specific pronouns
Clean your Text for Statistical ML and Language Model
Simple project on html anonymization
This repository contains the code and data for the text re-identification attack presented in B. Manzanares-Salor, D. Sánchez, P. Lison, Evaluating the disclosure risk of anonymized documents via a machine learning-based re-identification attack, Data Mining and Knowledge Discovery, 2024.
This repository contains the code and data for the text anonymization enhancement method presented in B. Manzanares-Salor, D. Sánchez, Enhancing text anonymization via re-identification risk-based explainability, Submitted, 2024.