Repository navigation

#

pii

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python
5275
3 小时前

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Python
2793
18 天前

A powerful scanner to scan your Filesystem, S3, MySQL, Redis, Google Cloud Storage and Firebase storage for PII and sensitive data.

Python
434
8 天前

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Python
320
2 年前

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

Jupyter Notebook
230
8 天前

Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP

Java
95
1 年前

A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)

Python
89
2 年前

KloudDB Shield is a comprehensive Postgres Security Tool - PII Scanner , CIS Benchmarks , SSL audit , 12+ features .. Supports Postgres, RDS ,Aurora, MySQL

Go
89
8 个月前

A simple tool to anonymize LLM prompts.

Svelte
64
7 个月前

The open source PII and PHI redaction and de-identification engine

Java
60
14 天前

🛡️ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs — designed to support data privacy and GDPR compliance

TypeScript
56
2 个月前

Scala library and compiler plugin that prevent inadvertent leakage of sensitive fields in `case classes` (such as credentials, personal data, and other confidential information)

Scala
53
4 天前

Open Privacy Vault - Secure, Performant, Open Source PII as a Service.

Go
50
1 年前

Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.

Python
46
4 年前

A package to build an end-to-end pipeline for detecting personally identifiable information from text.

Python
45
6 年前

A Mongoose plugin that lets you transparently cipher stored PII and use securely-hashed passwords

JavaScript
45
3 年前