Repository navigation

#

pii

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python
5690
20 小时前

Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

Python
2892
18 天前

A powerful scanner to scan your Filesystem, S3, MySQL, Redis, Google Cloud Storage and Firebase storage for PII and sensitive data.

Python
450
12 天前

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Python
324
2 年前

Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.

Ruby
264
3 天前

This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.

Jupyter Notebook
238
1 个月前

Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP

Java
95
1 年前

A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)

Python
91
2 天前

KloudDB Shield is a comprehensive Postgres Security Tool - PII Scanner , CIS Benchmarks , SSL audit , 12+ features .. Supports Postgres, RDS ,Aurora, MySQL

Go
89
4 天前

Never give AI companies your secrets! A local LLM-based privacy filter for LLM users. Seamless integration with your existing AI tools as a Python library / OpenAI SDK replacement / API Gatetway / Web Server.

Python
66
1 个月前

A simple tool to anonymize LLM prompts.

Svelte
65
8 个月前

The open source PII and PHI redaction and de-identification engine

Java
65
4 天前

🛡️ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs — designed to support data privacy and GDPR compliance

TypeScript
64
4 个月前

Scala library and compiler plugin that prevent inadvertent leakage of sensitive fields in `case classes` (such as credentials, personal data, and other confidential information)

Scala
55
17 天前

Open Privacy Vault - Secure, Performant, Open Source PII as a Service.

Go
50
1 年前

A package to build an end-to-end pipeline for detecting personally identifiable information from text.

Python
47
6 年前