Repository navigation

#

pii-detection

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python
5270
11 分钟前

Mediapipe-based library to redact faces from videos and images

C++
441
2 年前

A Swiss-Army-knife for your Data Intelligence platform administration.

Python
127
4 个月前

The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs, discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personal Identifiable Information (PII) and classified information.

TypeScript
118
6 个月前

A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)

Python
89
2 年前

🛡️ PII Guard is an LLM-powered tool that detects and manages Personally Identifiable Information (PII) in logs — designed to support data privacy and GDPR compliance

TypeScript
56
2 个月前

A package to build an end-to-end pipeline for detecting personally identifiable information from text.

Python
45
6 年前

Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules

Python
44
1 个月前

Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.

Ruby
42
2 天前

Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation

Python
31
1 年前

Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface

Python
24
9 小时前

Anonymize / mask personal information before sending prompts to chat AI (like ChatGPT provided by OpenAI)

Python
22
2 年前

Open source PII detection and anonymization tool: easy-to-use, configurable, and extensible

Python
21
3 个月前

Web Scanner written in Python which after scanning the given URL returns it's domain name, ip address, nmap scan results and also the contents the URL's robots.txt.

Python
20
1 年前

LLM Semantic Router: Intelligent Mixture-of-Models (MoM) System with Advanced ML Security. An advanced semantic router that intelligently directs OpenAI API requests to the most suitable backend language model from a defined pool based on deep semantic understanding of request content.

Python
18
1 天前

Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️

Python
18
1 天前

Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources

Python
17
2 年前