personally-identifiable-information

An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.

Python
5690
20 hours ago

Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.

Ruby
264
3 days ago

The open source PII and PHI redaction and de-identification engine

Java
65
4 days ago
Python
30
2 years ago

PIITracker: Automatic Tracking of Personally Identifiable Information in Windows

C
20
8 years ago

Case study that uses dotfurther's Open Discover Platform with the RavenDB document store to rapidly build a demonstration application capable of full-text search, eDiscovery, and information governance.

11
1 year ago

Anonymises data in text files and sheet files. It recognises and removes various kinds of personally identifiable information (PII), replacing each removed part with suitable generic text that depends on the type of data removed. English and Russian are currently supported; Russian works with both Cyrillic and Latin characters.

Python
4
4 days ago

This is an experimental implementation of field-level data masking of Personally Identifiable Information (PII) for use in Django.

Python
3
2 years ago

Shell script to redact PDFs using open-source software

Shell
0
2 years ago

A tool to hide personal details from text and PDF files using AI and regex patterns.

JavaScript
0
3 months ago