
#

dedupe

Fast, secure, efficient backup program

Go
28303
6 days ago

🆔 A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Python
4267
5 months ago

Deduplication tool for yarn.lock files

TypeScript
1389
15 days ago

A powerful and modular toolkit for record linkage and duplicate detection in Python

Python
997
1 year ago

Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)

C++
922
2 days ago

Best-Effort Extent-Same, a btrfs dedupe agent

C++
771
25 days ago

Make CSS easier and more maintainable by using JavaScript

TypeScript
709
1 year ago

🆔 Command line tool for deduplicating CSV files

Python
420
5 years ago

🆔 Examples for using the dedupe library

Python
411
8 months ago

Identifying and removing near-duplicate images using perceptual hashing.

Python
357
2 years ago

A fast file deduplicator

Rust
192
2 years ago

Fast block-level out-of-band BTRFS deduplication tool.

Python
178
6 months ago

Check your files for data corruption and run quick file deduplication

Go
132
1 month ago

Daxus is a server state management library for React that provides full control over data, leading to a better user experience.

TypeScript
95
6 months ago

Rust
59
2 months ago

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Python
57
4 days ago

A simple command line interface to the datamade/dedupe library.

Jupyter Notebook
42
2 years ago

Self-contained C# library for data deduplication using SQLite

C#
36
2 years ago