Repository navigation

#

dedupe

Fast, secure, efficient backup program

Go
29724
13 天前

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Python
4358
22 天前

Deduplication tool for yarn.lock files

TypeScript
1393
5 天前
J535D165/recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

Python
1022
1 年前

Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)

C++
945
3 个月前

Best-Effort Extent-Same, a btrfs dedupe agent

C++
822
1 个月前

Make CSS easier and more maintainable by using JavaScript

TypeScript
707
2 年前

🆔 Command line tool for deduplicating CSV files

Python
427
5 年前

🆔 Examples for using the dedupe library

Python
413
1 年前

Identifying and removing near-duplicate images using perceptual hashing.

Python
375
4 个月前

A fast file deduplicator

Rust
198
16 天前

Fast block-level out-of-band BTRFS deduplication tool.

Python
181
10 个月前

Check your files for data corruption and run quick file deduplication

Go
154
2 个月前

Daxus is a server state management library for React that provides full control over data, leading to a better user experience.

TypeScript
96
10 个月前
Rust
63
4 个月前

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Python
61
1 天前

A lightweight fetching library packed with essential features - retries, interceptors, request deduplication and much more, all while still retaining a similar API surface with regular Fetch.

TypeScript
52
2 天前

A simple command line interface to the datamade/dedupe library.

Jupyter Notebook
42
3 年前