Repository navigation

#

dedupe

Fast, secure, efficient backup program

Go
30231
9 小时前

🆔 A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

Python
4381
2 个月前

Deduplication tool for yarn.lock files

TypeScript
1393
4 天前
J535D165/recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

Python
1025
2 年前

Remove duplicates from MASSIVE wordlist, without sorting it (for dictionary-based password cracking)

C++
955
5 个月前

Best-Effort Extent-Same, a btrfs dedupe agent

C++
841
15 天前

Make CSS easier and more maintainable by using JavaScript

TypeScript
707
2 年前

🆔 Command line tool for deduplicating CSV files

Python
430
6 年前

🆔 Examples for using the dedupe library

Python
414
1 年前

Identifying and removing near-duplicate images using perceptual hashing.

Python
378
5 个月前

A fast file deduplicator

Rust
201
2 个月前

Fast block-level out-of-band BTRFS deduplication tool.

Python
180
1 年前

Check your files for data corruption and run quick file deduplication

Go
163
1 个月前

Daxus is a server state management library for React that provides full control over data, leading to a better user experience.

TypeScript
96
1 年前
Rust
64
1 个月前

Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.

Python
63
2 天前

A lightweight fetching library packed with essential features - retries, interceptors, request deduplication and much more, all while still retaining a similar API surface with regular Fetch.

TypeScript
55
2 小时前

A simple command line interface to the datamade/dedupe library.

Jupyter Notebook
42
3 年前