#duplicate-detection

Interact with, analyze, and structure massive text, image, embedding, audio, and video datasets

Python
1636
24 days ago

WinDirStat is a disk usage statistics viewer and cleanup tool for Microsoft Windows

C++
1383
1 month ago

Remove duplicates from a MASSIVE wordlist without sorting it (for dictionary-based password cracking)

C++
922
2 days ago

A plugin that does one thing only: Detect and manage duplicate items in Zotero.

TypeScript
535
1 month ago

Filter, Sort & Delete Duplicate Files Recursively

Rust
329
10 months ago

Near Duplicate Video Detection (Perceptual Video Hashing) - Get a 64-bit comparable hash-value for any video.

Python
307
9 months ago

⚡ Check your npm modules for unused and duplicate dependencies fast

Go
264
3 years ago

The Panako acoustic fingerprinting system.

Java
196
1 year ago

Vidupe is a program that can find duplicate and similar video files. V1.211 was released on 2019-09-18 and is available as a Windows executable.

C++
177
6 years ago

CLI utility to find near duplicate images and remove all but the best copy.

Python
161
6 days ago

Duplicates Detector is a cross-platform GUI utility for finding duplicate files, allowing you to delete or link them to save space. Duplicate files are displayed and processed on two synchronized panels for efficient and convenient operation.

Python
139
16 days ago

Easily delete your YouTube Music library (and manage playlists)

Python
137
4 days ago

A collection of free-text bug reports for duplicate issue identification

120
1 year ago

Command-line program for Content-Based Image Retrieval of images and videos. Includes tools for general search and de-duplication.

C++
119
3 days ago

CLI tool that quickly checks whether your bundle contains multiple versions of the same package by looking only at package.json.

JavaScript
115
5 years ago

Duplicates finder for various source code formats.

C++
103
19 days ago