#

image-deduplication

Image similarity in Golang. Version 4 (LATEST)

Go
103
1 year ago

Tool to detect (and get rid of) similar images using perceptual hashing (pHash lib)

Python
83
9 years ago

A utility for locating near duplicate photos irrespective of image resolution, compression settings or file format.

Rust
23
2 years ago

A Python tool to identify and remove similar-looking images from a dataset. Utilizes image preprocessing and hashing techniques for efficient comparison.

Python
12
2 years ago

Downloader with a custom wildcard system: cherry-pick the internet with asterisks for HTML or right-carets for APIs, whether for time-critical website moments or just out of laziness. Features directory listing and serving, an alarm (essentially an in-stock tracker), a file sorter (organizer), an image duplicate finder, and tools for the naked eye.

Batchfile
7
10 days ago

🏍️ A clustering tool providing exact and near de-duplication of images using vector embeddings.

Python
3
4 months ago

A Python command-line tool that identifies and groups similar images using average hashing. It supports single-level and recursive directory scanning and an adjustable similarity threshold, and it presents results in JSON format. Ideal for image deduplication, organization, and content-based retrieval tasks.

Python
2
1 year ago

A CLI tool for image analysis: checking image integrity, image deduplication, and image retrieval.

Rust
2
2 years ago

An extended version of simhash that supports fingerprint extraction from documents and images.

Python
2
3 years ago

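An efficient Python image duplicate-detection tool that supports duplicate detection across millions of image files. It integrates multiple algorithms, including MD5 hashing, perceptual hashing (dHash/pHash/aHash), and a C++ acceleration library, and can identify duplicate images that are identical, resized, partially cropped, or altered by watermark changes.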

Python
1
1 month ago

A Python notebook combining MD5 and perceptual hashing to detect exact-duplicate images

Jupyter Notebook
0
5 months ago

Get Similarity is a Python-based tool with a GUI that lets users filter out low-quality images and automatically group similar images using CLIP + DINOv2 embeddings and MusIQ-based quality evaluation.

Python
0
4 months ago

Finds the images in a directory that are most similar to the others and deletes the N most similar ones. Use it to remove similar images before training Stable Diffusion models.

Rust
0
1 month ago

Sobel Gradient Image Deduplication

Python
0
3 days ago

Sort duplicate images into separate folders

PHP
0
6 years ago

A Python program to detect duplicate images in a specified folder.

Python
0
5 years ago

A utility for testing the performance of de-duplication algorithms by randomly generating “noisy” images in a dataset.

Rust
0
9 months ago

This Python script identifies duplicate images within a specified directory and moves them to a designated duplicates folder.

Python
0
1 year ago