Repository navigation

#

Entity resolution

Created by Halbert L. Dunn

发布于 1946

entity-resolution
维基百科

相关主题

人工智能自然语言处理

Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.

Fast, secure, efficient backup program

Go
30231
9 小时前

Deduplicating archiver with compression and authenticated encryption.

Python
12434
2 天前

Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.

Go
11215
1 天前

Find duplicate files

Python
6884
1 年前

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

C
4600
1 个月前

rustic - fast, encrypted, and deduplicated backups powered by Rust

Rust
2622
12 天前

A fast high-compression read-only file system for Linux, FreeBSD, macOS and Windows

C++
2393
15 小时前

Extremely fast tool to remove duplicates and other lint from your filesystem

C
2185
6 天前

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python
1732
19 小时前
PlakarKorp/plakar

plakar is a backup solution powered by Kloset and ptar

Go
1315
1 天前
J535D165/recordlinkage

A powerful and modular toolkit for record linkage and duplicate detection in Python

Python
1025
2 年前

Data deduplication engine, supporting optional compression and public key encryption.

Rust
855
3 年前

Fast Semantic Text Deduplication & Filtering

Python
810
11 小时前

Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)

PLpgSQL
724
4 天前