Repository navigation

#

Entity resolution

Created by Halbert L. Dunn

发布于 1946

entity-resolution
维基百科

相关主题

人工智能自然语言处理

Entity resolution (also known as data matching, data linkage, record linkage, and many other terms) is the task of finding entities in a dataset that refer to the same entity across different data sources (e.g., data files, books, websites, and databases). Entity resolution is necessary when joining different data sets based on entities that may or may not share a common identifier (e.g., database key, URI, National identification number), which may be due to differences in record shape, storage location, or curator style or preference.

Fast, secure, efficient backup program

Go
28303
6 天前

Deduplicating archiver with compression and authenticated encryption.

Python
11741
19 小时前

Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.

Go
9286
3 天前

Find duplicate files

Python
5944
8 个月前

A C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.

C
4226
15 小时前

rustic - fast, encrypted, and deduplicated backups powered by Rust

Rust
2318
1 天前

A fast high compression read-only file system for Linux, Windows and macOS

C++
2274
1 小时前

Extremely fast tool to remove duplicates and other lint from your filesystem

C
2069
1 个月前

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python
1558
3 天前

A powerful and modular toolkit for record linkage and duplicate detection in Python

Python
997
1 年前

Data deduplication engine, supporting optional compression and public key encryption.

Rust
838
3 年前

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

JavaScript
713
10 个月前

Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)

PLpgSQL
675
23 天前

Fast Semantic Text Deduplication

Python
633
2 天前