#data-catalog

datahub-project/datahub

The Metadata Platform for your Data and AI Stack

Java
10964
2 hours ago
open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

TypeScript
7351
4 minutes ago

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python
4629
19 days ago

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java
1773
5 minutes ago

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Python
1048
2 months ago

🐳 The stupidly simple CLI workspace for your data warehouse.

Python
727
3 years ago

Work with your web service, database, and streaming schemas in a single format.

Python
345
2 months ago

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and DataHub.

Python
320
2 years ago

Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.

Go
209
10 months ago

An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.

Python
154
5 days ago

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Java
147
1 year ago

Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.

Python
79
10 days ago

Data catalog for everything in your company

Python
51
2 years ago

Registry of data portals, catalogs, data repositories including data catalogs dataset and catalog description standard

Python
46
22 days ago