#data-catalog

datahub-project/datahub

The Metadata Platform for your Data and AI Stack

Java
11103
4 hours ago
open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

TypeScript
7649
3 hours ago

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python
4664
4 days ago

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java
2051
1 day ago

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Python
1054
17 days ago

🐳 The stupidly simple CLI workspace for your data warehouse.

Python
727
3 years ago

Work with your web service, database, and streaming schemas in a single format.

Python
343
1 month ago

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and DataHub.

Python
324
2 years ago

Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.

Go
214
8 days ago

An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.

Python
155
4 days ago

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Java
147
1 year ago

Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.

Python
80
14 days ago

Data catalog for everything in your company

Python
49
2 years ago

Registry of data portals, catalogs, and data repositories, including a data catalogs dataset and a catalog description standard.

Python
47
2 months ago
TypeScript
45
4 months ago