#data-catalog

datahub-project/datahub

The Metadata Platform for your Data and AI Stack

Java
10524
4 hours ago
open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

TypeScript
6487
31 minutes ago

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python
4556
18 days ago

World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.

Java
1447
2 days ago

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Python
1038
1 month ago

🐳 The stupidly simple CLI workspace for your data warehouse.

Python
726
2 years ago

Work with your web service, database, and streaming schemas in a single format.

Python
343
17 days ago

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub.

Python
306
1 year ago

Meteor is an easy-to-use, plugin-driven metadata collection framework to extract data from different sources and sink to any data catalog.

Go
204
6 months ago

An intake plugin for parsing an Earth System Model (ESM) catalog and loading assets into xarray datasets.

Python
149
6 days ago

Reference implementation for real-time Data Lineage tracking for BigQuery using Audit Logs, ZetaSQL and Dataflow.

Java
143
1 year ago

Metamapper is a data discovery and documentation platform for improving how teams understand and interact with their data.

Python
79
3 days ago

Data catalog for everything in your company

Python
50
2 years ago

The documentation repository is part of the Corporate Linked Data Catalog - short: COLID - application.

HTML
43
2 years ago