#dataquality

open-metadata/OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

TypeScript
7649
5 hours ago

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala
3516
1 month ago

Know your data better! Datavines is a next-gen data observability platform that supports metadata management and data quality.

Java
667
3 days ago

Possibly the fastest DataFrame-agnostic quality check library in town.

Python
220
5 days ago

Frontend for the osmcha-django REST API

JavaScript
140
2 months ago

Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.

Python
132
2 years ago

Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility across and down your data estate. Save time with simple, fast data quality test generation and execution. Trust your data, tools, and systems end to end.

Python
127
2 days ago

DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution through data profiling, new dataset hygiene review, AI generation of data quality validation tests, ongoing testing of data refreshes, and continuous anomaly monitoring.

Python
64
18 days ago

A data quality control system with embedded AI.

Java
48
4 years ago