Repository navigation

#

data-integration

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python
17903
3 小时前

SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.

Java
8444
3 天前
Java
5745
9 小时前
jitsucom/jitsu

Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days

TypeScript
4271
1 天前

A data integration framework

Java
4043
2 个月前

Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.

3353
2 天前

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

Python
2940
2 天前

Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.

Go
2705
12 天前

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

Python
2080
1 年前

BitSail is a distributed high-performance data integration engine which supports batch, streaming and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.

Java
1655
1 年前

Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times

JavaScript
793
3 年前