Repository navigation

#

data-streaming

Apache InLong - a one-stop, full-scenario integration framework for massive data

Java
1469
7 天前

An extensible distributed system for reliable nearline data streaming at scale

Java
944
3 个月前
Scala
700
1 小时前

Http Connector for Apache Flink. Provides sources and sinks for Datastream , Table and SQL APIs.

Java
191
2 天前

Adapter for dbt that executes dbt pipelines on Apache Flink

Python
95
2 年前

A Python library for machine-learning and feedback loops on streaming data

Python
62
2 年前

Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, deploying, and running the code on-premises using Docker, as well as running the code in the cloud.

Java
57
1 年前

Sample Applications for Pravega.

Java
55
2 年前

Docker Compose environments for demonstrating modern data platform architectures using Kafka, Flink, Spark, Iceberg, Pinot + Kpow & Flex by Factor House

Shell
43
13 天前

High-performance and efficient Framework and Agent for creating data pipelines. The core of pipeline descriptions is based on the Configuration As Code concept and the Pkl configuration language by Apple.

Go
34
18 天前

A library for data streaming and augmentation

Python
20
5 个月前

Developer-friendly MCP server bridging Kafka and Pulsar protocols—built with ❤️ by StreamNative for an agentic, streaming-first future.

Go
18
1 个月前