# data-streaming

Apache InLong - a one-stop, full-scenario integration framework for massive data

Java
1431
2 days ago

An extensible distributed system for reliable nearline data streaming at scale

Java
936
1 year ago
Scala
683
18 hours ago

HTTP connector for Apache Flink. Provides sources and sinks for the DataStream, Table, and SQL APIs.

Java
177
23 days ago

Adapter for dbt that executes dbt pipelines on Apache Flink

Python
94
1 year ago

A Python library for machine-learning and feedback loops on streaming data

Python
61
2 years ago

Sample Applications for Pravega.

Java
54
1 year ago

Sample code that shows the important aspects of developing custom connectors for Kafka Connect. It provides the resources for building, deploying, and running the code on-premises using Docker, as well as running the code in the cloud.

Java
54
10 months ago

A high-performance, efficient framework and agent for creating data pipelines. Pipeline descriptions are built on the Configuration as Code concept and Apple's Pkl configuration language.

Go
34
16 days ago

A library for data streaming and augmentation

Python
20
1 year ago

This project implements a real-time data pipeline using Apache Kafka, Python's psutil library for metric collection, and SQL Server for data storage. The pipeline collects metrics from the local computer, routes them through Kafka brokers, and loads them into a SQL Server database. A real-time dashboard is built on top using Power BI.

Python
11
1 year ago

A Federated Learning Method for Real-time Emotion State Classification from Multi-modal Streaming

Python
11
3 years ago