Repository navigation

#

spark-streaming

A Flexible and Powerful Parameter Server for large-scale machine learning

Java
6748
1 年前

酷玩 Spark: Spark 源代码解析、Spark 类库等

Scala
3481
3 年前

基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统

Java
2917
6 年前
C#
2049
6 天前

scala、spark使用过程中,各种测试用例以及相关资料整理

Scala
1086
6 年前

Wormhole is a SPaaS (Stream Processing as a Service) Platform

JavaScript
977
2 年前

C# and F# language binding and extensions to Apache Spark

C#
940
1 年前

An open source framework for building data analytic applications.

Java
769
2 天前

Scala examples for learning to use Spark

Scala
445
5 年前

Stream computing platform for bigdata

Java
402
1 年前

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

Python
398
1 个月前

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

C#
302
19 天前
Python
250
2 天前
Scala
245
4 个月前

Spark, Spark Streaming and Spark SQL unit testing strategies

Scala
218
9 年前

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

TypeScript
209
6 年前

Self-contained examples of Apache Spark streaming integrated with Apache Kafka.

Scala
199
7 年前