spark-sql

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python
27223
3 days ago

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala
2178
3 days ago
C#
2049
6 days ago

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala
1321
19 hours ago

This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala
1279
3 months ago

Big data platform for e-commerce user behavior analysis

Java
1016
2 years ago
Jupyter Notebook
431
2 years ago

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy-to-use experience to help with the creation, editing, and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

C#
302
19 days ago

Apache Spark™ and Scala Workshops

HTML
264
9 months ago

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Scala
226
3 months ago

A complete example of a big data application using: Kubernetes (kops/AWS), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL

TypeScript
209
6 years ago

A prototype big data platform project, with the source code for the book Big Data Platform Architecture and Prototype

Java
199
5 years ago

Spark Structured Streaming / Kafka / Cassandra / Elastic

Scala
183
2 years ago

Spark SQL implementations of ItemCF, UserCF, and Swing: recommender systems, recommendation algorithms, collaborative filtering

Scala
138
5 years ago