Repository navigation

#

spark-sql

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python
27833
2 天前

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Scala
2251
3 天前
C#
2078
11 天前

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala
1451
1 天前

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Scala
1345
8 个月前

电商用户行为分析大数据平台

Java
1060
3 年前
Jupyter Notebook
448
3 年前

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

C#
305
6 个月前

Apache Spark™ and Scala Workshops

HTML
263
1 年前

Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!

Scala
233
8 个月前

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

TypeScript
209
7 年前

A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype

Java
198
5 年前

Spark Structured Streaming / Kafka / Cassandra / Elastic

Scala
183
3 年前

Spark SQL 实现 ItemCF,UserCF,Swing,推荐系统,推荐算法,协同过滤

Scala
141
6 年前