sparksql
Compile-time Language Integrated Queries for Scala
Geo Spatial Data Analytics on Spark
Real Time Analytics and Data Pipelines based on Spark Streaming
Scala examples for learning to use Spark
Process Common Crawl data with Python and Spark
An ad hoc query service based on the Spark SQL engine.
Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.
Sparkline BI Accelerator provides fast ad hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Geospatial Raster support for Spark DataFrames
Quill for Scala 3
A prototype big data platform; the source code for the book Big Data Platform Architecture and Prototype
Spring-Shiro-Spark is an attempt at integrating Spring-Boot, Hibernate, Spark, Spark-SQL, Shiro, iView, VueJs, ...
PySpark functions and utilities with examples. Assists the ETL process of data modeling
A JupyterLab extension providing SQL formatting, auto-completion, and syntax highlighting for Spark SQL and Trino
Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston and New York City using SparkR, SparkSQL, and Azure Databricks, with visualization using ggplot2 and leaflet. Focus is on descriptive analytics, visualization, clustering, time series forecasting and anomaly detection.
Type-class based data cleansing library for Apache Spark SQL
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
New-generation open-source data stack
Already merged into apache/incubator-kyuubi. ACL Management for Apache Spark SQL with Apache Ranger.