sparksql

zio / zio-quill

Compile-time Language Integrated Queries for Scala

database Scala scalajs MySQL PostgreSQL Apache Cassandra jdbc linq Apache Spark sparksql

Scala

2166

348

4 days ago

harsha2010 / magellan

Geo Spatial Data Analytics on Spark

geospatial-analytics sparksql Apache Spark GeoJSON shapefile geospatial geospatial-analysis big-data

Scala

535

149

4 years ago

Stratio / sparta

Real Time Analytics and Data Pipelines based on Spark Streaming

streaming-data Scala Apache Spark streaming spark-streaming olap kafka hdfs workflow analytics real-time sparksql lambda triggers

Scala

528

196

6 years ago

spirom / LearningSpark

Scala examples for learning to use Spark

Scala Apache Spark spark-streaming sparksql

Scala

445

289

5 years ago

commoncrawl / cc-pyspark

Process Common Crawl data with Python and Spark

Apache Spark sparksql pyspark

Python

440

11 days ago

teeyog / IQL

An ad hoc query service based on the Spark SQL engine.

Apache Spark sparksql

JavaScript

381

178

2 years ago

microsoft / data-accelerator

Data Accelerator for Apache Spark simplifies onboarding to streaming of big data. It offers a rich, easy-to-use experience for creating, editing, and managing Spark jobs on Azure HDInsight or Databricks while enabling the full power of the Spark engine.

Apache Spark spark-streaming spark-sql sparksql streaming-data streaming servicefabric Node.js Docker hdinsight cosmosdb React Azure iothub big-data Internet of things kafka kafka-streams

305

6 months ago

hbutani / spark-druid-olap

Sparkline BI Accelerator provides fast ad hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.

Apache Spark business-intelligence sparksql query-optimization

Scala

282

7 years ago

locationtech / rasterframes

Geospatial Raster support for Spark DataFrames

Apache Spark sparksql Scala earth-observation image processing machine learning spark-ml

Jupyter Notebook

252

2 years ago

zio / zio-protoquill

Quill for Scala 3

linq PostgreSQL Scala SQL Apache Cassandra jdbc Apache Spark sparksql

Scala

222

4 days ago

bluishglc / bdp

A prototype big data platform project; the source code for the book Big Data Platform Architecture and Prototype

bigdata prototype quickstart Apache Spark spark-streaming spark-sql Demo Redis kafka sqoop sparksql

Java

198

144

5 years ago

ZhuXS / Spring-Shiro-Spark

Spring-Shiro-Spark is an attempt at integrating Spring-Boot, Hibernate, Spark, Spark-SQL, Shiro, iView, VueJs... ...

Spring Boot hibernate-jpa Apache Spark sparksql iview Vue.js shiro-security

Java

114

8 years ago

hyunjoonbok / PySpark

PySpark functions and utilities with examples. Assists the ETL process of data modeling

Apache Spark hadoop pyspark sparksql

Jupyter Notebook

104

5 years ago

saurfang / sparksql-protobuf

Read SparkSQL parquet file as RDD[Protobuf]

protobuf sparksql parquet

Scala

7 years ago

CybercentreCanada / jupyterlab-sql-editor

A JupyterLab extension providing a SQL formatter, auto-completion, and syntax highlighting, with support for Spark SQL and Trino

jupyterlab plugin lsp trino sparksql SQL formatter auto-completion datagrid JSON schema VS Code Extension notebook Syntax Highlighting dataframe

Jupyter Notebook

2 days ago

microsoft / A-TALE-OF-THREE-CITIES

Analyzing the safety (311) dataset published by Azure Open Datasets for Chicago, Boston, and New York City using SparkR, SparkSQL, and Azure Databricks, with visualization using ggplot2 and leaflet. The focus is on descriptive analytics, visualization, clustering, time series forecasting, and anomaly detection.

sparksql Azure R eda data workshop-materials time-series-analysis timeseries-forecasting anomaly-detection Open Data visualization leaflet geospatial

4 years ago