hadoop-ecosystem

Everything you need for a data engineering (DE) role

Jupyter Notebook
197
4 days ago

HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)

Java
64
2 years ago

IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.

Python
50
3 years ago

Hadoop3.2 single/cluster mode with web terminal gotty, spark, jupyter pyspark, hive, eco etc.

Shell
11
5 years ago

Instructions on setting up Hadoop, HDFS, java, sbt, kafka, scala, spark and flume on Ubuntu 18.04

Shell
8
4 years ago

Dockerfile for running Apache Knox (http://knox.apache.org/) in Docker

Dockerfile
8
3 years ago

The goal of this project is to identify flood-prone areas and estimate the probability of flooding in counties on a future date, using Spark MLlib.

Scala
3
5 years ago

Built a large-scale distributed data processing system for streaming analytics using the Hadoop ecosystem (Apache Spark and HDFS), in the cloud, for real-time spatial analytics.

Scala
2
4 years ago

SparkSQL Quick Start Tutorial

Scala
2
8 years ago

Spark Streaming & Kafka Quick Start Tutorial

Scala
1
8 years ago

Practice programs in the Hadoop ecosystem for reference

1
7 years ago

Avro File Format Quick Start Tutorial

Java
1
8 years ago

[BigData] One-year weblog analysis using Pig

PigLatin
1
7 years ago

Big data on various customers is stored and analyzed using Hadoop and other tools such as Hive, ZooKeeper, HBase, and Sqoop. All customer details are analyzed and the results are reported. These results are very useful for companies.

1
4 years ago

This project focuses on analyzing movie data using PySpark, tailored for efficient data processing on the Hadoop Distributed File System (HDFS).

Jupyter Notebook
1
1 year ago