Repository navigation
sqoop
- Website
- Wikipedia
Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources
A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype
Educational notes,Hands on problems w/ solutions for hadoop ecosystem
The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
Repository used for Spark Trainings
Big data projects implemented by Maniram yadav
IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.
Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.
Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.
Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary tools that can be used in the BigData domain, It's a collection of docker containers that you can use directly.