Repository navigation

#

sqoop

Mirror of Apache Sqoop

Java
978
4 年前

Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources

Java
447
1 个月前

A prototype project of big data platform, the source codes of the book Big Data Platform Architecture and Prototype

Java
199
5 年前

Educational notes,Hands on problems w/ solutions for hadoop ecosystem

Python
87
6 年前

The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is solely intended for usage in a development environment. Do not use it to run any production workloads.

Shell
63
2 年前

This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.

Python
54
7 年前

云计算之hadoop、hive、hue、oozie、sqoop、hbase、zookeeper环境搭建及配置文件

Shell
54
8 年前
Jupyter Notebook
53
2 年前

IBIS is a workflow creation-engine that abstracts the Hadoop internals of ingesting RDBMS data.

Python
50
3 年前

Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.

Jupyter Notebook
48
2 年前

Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

37
5 年前

Export PostgreSQL tables to Google BigQuery

Scala
37
4 年前

异构存储数据迁移

Java
30
7 年前

Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary tools that can be used in the BigData domain, It's a collection of docker containers that you can use directly.

VBA
29
4 年前

一个增量备份关系数据库(MySQL, PostgreSQL, SQL Server, SQLite, Oracle等)到hive的php脚本工具

PHP
21
7 年前