azkaban

Apache DolphinScheduler is a modern data orchestration platform for quickly building high-performance workflows with low code.

Java
13420
1 day ago

Focused on big data learning and interview preparation; start your path to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...

10056
2 years ago
Java
4492
10 months ago

DataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Java
3155
18 days ago

Taier is a big data development platform for job submission, scheduling, operation and maintenance, and display of indicator information.

Java
1240
8 months ago

Schedulis is a high-performance workflow task scheduling system that supports high availability, multi-tenancy and other financial-grade features, and the Linkis computing middleware, and has been integrated into the data application development portal DataSphere Studio.

Java
393
3 days ago

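The best big data project: the Titan Data Operations System. This project is a full-stack, closed-loop system. It includes a web system for data visualization, reads logs through a flume-kafka-flume pipeline, designs the data warehouse in Hive, uses Spark code to transform data between warehouse tables and to migrate ADS-layer tables to MySQL, and schedules periodic jobs with Azkaban. Technologies used: Java/Scala, Hadoop, Spark, Hive, Kafka, Flume, Azkaban, SpringBoot, Bootstrap, Echart, and others.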

JavaScript
127
3 years ago

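An Azkaban helper that adds web-based job configuration, remote script invocation, alerting extensions, cross-project dependencies, and other features.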

Python
119
8 years ago

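A big data project based on the open-source Litemall e-commerce project, covering front-end event tracking (openresty+lua) and back-end event tracking, a five-layer data warehouse, real-time computing, and user profiling. The big data platform uses CDH6.3.2 (scripted with vagrant+ansible) and also includes Azkaban workflows.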

Java
112
3 years ago

Lightweight Azkaban client

Python
77
5 years ago

A general-purpose data synchronization microservice based on DataX; a single RESTful interface handles all general data synchronization.

Java
53
3 years ago

Ambari service for Azkaban

Shell
25
4 years ago

📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.

Scala
21
8 years ago

Define and schedule workflows, with support for Flink Jar/SQL, ClickHouse/Hive/Mysql SQL, Shell, etc.

Java
17
8 days ago
Python
10
6 years ago

Azkaban dockerized

Shell
8
5 years ago