Repository navigation

#

data-modelling

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

Go
752
1 年前

BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.

Java
150
2 个月前

Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, standardised structure for data and ML and parallel processing out-of-the-box.

Python
53
2 年前

This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL for Data Loading and Staging ➲Airflow for Data Orchestration and Monitoring ➲PowerBI for Reporting

Python
12
2 年前

⚙️ ETL pipeline on AWS using S3 and Redshift

Python
5
2 年前

This repo covers the processes of designing a database by performing logical, conceptual and physical data modelling processes, creating the designed database using DML and DDL on various database server systems and performing SQL queries on the created database.

5
2 年前

Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project

Python
4
1 年前

Developed a 3-page Power BI dashboard (global and Asian overview) using Python scripts to load and clean World Bank data (1960–2020), reducing data processing time by 25\%. and Containerized the database in Docker, enabling scalable access, and visualized trends (e.g., 3\% annual GDP growth in Asia), enhancing stakeholder insights.

Python
4
2 个月前

An interactive Tableau dashboard promoting nutritional health through the exploration of micronutrients

Jupyter Notebook
4
15 天前

Repository with files that I worked upon during the DBS211 (Introduction to Database Systems) course.

C++
4
3 年前

Formula 1 race data engineering project which utilises azure services and databricks to ingest and analyse the data.

Python
4
2 年前

A CIDOC-CRM-based Application Profile, consisting in a set of entities and properties for representing the digitisation process of cultural heritage objects in a machine-readable format.

Jupyter Notebook
3
2 个月前