Repository navigation
data-modelling
- Website
- Wikipedia
Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.
dbt + Metabase integration
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, standardised structure for data and ML and parallel processing out-of-the-box.
This repository is a working ETL framework which utilizes user data from Spotify API using ➲Python for Extraction and Transformation ➲SQL for Data Loading and Staging ➲Airflow for Data Orchestration and Monitoring ➲PowerBI for Reporting
🎥 Email marketing campaign analysis
COVID-19 Surveillance Data Modelling and Management Pipeline in Piedmont.
Extensible Object Model Data Abstraction
⚙️ ETL pipeline on AWS using S3 and Redshift
This repo covers the processes of designing a database by performing logical, conceptual and physical data modelling processes, creating the designed database using DML and DDL on various database server systems and performing SQL queries on the created database.
Data model for the Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) project
Developed a 3-page Power BI dashboard (global and Asian overview) using Python scripts to load and clean World Bank data (1960–2020), reducing data processing time by 25\%. and Containerized the database in Docker, enabling scalable access, and visualized trends (e.g., 3\% annual GDP growth in Asia), enhancing stakeholder insights.
An interactive Tableau dashboard promoting nutritional health through the exploration of micronutrients
Repository with files that I worked upon during the DBS211 (Introduction to Database Systems) course.
Formula 1 race data engineering project which utilises azure services and databricks to ingest and analyse the data.
Social blogging community build with React, Next.js, and Firebase.
A CIDOC-CRM-based Application Profile, consisting in a set of entities and properties for representing the digitisation process of cultural heritage objects in a machine-readable format.