142 results for “topic:dimensional-modeling”
Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principles on Adventure Works. Features programmatic model generation, event-enhanced Puppini bridges, and temporal resolution across DAS/DAB/DAR layers.
Analysis of New York State Police Department Arrests dataset. Created Dimensional Model for the provided dataset. Using Alteryx and Talend, built ETL pipelines to process, clean the data and create dimensions and facts in the destination database. Further, visualized the necessary details of the database using Tableau and PowerBI.
This repository is a place for the Data Warehousing course at the Information Systems & Analytics department, Santa Clara University.
No description provided.
Syracuse University, Masters of Applied Data Science - IST 722 Data Warehouse
Designed a multi-dimensional data model using LucidChart. Developed ELT pipeline using Python/Pandas. S&P 500 data obtained via yfinance, an open-source library. Output normalized data to excel. Performed analysis and generated reports with Power BI.
DW de e-commerce (Kimball/Star Schema) em SQL Server, com scripts, dados sintéticos e docs para estudos.
This project demonstrates an ETL pipeline that processes NOAA's fishing survey data, then makes it available for analysis through an interactive web app.
This repository contains the end-to-end pipeline for building a data warehouse for a real estate management company. The pipeline includes data generation, ETL process, creation of star schema dimensions and fact table, visualization using Power BI, and automation with Pabbly Connect.
Dimensional Data Design
2022 SCC Data Science & Analytics Workshop on Databases
This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.
A Data Warehousing project for retail sales using dimension modelling best practices with SCD type 2 on AWS Redshift. Utilizing AWS Lambda, Glue Workflows and Python Shell jobs to create and automate an ELT pipeline where batch data coming into S3 is loaded onto Redshift and necessary transformations are performed to meet requirements.
Code, scripts, and resources for the Data Engineering Fundamentals Course Webinar, covering Python, data pipelines, Apache Airflow, and more.
Source code for the Kimball-style date dimension generator dimdates.com.
OLAP in TSQL and Python
Starts with a conceptual model ends with a Tableau interactive dash board. In between there is building ER diagrams, forward engineering to build normalized databases, dimensional data modelling and visualizations in tableau.
The project involves creating a data model for HNG Hire, implementing it in MySQL, and building a Power BI dashboard to display hiring statistics.
💧 Data warehouse & BI system analyzing water stress across Morocco (2015-2025). 68K+ records | MySQL | QlikView | Star schema. Portfolio showcase - Proprietary.
This project involves creating a dimensional data model using MySQL Workbench for a car repair shop’s operations in western Canada by examining a sample invoice,
A comprehensive dimensional model for COVID data, enabling insights for future vaccination campaigns through robust visual analytics.
End-to-end enterprise sales lakehouse implementing Medallion ELT architecture with dimensional modeling for decision-ready analytics
An examination of a dataset collecting data from food inspections conducted at several Boston restaurants.
Projeto end-to-end da criação de um Data Warehouse para uma companhia fictícia de mineração chamada Astarte Mining Co.
Data Warehouse Dimensional Modeling - Professional Python project
creating a data warehouse for a football game management company and some SQL queries to analyse data.
Production-style healthcare claims Medallion architecture pipeline (Bronze → Silver → Gold) built in Databricks with dimensional modeling, surrogate key management, fact grain enforcement, and data quality controls.
Production-ready dbt project for cancer screening analytics. Demonstrates dimensional modeling, healthcare metrics, and client-facing BI for Color Health Senior Analytics Engineer portfolio.
🏭 Turn scattered pharma quality data into actionable insights | Prevent batch rejections | Automate compliance reporting | Open-source OLAP solution that saves millions | Built by QMS data professionals
Enterprise Data Warehouse with Star Schema, SCD Type 2, ETL pipelines, and multi-currency analytics (SQL Server + Python)