19 results for “topic:cloud-data-warehouse”
A hybrid data pipeline architecture combining Microsoft Fabric, Azure Cloud, and Power BI. The pipeline is engineered for real-time data ingestion, multi-layered processing, and analytics, delivering business-critical insights.
Data engineering practice, including building data pipelines (ELT) from a variety of sources.
Building an ETL pipeline that extracts data from S3 and stages it in Redshift.
This project builds a cloud-based ETL pipeline for Sparkify to move data to a cloud data warehouse. It extracts song and user activity data from AWS S3, stages it in Redshift, and transforms it into a star-schema data model with fact and dimension tables, enabling efficient querying to answer business questions.
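The staging-to-star-schema step typically boils down to an `INSERT ... SELECT` that joins staging tables into a fact table. A minimal sketch, with all table and column names hypothetical rather than taken from the actual project:

```python
# Hypothetical INSERT ... SELECT populating a fact table from Redshift
# staging tables; songplays, staging_events, and staging_songs are
# illustrative names, not the project's actual schema.
songplay_insert = """
INSERT INTO songplays (start_time, user_id, song_id, artist_id, session_id)
SELECT e.ts, e.user_id, s.song_id, s.artist_id, e.session_id
FROM staging_events e
JOIN staging_songs s
  ON e.song_title = s.title AND e.artist_name = s.artist_name
WHERE e.page = 'NextSong';
"""

print(songplay_insert.strip())
```

Dimension tables follow the same pattern, each selecting a distinct slice of the staging data.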
Summary/Notes of Snowflake cloud data warehouse. (Complete ✅)
Hands-on project covering Snowflake data loading with custom file formats, validation modes, error handling, string length limits, TRUNCATECOLUMNS, and analyzing load history using account_usage.load_history.
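For illustration, a load of this shape might pair `TRUNCATECOLUMNS` with a query against the account-usage view; stage, table, and file-format names below are hypothetical:

```python
# TRUNCATECOLUMNS = TRUE silently trims strings that exceed the target
# column's declared length instead of failing the load.
copy_sql = """
COPY INTO customers
FROM @my_stage/customers.csv
FILE_FORMAT = (FORMAT_NAME = 'my_csv_format')
TRUNCATECOLUMNS = TRUE;
"""

# Load history for the table from the shared ACCOUNT_USAGE schema.
history_sql = """
SELECT table_name, status, row_count, error_count
FROM snowflake.account_usage.load_history
WHERE table_name = 'CUSTOMERS'
ORDER BY last_load_time DESC;
"""

print(copy_sql.strip())
print(history_sql.strip())
```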
Hands-on project showcasing Snowflake data loading with error handling using VALIDATION_MODE, ON_ERROR = CONTINUE, ON_ERROR = SKIP_FILE, and ON_ERROR = SKIP_FILE_% while ingesting CSV files from AWS S3.
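The ON_ERROR policies differ only in the clause appended to the COPY statement, so they can be parameterized. A sketch, with hypothetical table and stage names:

```python
def copy_with_on_error(table: str, stage_path: str, on_error: str) -> str:
    """Build a COPY INTO statement with the given ON_ERROR policy.

    Policies include CONTINUE (load good rows, skip bad ones),
    SKIP_FILE (drop the whole file on any error), SKIP_FILE_<n>
    (drop a file after n errors), and ABORT_STATEMENT (the default).
    """
    return (
        f"COPY INTO {table} FROM {stage_path} "
        f"FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1) "
        f"ON_ERROR = {on_error};"
    )

for policy in ("CONTINUE", "SKIP_FILE", "SKIP_FILE_5"):
    print(copy_with_on_error("orders", "@s3_stage/orders/", policy))
```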
End-to-end pipeline analysing Yelp reviews using AWS S3, Snowflake, Python UDFs, and advanced SQL-based sentiment analysis.
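A toy lexicon-based scorer of the kind a Snowflake Python UDF might wrap; the word lists are illustrative, not the project's actual model:

```python
# Hypothetical positive/negative lexicons for a simple word-count scorer.
POSITIVE = {"great", "excellent", "delicious", "friendly"}
NEGATIVE = {"bad", "terrible", "slow", "rude"}

def sentiment(text: str) -> int:
    """Return (#positive - #negative) word hits for a review."""
    words = text.lower().split()
    return sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)

print(sentiment("great food but terrible slow service"))  # → -1
```

Registered as a UDF, a function like this lets sentiment be computed per row in plain SQL over the staged reviews.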
This project demonstrates data sampling techniques in Snowflake. It covers loading datasets from S3, applying RANDOM and SYSTEM sampling methods to extract subsets, validating the sampled data, and speeding up analysis by working on representative subsets.
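Row-level sampling of the kind Snowflake's SAMPLE clause performs (each row kept independently with probability p) can be simulated in a few lines; the seed and data are illustrative:

```python
import random

def sample_rows(rows, pct, seed=42):
    """Keep each row independently with probability pct/100,
    analogous to row-level (Bernoulli-style) table sampling."""
    rng = random.Random(seed)  # seeded for reproducibility
    return [r for r in rows if rng.random() < pct / 100]

rows = list(range(10_000))
subset = sample_rows(rows, 10)
print(len(subset))  # roughly 1,000 rows
```

SYSTEM sampling instead selects whole storage blocks, which is faster on large tables but less uniform.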
The objective of this task is to create and configure a new virtual warehouse in Snowflake. Warehouses are crucial for query execution and data processing, as they provide the compute resources required to run SQL statements.
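The warehouse DDL itself is short; a sketch with a hypothetical name and sizing, where AUTO_SUSPEND/AUTO_RESUME pause idle compute and resume it on the next query:

```python
# Illustrative CREATE WAREHOUSE statement; analytics_wh and the sizing
# parameters are hypothetical choices, not the task's required values.
create_wh = """
CREATE WAREHOUSE IF NOT EXISTS analytics_wh
  WAREHOUSE_SIZE = 'XSMALL'
  AUTO_SUSPEND = 60
  AUTO_RESUME = TRUE
  INITIALLY_SUSPENDED = TRUE;
"""

print(create_wh.strip())
```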
NDA-safe migration framework: Python quality gate + Snowflake-ready canonical ODS
Moved, cleaned, and transformed JSON data stored in S3 into Redshift.
This project demonstrates Snowflake Streams for change data capture. It covers creating streams to track INSERT, UPDATE, and DELETE operations on tables, loading data from S3, querying captured changes, and managing stream objects for real-time data monitoring.
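The core pattern is a stream on a source table plus a query over its change metadata; table and stream names below are hypothetical:

```python
# A stream records row-level changes to its source table.
create_stream = "CREATE OR REPLACE STREAM orders_stream ON TABLE orders;"

# METADATA$ACTION distinguishes INSERT from DELETE rows; an UPDATE
# appears as a DELETE/INSERT pair with METADATA$ISUPDATE = TRUE.
read_changes = """
SELECT *, METADATA$ACTION, METADATA$ISUPDATE
FROM orders_stream
WHERE METADATA$ACTION = 'INSERT';
"""

print(create_stream)
print(read_changes.strip())
```

Consuming the stream in a DML statement advances its offset, so each change is processed once.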
This project explores Snowflake’s Time Travel feature, including querying historical data using offsets, retention periods, and query IDs. It demonstrates restoring previous table states after updates, managing retention settings, and recovering data efficiently.
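The three access paths covered above (offset, query ID, retention) each map to a short statement; the table name and `<query_id>` placeholder are hypothetical:

```python
# Query the table's state one hour ago via a relative offset (seconds).
at_offset = "SELECT * FROM orders AT (OFFSET => -3600);"

# Query the state just before a specific statement ran.
before_query = "SELECT * FROM orders BEFORE (STATEMENT => '<query_id>');"

# Extend how far back Time Travel can reach for this table.
set_retention = "ALTER TABLE orders SET DATA_RETENTION_TIME_IN_DAYS = 7;"

print(at_offset, before_query, set_retention, sep="\n")
```

Restoring a prior state is then a matter of `CREATE TABLE ... AS` over one of these historical selects.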
This project explores Snowflake’s table types, including Permanent, Temporary, Transient, and External tables. It demonstrates creating tables, loading data from S3 stages, querying and validating data, and understanding differences in persistence, retention, and Time Travel support.
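The persistence differences show up directly in the DDL keyword; a sketch with hypothetical table names, where permanent tables get full Time Travel and Fail-safe, transient tables skip Fail-safe, and temporary tables live only for the session:

```python
ddl = {
    "permanent": "CREATE TABLE sales (id INT, amount NUMBER);",
    "transient": "CREATE TRANSIENT TABLE sales_tmp (id INT, amount NUMBER);",
    "temporary": "CREATE TEMPORARY TABLE sales_session (id INT, amount NUMBER);",
}

for kind, stmt in ddl.items():
    print(f"{kind:>9}: {stmt}")
```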
This project demonstrates how to use Snowflake stages for loading data from Amazon S3 into Snowflake tables. It also covers applying transformations during loading and selecting only specific columns from the source data.
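Transform-on-load works by wrapping a SELECT over the stage inside the COPY; `$1`, `$3` are positional references into the staged file, and the stage and table names are hypothetical:

```python
# COPY INTO that picks only two columns from the staged CSV and
# upper-cases one of them during the load.
copy_transform = """
COPY INTO customers (id, full_name)
FROM (
  SELECT $1, UPPER($3)
  FROM @s3_stage/customers/
)
FILE_FORMAT = (TYPE = 'CSV' SKIP_HEADER = 1);
"""

print(copy_transform.strip())
```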
Automating Data Workflows in Snowflake with Task Scheduling & Management.
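A scheduled task in this style pairs a CRON schedule with the SQL to run; task, warehouse, and table names below are hypothetical, and note that tasks are created suspended, so a RESUME is needed to start the schedule:

```python
create_task = """
CREATE OR REPLACE TASK refresh_daily
  WAREHOUSE = analytics_wh
  SCHEDULE = 'USING CRON 0 2 * * * UTC'
AS
  INSERT INTO daily_summary SELECT CURRENT_DATE, COUNT(*) FROM orders;
"""

resume_task = "ALTER TASK refresh_daily RESUME;"

print(create_task.strip())
print(resume_task)
```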
Building a cloud data warehouse with AWS Redshift.
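The staging step in a Redshift warehouse of this kind is usually a COPY from S3; the bucket, IAM role ARN, and table name below are hypothetical placeholders:

```python
# Bulk-load JSON log files from S3 into a Redshift staging table.
copy_events = """
COPY staging_events
FROM 's3://my-bucket/log_data/'
IAM_ROLE 'arn:aws:iam::123456789012:role/my-redshift-role'
FORMAT AS JSON 'auto'
REGION 'us-west-2';
"""

print(copy_events.strip())
```

`JSON 'auto'` maps JSON keys to column names; a JSONPaths file can be supplied instead when the shapes don't line up.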
This project demonstrates Snowflake table cloning and swapping techniques. It covers creating original and cloned tables, loading data from S3, verifying cloned data, and performing table swaps to efficiently exchange data between staging and production tables.
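The clone-and-swap pattern is two statements; table names below are hypothetical:

```python
# Zero-copy clone: shares the original's micro-partitions until either
# table is modified, so it is instant and initially storage-free.
clone_sql = "CREATE OR REPLACE TABLE orders_clone CLONE orders;"

# SWAP atomically exchanges the two tables' contents and metadata,
# a common way to promote a validated staging table to production.
swap_sql = "ALTER TABLE orders_staging SWAP WITH orders;"

print(clone_sql)
print(swap_sql)
```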