Repos
43
Stars
32
Forks
13
Top Language
Python
Top Repositories
A business challenge that requires building a data platform for retailer data analytics.
The objective of this project is to predict customer churn and lost opportunities and to provide recommendations to the business team, so the company can apply customer personas in its retention strategy and monitor results through an interactive dashboard.
This project demonstrates how to build a real-time product recommendation system using Pub/Sub Lite and Apache Spark with Dataproc.
This project examines what happens when the first gate in the game is moved from level 30 to level 40. When a player installed the game, he or she was randomly assigned to either gate30 or gate40.
Final project: create an end-to-end credit card fraud pipeline using a lambda architecture (providing access to both batch and stream processing).
The objective of this project is to reduce the company's losses from bad loans by up to 30% by building a machine learning system that helps automate loan assessments.
Repositories
43
A business challenge that requires building a data platform for retailer data analytics.
The objective of this project is to predict customer churn and lost opportunities and to provide recommendations to the business team, so the company can apply customer personas in its retention strategy and monitor results through an interactive dashboard.
This project demonstrates how to build a real-time product recommendation system using Pub/Sub Lite and Apache Spark with Dataproc.
This repository contains various coding exercises, scripts, and SQL examples for learning and practicing different programming and database concepts.
This project demonstrates how to build a real-time analytics pipeline for mobile game data using Google Cloud Pub/Sub and Apache Beam (Dataflow).
This project builds a data pipeline that identifies defaulting bank customers from credit card and loan payment data using Google Dataflow.
This project demonstrates how to build a real-time analytics pipeline using Spark Streaming on Google Cloud Platform (GCP).
This project demonstrates how to build an end-to-end batch processing pipeline using Apache Spark on Google Cloud Platform (GCP).
A list of useful Apache NiFi resources, processor bundles and tools
This project examines what happens when the first gate in the game is moved from level 30 to level 40. When a player installed the game, he or she was randomly assigned to either gate30 or gate40.
Final project: create an end-to-end credit card fraud pipeline using a lambda architecture (providing access to both batch and stream processing).
The objective of this project is to reduce the company's losses from bad loans by up to 30% by building a machine learning system that helps automate loan assessments.
apache-nifi-templates
Practice project: build a Plant API and add a database with JPA.
Build a traveller API with Spring.
Interactive Dashboard with Looker
The objective of this project is to create a system that helps automate loan assessments. The business metrics are daily resolved applications and average resolution time.
The goal of this project is to build a Docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin.
Single-node Cloudera Hadoop in the cloud with the quickstart image, Docker and Docker Compose.
Data ingestion with Airflow, Spark for ETL, and BigQuery as the data warehouse.
Load streaming Bitcoin data into BigQuery with Kafka.
Integrating ksqlDB with PostgreSQL
Ingest data from a web data source and publish it to a Kafka topic so that consumer applications can subscribe and consume the messages.
Spark Data Analytics
A dbt project covering data transformation with star schema modeling, following the Kimball methodology.
Build a data warehouse in BigQuery.
Deploy Apache Airflow with Docker and ingest data into Google Cloud Storage.
Cloud Flower is a project that enables a cloud environment (in this case, Google Cloud Platform) to perform batch Extract, Transform, Load (ETL).
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
Data Engineering on Google Cloud Platform