GitHunt

Archie Citra Muhammad

archie-cm

Languages

Python45%Jupyter Notebook32%HTML9%Java9%Shell5%

Repos

43

Stars

32

Forks

13

Top Language

Python

Loading contributions...

Top Repositories

Repositories

43
AR
archie-cm/IBM-Data-Engineering-Capstone-Project

Business challenge that requires building a data platform for retailer data analytics.

Jupyter Notebook188Updated 3 years ago
apache-airflowapache-sparkcognos-analyticsdb2-warehouseetl-pipelinemongodbmysqlpostgresql
AR
archie-cm/Churn-Analysis-Ecommerce-Customer

The objective of this project to is to predict customer churn, loss opportunity and provide recommendations to the business team so the company can implement a customer persona in retention strategy and can monitoring throught dashboard interactive.

Jupyter Notebook83Updated 3 years ago
data-sciencefeature-engineeringmachine-learningpythonscikit-learn
AR
archie-cm/real_time_product_recommendations_with_machine_learning_on_gcp

This project demonstrates how to build a real-time product recommendation system using Pub/Sub Lite and Apache Spark with Dataproc

Python10Updated 1 year ago
dataprocpubsublitespark
AR
archie-cm/note_coding

This repository contains various coding exercises, scripts, and SQL examples for learning and practicing different programming and database concepts.

Python00Updated 1 year ago
AR
archie-cm/Mobile_Game_Analysis_Real-Time_Pipeline_with_PubSub_and_Dataflow

This project demonstrates how to build a real-time analytics pipeline for mobile game data using Google Cloud Pub/Sub and Apache Beam (Dataflow).

Python00Updated 1 year ago
apache-beamdataflowstreaming-data
AR
archie-cm/identify_bank_defaulter_customer_with_beam

This project builds a data pipeline to identify bank defaulter customers based on credit card and loan payment data using Google Dataflow

Python00Updated 1 year ago
apache-beamdataflow
AR
archie-cm/real_time_analytics_with_spark_streaming_on_dataproc

This project demonstrates how to build a real-time analytics pipeline using Spark Streaming on Google Cloud Platform (GCP)

Jupyter Notebook00Updated 1 year ago
pubsubspark-streaming
AR
archie-cm/end_to_end_batch_processing_pipeline_with_dataproc

This project demonstrates how to build an end-to-end batch processing pipeline using Apache Spark on Google Cloud Platform (GCP)

Python00Updated 1 year ago
dataprocspark
AR
archie-cm/awesome-nifiFork

A list of useful Apache NiFi resources, processor bundles and tools

00Updated 5 years ago
AR
archie-cm/A-B-Testing-Mobile-Games

This project have objective to examine what happens when the first gate in the game was moved from level 30 to level 40. When a player installed the game, he or she was randomly assigned to either gate30 or gate40.

Jupyter Notebook10Updated 3 years ago
abtestingdata-analysispythonretention-rate
AR
archie-cm/final-project-credit-card-fraud-pipeline

Final Project create an end-to-end credit card fraud pipeline using lambda architecture (providing access to batch and stream processing)

HTML11Updated 2 years ago
AR
archie-cm/Credit_Risk_Model_VIX_ID-X_Partners

The objective project is to decrease the company's losses by up to 30% through bad loans by creating a machine learning system to assist in automating loan assessments

Jupyter Notebook10Updated 3 years ago
credit-riskdata-analysisdata-visualizationmachine-learningscorecard
AR
archie-cm/apache-nifi-templatesFork

apache-nifi-templates

00Updated 4 years ago
AR
archie-cm/plant_API

practice build Plant API ADD A DATABASE WITH JPA

Java00Updated 2 years ago
AR
archie-cm/Travel_Adventures_API

build traveller API with spring

Java00Updated 2 years ago
AR
archie-cm/visualizing-data-and-batch-processing

Interactive Dashboard with Looker

00Updated 3 years ago
AR
archie-cm/Credit-Score-Home-Credit-Indonesia

Obejctive project are create a system to help loan assessments automatically & Business Metrics are daily resolved applications and average resolved time

Jupyter Notebook10Updated 3 years ago
credit-scoringhome-credit-default-riskmachine-learningscorecard
AR
archie-cm/docker-hadoop-bigdata

The goal of this project is to build a docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin.

Shell11Updated 2 years ago
AR
archie-cm/cloudera-docker

Single node Cloudera Hadoop on cloud with quickstart image, docker and docker compose

00Updated 2 years ago
AR
archie-cm/airflow-spark-gcp-docker

ingestion data with airflow, spark for ETL and BigQuery for datawarehouse

Python00Updated 2 years ago
AR
archie-cm/kafka-avro-bigquery

load stream Bitcoin data with kafka into bigquery

Python00Updated 2 years ago
AR
archie-cm/kafka-stream-ksqldb

Integrating ksqlDB with PostgreSQL

HTML00Updated 2 years ago
AR
archie-cm/kafka-streaming-processing

Ingest data from web data source and publish it to a Kafka topic for consumer application to subscribe and consume messages.

Python00Updated 3 years ago
AR
archie-cm/Spark-SQL-and-Data-Frames

Spark Data Analytics

Jupyter Notebook00Updated 3 years ago
AR
archie-cm/dimensional-modelling-dbt

dbt project about data transformations process with star schema modeling concepts using the Kimball methodology

00Updated 3 years ago
AR
archie-cm/data-warehouse

Build data warehouse in BigQuery

00Updated 3 years ago
AR
archie-cm/ingestion-with-airflow

Deploy Apache Airflow with Docker and ingesting data to Google Cloud Storage

Python00Updated 3 years ago
AR
archie-cm/cloud-flower

Cloud Flower is a project which empowers Cloud environment (in this case Im using Google Cloud Platform) to perform Batch Extract Transform Load (ETL)

Python00Updated 3 years ago
AR
archie-cm/training-data-analystFork

Labs and demos for courses for GCP Training (http://cloud.google.com/training).

00Updated 3 years ago
AR
archie-cm/data-engineering-gcpFork

Data Engineering on Google Cloud Platform

00Updated 6 years ago

Gists

Recent Activity

Archie Citra Muhammad (archie-cm) | GitHunt