Shubham Sharma
iShubhamSharma
Aspiring Data Scientist
Languages
Loading contributions...
Top Repositories
Recently asked questions in AMCAT Automata section
"Optimized Indexing of unstructured data for Data Lake environment" is a project which is going to deal with indexing pool of unstructured data in Data Lake environment. Data Lake is a repository which hold vast amount of data in its native form. The idea of data lake is to have a single storehouse of all data in an enterprise ranging from the raw data to transformed data which is used for various purposes including visualization, machine learning, analytics and reporting. This project begins with using unstructured data sets containing data in native format, and then indexing it by Inverted Indexing technique using Hashing so as to get optimized results in speed and time.
The National Emissions Inventory (NEI) is a detailed estimate of air emissions that include criteria pollutants and hazardous air pollutants. Fine particulate matter (PM2.5) is an ambient air pollutant for which there is strong evidence that it is harmful to human health.
Logistic regression in Tableau using R
Peer Assessment 1 for Reproducible Research
The repository contains the Tableau desktop twbx file to perform forecasting using R.
Repositories
15Recently asked questions in AMCAT Automata section
"Optimized Indexing of unstructured data for Data Lake environment" is a project which is going to deal with indexing pool of unstructured data in Data Lake environment. Data Lake is a repository which hold vast amount of data in its native form. The idea of data lake is to have a single storehouse of all data in an enterprise ranging from the raw data to transformed data which is used for various purposes including visualization, machine learning, analytics and reporting. This project begins with using unstructured data sets containing data in native format, and then indexing it by Inverted Indexing technique using Hashing so as to get optimized results in speed and time.
The National Emissions Inventory (NEI) is a detailed estimate of air emissions that include criteria pollutants and hazardous air pollutants. Fine particulate matter (PM2.5) is an ambient air pollutant for which there is strong evidence that it is harmful to human health.
Logistic regression in Tableau using R
Peer Assessment 1 for Reproducible Research
The repository contains the Tableau desktop twbx file to perform forecasting using R.
Plotting Assignment 1 for Exploratory Data Analysis
Human Activity Recognition database is built from the recordings of 30 subjects performing activities of daily living (ADL) while carrying a waist-mounted smartphone with embedded inertial sensors.
Repository for Programming Assignment 2 for R Programming on Coursera
Competitive Programming | Recently Asked Array Questions
Sorting Algorithms | Selection Sort, Bubble Sort, Insertion Sort, Merge Sort and Quick Sort
The objective of this repository is to build a login/sign up, and registration page.
Lending Club connects people who need money (borrowers) with people who have money (investors). Hopefully, as an investor you would want to invest in people who showed a profile of having a high probability of paying you back. We will try to create a model that will help predict this.
The objective of this repository is to develop R libraries in C, visualization of data points and comparative study of space time complexity on different platforms.
Basic To Advanced String Programs