Rakshit Sinha

Raksh710

Graduate MS Information Systems Student at RH Smith School of Business (University of Maryland, College Park). An aspiring Data Scientist.

Robert H Smith School of Business - University of Maryland, College Park

College Park, MD, USA

https://rakshitsinha.net/

Languages

Jupyter Notebook52%HTML36%Python12%

Loading contributions...

Top Repositories

Shark_Tank_Analysis

Predicting whether a company on ABC's popular program "Shark Tank" accepted an offer is our goal. Entrepreneurs present their ideas to a panel of investors on the show Shark Tank, which debuted in 2009. In exchange for stock, the entrepreneur requests that the investors invest money.

1Jupyter Notebook

Sentiment-Analysis-YouTube-Comments

Uses Google's v3 API to get the top 100 relevant comments and do a sentiment analysis on each comment and then, finally return the 'Average' sentiment. The application is hosted using Salesforce Heroku which is a PaaS.

1Python

Anime_Recommender_System

Recommends Anime using Content based filtering (using TFIDF vectorization and sigmoid kernel) and collaborative filtering (using KNN)

1HTML

Data_Scientist_Salaries

Predicting the Salary of data science jobs (for example Data Scientist, Data Engineer, Machine Learning Engineer, Data Analyst, BI Engineer etc.) in USD based on various factors like Work Year (the year in which you are looking for job), Pay grade, Average pay scale in the Country (where the job is located), experience level, Employment type etc.

1Jupyter Notebook

Plotly-dash-stock-dashboard

Dynamic Dashboard created using plotly-dash for stock price historical values.

1Python

King_county_house_price_regression

Did a comparison between CatBoostRegressor and Keras to find out which model performed best on king county house price regression dataset from kaggle. Link to the notebook: https://www.kaggle.com/raksh710/catboost-vs-keras-cb-wins

1HTML

Repositories

Raksh710/Shark_Tank_Analysis

Jupyter Notebook10Updated 3 years ago

behavioral-analyticsconsumer-analyticsflask-applicationheroku-deploymentkaggleknn-classificationpythonshark-tank-analysis

Raksh710/Sentiment-Analysis-YouTube-Comments

Python11Updated 2 years ago

Raksh710/Diet_recommendation_fastembed

Recommends a recipe along with its full description using fastembed library and

Python00Updated 1 year ago

dockerdocker-imagedockerfiledockerhub-imagefastapifastembedheroku-appheroku-deploymentnlp-machine-learningnumpypandasrecommender-systemuvicorn

Raksh710/Raksh710

Config files for my GitHub profile.

00Updated 1 year ago

configgithub-config

Raksh710/MNIST

The input data contained image data ( grayscale(color_scale = 1) data of width=28, height=28) of digits from 0 to 9 which are to be identified by the model. I implemented CNN which consisted of convolutional layers as well as MaxPool layers. I achieved 99.6 % accuracy on the test set. Link to my notebook: https://www.kaggle.com/raksh710/mnist-using-cnn-99-6-test-accuracy

HTML00Updated 4 years ago

Raksh710/Comment-Processing-tool

Comment Processing-Tool

Jupyter Notebook00Updated 2 years ago

Raksh710/Anime_Recommender_System

Recommends Anime using Content based filtering (using TFIDF vectorization and sigmoid kernel) and collaborative filtering (using KNN)

HTML10Updated 2 years ago

animecollaborative-filteringcontent-based-recommendationkagglerecommendation-systemtfidf-text-analysis

Raksh710/PDF-Bot

Customized chatbot for a particular PDF file

Jupyter Notebook00Updated 2 years ago

Raksh710/ice_breakerFork

ice_breaker project forked from emarco177 to test Langchain's capabilities with various APIs

00Updated 2 years ago

apilangchainlangchain-pythonllmsopenaiopenai-api

Raksh710/Predict-Sleep-Rob-Mulla-Kaggle-Competition

Predict My Sleep is a Kaggle Competition hosted by Rob Mulla (Youtuber and Twitch Streamer). Tryint to predict his sleep patterns since 2022 using historic data.

Jupyter Notebook00Updated 2 years ago

Raksh710/Space-Titanic-Kaggle

Rank-121 as of March 20, 2023. The task is to predict whether a passenger was transported to an alternate dimension during the Spaceship Titanic's collision with the spacetime anomaly. To make these predictions, we're given a set of personal records recovered from the ship's damaged computer system.

Jupyter Notebook00Updated 2 years ago

Raksh710/Building-Efficient-Portfolio-Using-Various-Trade-Strategies

Building an efficient Active Portfolio which yields a high Sharpe Ratio on 8 instruments using various trade strategies in order to get a high Sharpe Ratio.

Jupyter Notebook00Updated 3 years ago

capmfinancial-informationmonte-carlo-simulationportfolio-constructionquantitative-financeyfinance-library

Raksh710/Data_Scientist_Salaries

Jupyter Notebook10Updated 3 years ago

flask-applicationheroku-deploymentrandom-forestregressionregression-modelssalary-prediction

Raksh710/Loan_Default_Prediction

A major chunk of bank revenue is generated by credit cards. Customers who fail to pay their credit card dues on time could potentially cost banks a lot of revenue. Issuing credit cards to customers who have a higher likelihood of not paying their dues on time involves a higher risk for the bank. Issuing these customers' cards with a higher interest rate would work in favor of the bank. Inorder to make a informed decision about which customer is high risk and which one is low risk, the firm would benefit from a predition model which would accurately predict if the customer would default or not. Prediction can be done based on factors like job, education, balance, loans, and house ownership. Finding out which are the most common factors that defaulters have will also help the bank to be cautious before issuing a credit card to customers who fall into one of those categories.

HTML00Updated 4 years ago

Raksh710/Plotly-dash-stock-dashboard

Dynamic Dashboard created using plotly-dash for stock price historical values.

Python10Updated 3 years ago

Raksh710/PySpark_DDOS_attack_classification

No description provided.

Jupyter Notebook00Updated 3 years ago

Raksh710/AJAX-Movie-Recommendation-System-with-Sentiment-AnalysisFork

Content-Based Recommender System recommends movies similar to the movie user likes and analyses the sentiments on the reviews given by the user for that movie.

Jupyter Notebook00Updated 3 years ago

Raksh710/My_Resume

My Resume

00Updated 3 years ago

Raksh710/Landscape_Classification

Given an input image, classify the image in the following category: 'buildings': 0, 'forest': 1, 'glacier': 2, 'mountain': 3, 'sea': 4, 'street': 5 <br> </br> Above are the keys along with their tag (or value) are mentioned. A CNN model has been used with 3 Conv2D, 3 MaxPool2d, 1 Flatten, one dropout and 2 Dense layers. <br> </br> After training the CNN model on 14034 images belonging to 6 classes, the CNN model was validated on a validation set with 3000 images belonging to 6 classes, on which an accuracy of 84.17% was achieved. Steps: 1) Specify train, validation and test directory (where images are stored) 2) Use Image Generator to create more samples out of the given number of training samples (in order to detect the class more accurately). Images went through various processes like: zoomed in/out, sheared, rorated etc. 3) Images from train and validation were subjected to the Image Generator created in step: 2. Note that in training the shuffle was True and that in validation it was False, because we want to keep the validation set in order to evalue the accuracy (which required the images to be in order) 4) Image samples from train directory were fed to the CNN model and evaluated on the validation directory. 5) Image samples from test directory were also predicted and evaluated manually.

HTML00Updated 3 years ago

Raksh710/King_county_house_price_regression

HTML10Updated 4 years ago

Raksh710/Malware_Attack_Classification

We are working on UMD's info challenge and our dataset is ISCXIDS2012 cybersecurity dataset.

Jupyter Notebook00Updated 3 years ago

Raksh710/Malicious_Website_Recognition

Classifying Malicious website from benign ones using CatBoost Classifier. Process involves Exploration of data, Data Cleaning, Resampling of data (to handle highly imbalanced data), Model implementation and Evaluation.

HTML00Updated 4 years ago

Raksh710/Healthcare_analytics

The task is to correctly predict the number of days a patient would be staying in a hospital, out of 10 different categories. 16 different parameters were given. EDA, Feature Engineering, resampling has been performed to properly do data preprocessing. Ultimately CatBoost Classification model has been implemented to achieve more than 41% accuracy.

Jupyter Notebook00Updated 4 years ago

Raksh710/Feature-Engineering-Live-sessionsFork

No description provided.

00Updated 4 years ago

Raksh710/covid-19_tweets_sentiment_analysis

Predicted the sentiment associated with tweets made on the topic of Covid-19 pandemic. Tweets were classified into "Positive", "Extremely Positive", "Neutral","Negative" and "Extremely Negative". TF-IDF Vectorization was used to vectorize the tokens present in the tweets and then to classify "CatBoost" algorithm was used. Ultimately achieving an accuracy of around 57%.

HTML00Updated 4 years ago

Raksh710/Medical_Personal_Cost

Task was to forecast the medical cost associated with each patient given their medical parameters and health history. CatBoost algorithm was implemented on the data after scaling (Standardization) was done.

HTML00Updated 4 years ago

Raksh710/CNN_for_detecting_pneumonia

Using CNN to detect and classify which chest x-ray images have pneumonia and which ones are normal. The data is taken from Kaggle platform. : https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia

HTML00Updated 4 years ago

Raksh710/Flower_detection_using_CNN

Flower detection using CNN

00Updated 4 years ago

Raksh710/Heart_Attack_Analysis_and_Prediction

Performed an analysis on a dataset and predicting which patients are more likely to suffer from a heart attack. link: https://www.kaggle.com/raksh710/87-accuracy-85-f1-score-knn-14-lr-svc-rf-cbc The dataset is available on kaggle and so is my notebook on this

Jupyter Notebook00Updated 4 years ago

Raksh710/Bike_Sharing_Data

Implementing various ML Regression model on bike sharing data shared by Capital Bikeshare (Washington D.C.)

Jupyter Notebook00Updated 4 years ago

Rakshit Sinha

Languages

Loading contributions...

Top Repositories

Repositories

Gists

Recent Activity