16 results for “topic:simpleimputer”
No description provided.
Code in which an initial approach to decision trees and bagging will be made, and an attempt will be made to ensure that the model can be trained with any dataset coming from Kaggle (for this, we will again use the 'connect with Kaggle' project).
This repository is a collection of basic code templates for Data Preparation. All codes I am sharing are from the practical exercises I did from the Data Science Infinity Program.
This repository is totally focused on Feature Engineering Concepts in detail, I hope you'll find it helpful.
This is a project where use the Random Forest Classifier and XGBoost Machine Learning Techniques to held predict what passengers survived the sinking of the Titanic.
This is a project where I use the Random Forest Regression and XGBoost Machine Learning Techniques to held predict the Sales Price of Houses..
This is a machine learning project which implements three different types of regression techniques and formulates differences amongst them by predicting the price of a house based on Boston housing Data.
Poland Bankruptcy Prediction (2009) This project aims to predict whether a Polish company went bankrupt in 2009 based on its financial data. The dataset contains several features derived from companies' balance sheets, and the goal is to build models that can identify bankruptcy effectively — despite the challenge of high class imbalance.
while we load the dataset we get some missing values from dataset. so to replace the missing values we use a technique in Machine Learning called Imputation. Imputation --- 1. SimpleImputer 2.KNNImputer
This project is aimed at predicting the likelihood of coronary heart disease (CHD) in individuals over the next ten years using Logistic Regression.
Real-Fake-Job-Post
Predicting passenger survival on the Titanic using an ensemble machine learning approach, achieving a Kaggle score of 0.77990. This project leverages stacking with Random Forest, Gradient Boosting, and SVM, enhanced by feature engineering and hyperparameter tuning, to model survival patterns effectively.
The online payment fraud analysis project follows several step approach from data preprocessing through model evaluation, result comparison and final model selection, using transaction patterns to identify fraud indicators including account draining, suspicious transfers, and balance inconsistencies.
🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.
This project predicts whether a person survived the Titanic disaster based on various features using machine learning. It utilizes pipelines, ColumnTransformer, and model serialization for efficient processing and prediction.
This repository provides a comprehensive and hands-on guide to performing data analysis using the essential Python libraries: Pandas, Matplotlib, and Seaborn.