"topic:simpleimputer" — Search

16 results for “topic:simpleimputer”

No description provided.

anacondalinuxmachine-learningpythonrrstudiosimpleimputersklearnspyder

Code in which an initial approach to decision trees and bagging will be made, and an attempt will be made to ensure that the model can be trained with any dataset coming from Kaggle (for this, we will again use the 'connect with Kaggle' project).

Python30Updated 1 year ago

accuracy-scorebagging-classifiercursesdecision-tree-classifierkagglelabelencoderpandaspythonsimpleimputersklearn-librarytrain-test-split

ZL63388/data-preparation-codes

This repository is a collection of basic code templates for Data Preparation. All codes I am sharing are from the practical exercises I did from the Data Science Infinity Program.

Python21Updated 4 years ago

feature-scalingfeature-selectionknn-imputeronehot-encodingoutlier-detectionpandassimpleimputer

zuhaib1214/Feature-Engineering

This repository is totally focused on Feature Engineering Concepts in detail, I hope you'll find it helpful.

Jupyter Notebook10Updated 2 years ago

binarizationdiscritisationfeature-engineeringfrequent-value-imputationiterative-imputerknn-imputerlabelencodermean-median-imputationnormalisationonehot-encodingordinal-encodingpercentile-methodprincipal-component-analysissimpleimputerstandardizationwinsorizationz-score

BradyFisher/Machine-Learning-Titanic-Project

This is a project where use the Random Forest Classifier and XGBoost Machine Learning Techniques to held predict what passengers survived the sinking of the Titanic.

Jupyter Notebook10Updated 2 years ago

jupyter-notebookmachine-learningmean-absolute-errorone-hot-encodingrandom-forest-classifiersimpleimputertrain-test-splitvalidationxgboost-classifier

BradyFisher/Housing-Prices-Machine-Learning-Project

This is a project where I use the Random Forest Regression and XGBoost Machine Learning Techniques to held predict the Sales Price of Houses..

Jupyter Notebook10Updated 2 years ago

jupyter-notebookmachine-learningmean-absolute-errorone-hot-encodingrandom-forest-regressionsimpleimputertrain-test-splitvalidationxgboost-regression

Agisthemantobeat/Linear-regression-Decision-Tree-Random-Forest-Regression-on-Housing-Data.

This is a machine learning project which implements three different types of regression techniques and formulates differences amongst them by predicting the price of a house based on Boston housing Data.

Python11Updated 5 years ago

decision-treesjobliblinear-regressionnumpypandasrandom-forestsimpleimputersklearnstratified-cross-validation

maheshvarade/Poland-Bankruptcy-Prediction

Poland Bankruptcy Prediction (2009) This project aims to predict whether a Polish company went bankrupt in 2009 based on its financial data. The dataset contains several features derived from companies' balance sheets, and the goal is to build models that can identify bankruptcy effectively — despite the challenge of high class imbalance.

Jupyter Notebook00Updated 11 months ago

decision-tree-classifiermake-pipelinepca-analysisrandom-forest-classifierrandom-oversamplingrandom-undersamplingroccurvesimpleimputersmote-samplingstanderdscaler

Rahitya5/IMPUTATION

while we load the dataset we get some missing values from dataset. so to replace the missing values we use a technique in Machine Learning called Imputation. Imputation --- 1. SimpleImputer 2.KNNImputer

00Updated 1 year ago

knnimputersimpleimputer

Soumyapro/Heart-Disease-Prediction

This project is aimed at predicting the likelihood of coronary heart disease (CHD) in individuals over the next ten years using Logistic Regression.

Jupyter Notebook00Updated 1 year ago

logistic-regressionnumpypandassimpleimputersklearn

Machine-Learning-Related-Projects/Real-Fake-Job-Post

Real-Fake-Job-Post

Jupyter Notebook00Updated 11 months ago

datamodelingdataprepardatapreprocessingdatavisualizationdecision-tree-classifierexploratory-data-analysisfeature-engineeringfeatureimportanceslogistic-regressionmlp-classifiernatural-language-processingrandom-forest-classifiersimpleimputerwordcloud-generator

Marlyn-Mayienga/Titanic-Survival-Prediction

Predicting passenger survival on the Titanic using an ensemble machine learning approach, achieving a Kaggle score of 0.77990. This project leverages stacking with Random Forest, Gradient Boosting, and SVM, enhanced by feature engineering and hyperparameter tuning, to model survival patterns effectively.

Jupyter Notebook00Updated 11 months ago

gradientboostingclassifierhot-encodinglogisticregression-classifiermatplotlib-pyplotonehot-encodingprecisionrandomforestclassifierrecall-precisionsimpleimputerstandardscalersvc

Rnamrata/online-payment-fraud-analysis

The online payment fraud analysis project follows several step approach from data preprocessing through model evaluation, result comparison and final model selection, using transaction patterns to identify fraud indicators including account draining, suspicious transfers, and balance inconsistencies.

Jupyter Notebook00Updated 10 months ago

bayessearchcvgradientboostingclassifiergridsearchcvhistgradientboostingclassifierlgbmclassifierlogisticregressionrandomforestclassifierrandomundersamplerrobustscalersgdclassifiersimpleimputersvcxgbclassifier

saifalibaig/Crop-Yield-Prediction

🌾 A machine learning-based crop production prediction system using historical Indian agricultural data with advanced regression models and hyperparameter tuning.

Jupyter Notebook00Updated 7 months ago

edafeature-encodingfeature-selectionkaggle-datasetlasso-regressionlinear-regressionmatplotlib-pyplotnumpyonehot-encodingpandaspython3ridge-regressionseabornsimpleimputerxgboost-regression

Khushi130404/Titanic_Pipeline

This project predicts whether a person survived the Titanic disaster based on various features using machine learning. It utilizes pipelines, ColumnTransformer, and model serialization for efficient processing and prediction.

Jupyter Notebook00Updated 1 year ago

column-transformerdecisiontreeclassifierminmaxscaleronehotencoderpicklepipelinesimpleimputer

ChanchalSoorma/Data-Analysis-with-Python

This repository provides a comprehensive and hands-on guide to performing data analysis using the essential Python libraries: Pandas, Matplotlib, and Seaborn.

Jupyter Notebook00Updated 6 months ago

data-analysisdata-visualizationmatplotlibnumpypandaspyplotpythonseabornsimpleimputersklearn