GitHunt
MA

MacMat01/pima-indians-outliers-removal-classification

In this project we are removing the outliers before doing the classification. This is done for a research paper purpose for our Research Methodologies exam course

Pima Indians Diabetes โ€“ Outlier Removal & Classification

An analysis based on the well-known Pima Indians Diabetes Dataset focusing on outlier detection and classification using various Machine Learning algorithms.


๐Ÿ“‹ Contents

  • data/ โ€“ Contains the original and cleaned datasets.
  • notebooks/ โ€“ Jupyter notebooks detailing the full analysis pipeline:
    • Data exploration
    • Outlier detection and removal
    • Feature engineering and normalization
    • Model training and comparison
    • Performance evaluation
  • requirements.txt โ€“ List of required Python packages.

๐Ÿš€ Installation & Setup

git clone https://github.com/MacMat01/pima-indians-outliers-removal-classification.git
cd pima-indians-outliers-removal-classification

# (optional) create a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# install dependencies
pip install -r requirements.txt

# launch Jupyter Notebook
jupyter notebook

Languages

Jupyter Notebook100.0%

Contributors

Created June 3, 2025
Updated July 19, 2025
MacMat01/pima-indians-outliers-removal-classification | GitHunt