76 results for “topic:cleaning-dataset”
General pipeline used for analyzing EEG data where Raw EEG data gets transformed into ERPS and Stats are done in R (Mixed effects models)
A simple tool for cleaning image datasets at a glance.
Data Analysis
Performed the data exploration and cleaning using SQL for a dataset about an e-commerce store to provide answers for smart business questions.
In this project, I cleaned up a large FIFA 2021 dataset with 18,000+ player records. The data was messy, with inconsistencies in 77 columns. I focused on making the data consistent and usable for analysis. This repository documents my step-by-step process, demonstrating how I transformed the data into a clean format.
NutriTrack is a personalised food recommendation system designed to promote healthier eating habits for both general users and individuals with dietary restrictions such as diabetes and hypertension. It combines structured nutritional data, user profiles, and rule-based filtering logic to suggest suitable meals across different meal times(4).
No description provided.
This repository contains a SQL-based data cleaning project where raw layoffs data was transformed into a clean and structured dataset. The project showcases practical SQL techniques such as duplicate removal, data standardization, null handling, and schema optimization, following real-world data preparation best practices.
This repository contains the full machine learning workflow to predict Global Horizontal Irradiance (GHI) using Saudi Arabia’s weather data (2015–2020).
Project of cleaning of data 'Flats in Moscow and Moscow region'
Binary classification of residential utility problems in NYC; Capstone project for the IBM Certificate in Data Science
No description provided.
This project involves cleaning and preparing data for entip project
Hii i created Explore-Data-Analysis using python , Jupyter Notebook
Use of Beautiful soup to scrap data from couple of websites and perform exploratory analysis
Unsupervised Machine Learning- CyrptoCurrency Analysis, using several models on a cryptocurrency data in order to discover patterns and groups in data. Analysis done to create a report that includes what cryptocurrencies are on the trading market and how they could be grouped in order to create a classification system for potential new investments into the cryptocurrency market.
Flight Data Scraping: Analysis and Visualizations in Tableau
The goal of this project is to analyze data related to a marketing campaign and subsequently develop a machine learning model that can predict customers' response to the campaign. The overall benefit of this application is the efficient utilization of marketing budget.
Iplémentation de techniques d'extractions,de traitements et d'analyse de données
Just cleaning an Airbnb dataset with no more digging
No description provided.
host & listing characteristics to detect illegitimate listing rental
build a models that predicts whether an individual makes over $50,000 per year.
Netflix is a streaming service that offers a wide variety of award winning TV Shows, Movies, Anime, Documentaries, and more. The service primarily distributes original and acquired films and television shows from various genres, and its availability in multiple languages.
Help an organization to improve employee performance and improve employee retention (Reduce Attrition) by creating Interactive and Dynamic HR Analytics Dashboard.
Simple project that extract, clean and process a dataset and import the data to a nosql database. Implementation of a simple app to work with.
DataCamp project from the Associate Data Scientist track, focusing on optimizing dataset storage by transforming data types and filtering. Prepares data for efficient machine learning workflows
Unsupervised Machine Learning- CyrptoCurrency Analysis - For this project I am performing unsupervised machine learning on cryptocurrency data in order to discover patterns and groups in data. The purpose of this analysis is to create a report that includes what cryptocurrencies are on the trading market and how they could be grouped in order to create a classification system for potential new investments into the cryptocurrency.
SAS-based data cleaning and sales reporting project
Analysis of the relationship between social media usage and emotional well-being using Python data analysis and visualization.