"topic:data-standardization" — Search

41 results for “topic:data-standardization”

A modular ecosystem under this. namespace.

ai-ready-data data-interoperability data-standardization data-transformation deep-learning-preprocessing intelligent-data-handling machine-learning-data modular-framework neurons-me-ecosystem structured-data structured-media-processing

NhanPhamThanh-IT/SVM-Diabetes-Prediction (Archived)

🩺 Machine learning diabetes prediction model using a Support Vector Machine (SVM) classifier. Analyzes 8 medical features (glucose, BMI, age, etc.) from the Pima Indian dataset to predict diabetes risk with 75-80% accuracy. Built with Python, scikit-learn, and pandas. Includes data preprocessing, model training, and a prediction system for diabetes.

Jupyter Notebook · 15 · 1 · Updated 8 months ago

accuracy-score artificial-intelligence classification data-collection data-standardization dataset healthcare jupyter-notebook learning-materials machine-learning-algorithms model-evaluation model-training numpy pandas prediction-system python scikit-learn supervised-learning support-vector-machine train-test-split

rutishauserlab/workingmem-release-NWB

Example code accompanying the Sternberg concept cell data release for Kyzar et al. (2024)

MATLAB · 12 · 0 · Updated 2 years ago

cognitive-neuroscience data-standardization neurophysiology neurosurgery nwb open-source single-neurons working-memory

DAF-Digital-Transformation-Office/CyberDataSchema

A digital transformation of cyber assessment and authorization data with a relational schema

5 · 1 · Updated 3 months ago

assessment authorization cyber-risk-quantification cybersecurity data-modeling data-schema data-standardization decomposition-model interoperability risk-management rmf threat-intelligence

themrityunjaypathak/Feature-Engineering

Feature Engineering with Python

Jupyter Notebook · 3 · 0 · Updated 2 months ago

column-transformer data-normalization data-standardization dummy-variables imbalanced-data iqr knn-imputer label-encoding modified-zscore onehot-encoding ordinal-encoding outlier-removal pipeline simple-imputer zscore

AtlasOfLivingAustralia/corella

Prepare and check data to comply with Darwin Core Standard in R

R · 3 · 1 · Updated 2 months ago

darwin-core data-standardisation data-standardization ecology r rstats

zedomel/thesis_2023

Unifying Biotic Interactions Data: Terminology, Data Analysis, Standardization, and Proposal of a Data Schema for Plant-Pollinator Interactions

Jupyter Notebook · 2 · 0 · Updated 2 years ago

biotic-interactions darwin-core data-standardization plant-pollinator-interactions

erreduarte/data-migration-project

Highlighting expertise in data migration, data normalization and standardization, this project demonstrates successful data transfer from Snowflake to Databricks. It emphasizes optimized data flow and enhanced accessibility through standardization, showcasing a commitment to ethical data practices.

2 · 1 · Updated 1 year ago

data-management data-manipulation data-migration data-modeling data-normalization data-optimization data-standardization databricks snowflake sql

DevasivaBA/quickbooks_invoice_analysis

A Python-based data cleaning project to streamline QuickBooks invoice data for analysis, paving the way for improved insights into sales, pricing, and inventory management.

Jupyter Notebook · 2 · 0 · Updated 1 year ago

dashboards data-cleaning data-manipulation data-modeling data-standardization forecasting powerbi python-script quickbooks-desktop

Fedi-AB/SQL_Data_Warehouse_Project

Building a modern data warehouse with SQL Server, including ETL Processes, Data Modeling and Analytics

TSQL · 1 · 0 · Updated 4 months ago

cte data-aggregation data-analytics data-cleaning data-engineering data-integration data-modeling data-schema data-standardization data-tables data-warehouse-architecture database datascience etl etl-pipeline medallion-architecture sql sql-server stored-procedures views

chigwell/drone-capability-parser

A new package processes textual descriptions of drone designs to extract structured summaries of their operational capabilities. It focuses on identifying and categorizing key features such as locomot

Python · 1 · 0 · Updated 3 months ago

actuation-system data-standardization design drone engineering-summary feature-extraction flight locomotion multimodal-functionality natural-language-processing unified-actuation wheeled-mobility

StatAziz/Global-Layoffs-Data-Cleaning-with-SQL

This project is about cleaning and preparing a global layoffs dataset for analysis, focusing on handling null values, correcting data types, and ensuring data integrity for more accurate insights.

1 · 0 · Updated 1 year ago

data-cleaning data-standardization layoffs mysql sql sql-server

Rihana5rose/Career-Aspirations-of-Gen-Z

This data analytics project focuses on understanding the career preferences and motivations of Generation Z. Through survey data and analysis, it aims to identify key trends and factors influencing their career choices, providing insights for employers, educators, and recruiters looking to engage with this new generation of talent.

1 · 0 · Updated 1 year ago

data-analysis data-analyst-projects data-cleaning data-standardization data-visualization excel google-forms mysql pivot-tables

chigwell/vuln-structure

vuln-structure is a package that extracts vulnerability details from raw text and outputs standardized, structured data for security teams.

Python · 1 · 0 · Updated 3 months ago

accuracy-upgrade affected-systems cybersecurity-flaws data-standardization efficiency-improvement information-extraction it-security llmatch-messages natural-language-processing potential-impact recommended-actions remote-code-execution response-acceleration security-vulnerability text-processing threat-assessment unstructured-text-analysis vulnerability-extraction vulnerability-management watchguard-firewall

softwaresalt/csv-managed

csv-managed is a Rust command-line utility for high‑performance exploration and transformation of CSV data at scale, emphasizing streaming, typed operations, and reproducible workflows via schema and index files.

Rust · 1 · 0 · Updated 2 weeks ago

big-data cli-app data-cleansing data-engineering data-standardization data-transformation data-wrangling high-performance ml-engineering

sahiltech55/KultureHire-Data-Analyst-Internship-Milestones

Hi folks, during my internship at KultureHire, I completed an end-to-end data analytics project. I created an executive and functional dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

1 · 0 · Updated 1 year ago

data-analytics data-cleaning data-standardization data-visualization excel mysql pivot-tables

Abdullah321Umar/Internee.pk-DataAnalytics_Internship-Assignment5

🌟 Data Cleaning and Processing 🌟 Handled missing values, removed duplicates, standardized salary formats, and treated outliers for consistency. Revealed trends in company performance, job roles, and salary distributions after refining the dataset. This project highlights the power of data preprocessing as the backbone of reliable analytics.

Jupyter Notebook · 1 · 0 · Updated 4 months ago

aesthetic-design analytical-thinking cleaning communication conversion data-cleaning data-interpretation data-standardization data-transformation eda handling-missing-data jupyter-notebook matplotlib numpy palettes pandas python-programming transformation vs-code

priyashadapad/sql_data_cleaning_project

This repository contains a SQL-based data cleaning project where raw layoffs data was transformed into a clean and structured dataset. The project showcases practical SQL techniques such as duplicate removal, data standardization, null handling, and schema optimization, following real-world data preparation best practices.

1 · 0 · Updated 2 months ago

analytics cleaning-dataset data-preparation data-quality data-standardization duplicate-detection sql

mzr312312/iot-config-ledger

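A Python-based ETL pipeline for standardizing heterogeneous IoT configuration data across 12 manufacturing sites. It provides automatic schema mapping, multi-source merging, and daily change-log generation for configuration lifecycle management. It automates the aggregation of 500,000+ IoT assets and generates daily audit trails, ensuring consistency between platform logic and edge-side implementation.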

Python · 0 · 0 · Updated 1 month ago

change-log-automation configuration-audit data-standardization etl-pipeline iot-integration

andrewnana/CDISC-Data-Standardization-with-R-and-SAS

CDISC data standardization with SAS and R

HTML · 0 · 0 · Updated 5 months ago

adam cdash cdisc clinical-data-standards clinical-programming data-analysis data-standardization data-transformation data-visualization r regulatory-compliance sas sas-proc-sgplot sdtm statistical-reporting tlf

VaishnaviKenjale/Career-Aspirations-Of-Gen-Z

☺️ Hi folks, during my internship at KultureHire, I completed a real-world data analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.

0 · 1 · Updated 1 year ago

data-analysis data-analyst data-cleaning data-cleaning-and-preprocessing data-standardization excel gen-z genz-aspirations mysql pivot-tables

salsabila-rahmah/SQL-Based-Data-Preparation-and-Validation-for-BI-Analysis

This project uses SQL to transform messy transactional sales data into a clean, validated dataset for accurate KPI and profitability analysis before BI reporting. I also built a Tableau Public dashboard from this final dataset; it can be viewed via the link below.

0 · 0 · Updated 3 weeks ago

bussiness-analyst bussiness-intelligence bussiness-logic data-analysis data-analytics data-cleaning data-control data-standardization data-structures data-validation erd-diagram percentile-method sql sql-aggregation sql-cte sql-joins-and-relationships sqlite star-schema window-functions

lsgggggg/Excel-Standardizer

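🧹 Excel data standardization and cleaning tool | 100+ smart rules · two-stage safe processing · formulas left untouched · item-by-item review · change-log export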

Python · 0 · 0 · Updated 2 weeks ago

data-cleaning data-standardization excel excel-tools flask python test-normalization

katrina-maestro/CallCentre_DataCleaning

The call center provided a messy dataset of customers. The objective was to clean, standardize, and remove duplicates to create an accurate, organized contact list. I used Pandas to load, explore, clean, and export the data, delivering a refined list ready for effective customer outreach.

Jupyter Notebook · 0 · 0 · Updated 1 year ago

data-cleaning data-standardization pandas pandas-dataframe

muthazir/sql-data-cleaning-project

A practical SQL data cleaning project that standardizes and prepares the Global Layoffs dataset for analysis using SQL techniques like window functions, staging tables, and data quality checks.

0 · 0 · Updated 1 month ago

data-cleaning data-preprocessing data-quality data-standardization database dataset etl global-layoffs mysql sql window-functions

The-National-Neighborhood-Data-Archive/data-curation-templates

Standardized Stata templates for NaNDA data curation, quality control, and publication workflows

Stata · 0 · 0 · Updated 1 month ago

census-data data-cleaning data-curation data-quality data-quality-checks data-standardization geospatial-data neighborhood neighborhoods open-data open-datasets public-health research-data research-data-management stata

mmzong/PCA_BreastCancerDetection

Tutorial code for performing PCA (with mathematical explanation) on breast cancer features computed from digitized images of fine needle aspirate (FNA) of a breast mass. Center the data, calculate correlation matrix, compute principal components, visualize and interpret results.

0 · 0 · Updated 1 year ago

bar-plot biplot correlation-matrix data-analysis data-manipulation data-science data-standardization data-visualization dplyr factoextra ggcorrplot ggplot2 pca-analysis r scatter-plot scree-plot tutorial tutorial-code

KeerthanaPalanikumar/Data-Cleaning-on-SQL

This repository contains SQL scripts and documentation for cleaning and standardizing data in the NashvilleHousing table within the sqlproject2 database. The project aims to prepare the dataset for analysis by addressing inconsistencies, filling missing values, standardizing formats, and removing duplicates.

0 · 0 · Updated 1 year ago

data-cleaning data-deduplication data-manipulation data-standardization database-management mssql ssms

Rihana5rose/KultureHire-Projects

This repository contains the projects completed as part of the KultureHire internship program. The projects focus on real-world business and data analysis problems, covering data collection, cleaning, analysis, visualization, and insight generation using tools such as Excel, SQL, and Power BI.

0 · 0 · Updated 3 months ago

advanced-excel data-standardization data-visualization eda pivot-tables power-query-editor powerbi sql

Sinnick4r/Analisis-ingreso-causas-2024 (Archived)

This project includes a detailed data-cleaning process for court records to generate relevant statistics, using Excel and Power BI. It also includes an interactive visualization of the processed data.

0 · 0 · Updated 1 year ago

data-cleaning data-cleaning-and-preprocessing data-standardization excel microsoft-excel microsoft-powerbi powerbi statistics

Page 1 of 2