GitHunt
DA

danilo-alm/data-profiling

Generate reports with ydata_profiling

Data Profiling

This script uses ydata_profiling to generate a report for all tables within a schema or a select subset of tables, plus a simple HTML table with the percentage of missing and present values per column for quick reference.

Running

Use uv. If you don't have it:

pipx install uv  # or pip

Create your .env and set the required values. Run the script:

uv run python main.py

Config (.env)

Value Description Default
DATABASE_URL Database URL for SQLAlchemy -
OUTPUT_DIR Output directory -
SCHEMA Database schema public
TABLES Comma-separated list of tables All tables

Languages

Python100.0%

Contributors

GNU General Public License v3.0
Created January 24, 2026
Updated January 24, 2026
danilo-alm/data-profiling | GitHunt