GitHunt
RO

roshankoirala/pySpark_tutorial

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

pySpark_tutorial

List of contents

  • RDDs and DataFrame
  • Exploratory data analysis
  • Handeling multiple dataframes
  • Visualization
  • Machine learning

Languages

Jupyter Notebook100.0%

Contributors

Created August 18, 2020
Updated April 23, 2025
roshankoirala/pySpark_tutorial | GitHunt