Python Data Science Notebooks

Python Data Science Notebooks

Description


For statistics class, I used various online datasets and cleaned and analyzed them using NumPy/Pandas. I used Pandas to analyze things like confidence intervals and probability, and then afterwards visualized the data findings using Matplotlib.

Examples


Pandas Notebook 1 - Airbnb Data

  • Used the data in order to see what percentage of user sessions result in an airbnb booking
  • Found what day of the week users are most active
  • Determined the distribution of devices users use to access the site

Pandas Notebook 2 - IGN Game Scores

  • Used a histogram to show the frequency distribution of IGN game scores
  • Determined the most popular genres and platforms
  • Found probability of any game having score over 8/10

Pandas Notebook 3 - iPhone vs Android Speedtest

  • Created graphical distribution of iOS and Android manufacturers
  • Calculated averages of different countries internet speeds
  • Calculated confidence intervals for download speeds

Picture 1

Picture 1