Exploring Data with Python

February 16, 2022



Exploring Data with Python - Glenn Lehman

In this presentation, we'll review some mathematical foundations needed to perform detailed data analysis, applied in Python. If you know basic Python, you can join us as we explore data, learn how to use statistical analysis to make predictions, and understand the initial steps of exploring a data set to see what might be lurking there. What makes a data set a valid sample instead of meaningless noise? How can we interpret visualizations to determine the quality of data samples?

Mathematical Topics Covered:

- Population
- Sample
- Categorical and Quatitative Data (Probability Distribution)
- Basic visualizations (Bar Plot, Histograms, Stem and Leaf displays)
- Cross Tabulation - Mean, Median, Mode
- Group Frequency
- Standard Deviation
- Percentiles, Quartiles, Quintiles, Deciles
- Plotting (elementary box plot)
- IQR (interquartile range) - Box Plots (traditional/modified)

Python libraries introduced:

- numpy
- pandas
- matplotlib

Working code will be provided in a Jupyter notebook. This presentation will be recorded.

Meetup link: https://www.meetup.com/bostonpython/events/283752347/

Back to Past Events Page