# Exploratory data analysis ## Learning objectives * Define *exploratory data analysis* (EDA) and types of pattern exploration * Demonstrate types of graphs useful for EDA and precautions when interpreting them * Practice exploring data --- ### Approaches to EDA - Look for _variation_ and _covariation_ - Don't need to have pretty charts (for publishing purposes, you want to add titles, legends, themes etc.) #### Variation - dispersion of one variable - Histograms - w/ Rug plot gives 1D marginal distributions (best used with smaller, high-variance datasets) - Bin width is important! - Continous variables - Bar Chart - Categorical variables #### Covariation - multiple variables (x and y axis) - Categorical on x, continuous on y -> boxplot - Two continuous -> scatterplots - Could also facet or layer variables on a single graph - When layering, use different channels to distinguish patterns