# WEEK 3 (30-4/12/20) ## Descriptive Statistics (Mon, 30/11/20) ### Types of data ![](https://i.imgur.com/jx7f7jp.png) *Qualitative contains: *Categorical which contains: *Nominal (categories have no connection with each other), Ordinal(categories can be ranked) *Dichotomous: only 2 categories (M or F) ==**Descriptive vs Inferential Statistics**== * Descriptive: describe sample, no conclusion drawn * Inferential: hypothesis testing, draw conclusion about populaiton ==**Descriptive Statistics**== **1. Central Tendency**: identifies a single value as a representative of an entire data * Mean (Arithmetic mean): average * Median: the middlemost number in a sorted set of data * Mode: value that appears the most in a set of data (Bimodal: set of data that has 2 modes) ==**Ad and Dis of Mean, Median, Mode**== ![](https://i.imgur.com/2NGEMz5.png) **2. Dispersion**: describe how scattered the values are and how much they differ from the mean value * Range * Quartiles: values that break down the dataset into quarters * IQR: Q3-Q1, describes the middle 50% of values * Outliers: values that are far from the middle ![](https://i.imgur.com/L29LJnL.png) * Variance: average of the squared differences from the mean * Standard deviation: Square root of variance ![](https://i.imgur.com/OIDaFzc.png) --- ## (Tue, 1/12/20) ### --- ## (Wed, 2/12/20) ### Data Visualization **==Purpose of data visualization==**: to extract insights, make sense of the data **==Types of data==** 1. Qualitative (Categorical) 2. Quantitative (Discrete vs Continuous) **==Types of visualization==** * **Univariate**: describe 1 variable only * Histogram: distribution of data * Box whisker plot * Bar chart * Pie chart * Line plot * **Bivariate**: show relationship between 2 variables * Scatter plot * Heat map * **Mulivariate**:multiple variable * Breakdown dimension **==Anatomy of a figure==** ![](https://i.imgur.com/d9IKdIj.png) **==canvas--->figure--->axes==** --- ## (Thu, 3/12/20) ### --- ## Module Test (Fri, 4/12/20) ### ---