SOCS
Describing graphs of quantitative data
SOCS Acronym
Shape - relatively symmetric, skewed left/right
Outlier - extremely low or high data points in the distribution
Center - median of the distribution
Spread - range of the distribution
Types of Skew
Computing Outliers
Low outlier: any value < Q1 - 1.5(IQR)
High outlier: any value > Q3 + 1.5(IQR)
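A minimal Python sketch of the two outlier rules above. The helper names (quartiles, outliers) and the sample scores are made up for illustration. Q1 and Q3 are assumed to be the medians of the lower and upper halves of the sorted data; calculators and software may use slightly different quartile rules.

```python
from statistics import median

def quartiles(data):
    """Return (Q1, Q3) as the medians of the lower and upper halves.

    Assumption: when n is odd, the middle value is left out of both halves.
    """
    s = sorted(data)
    half = len(s) // 2
    return median(s[:half]), median(s[-half:])

def outliers(data):
    """Return every value below Q1 - 1.5(IQR) or above Q3 + 1.5(IQR)."""
    q1, q3 = quartiles(data)
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    return [x for x in data if x < low_fence or x > high_fence]

scores = [2, 4, 5, 6, 7, 8, 9, 30]   # hypothetical data
print(quartiles(scores))             # (4.5, 8.5), so IQR = 4
print(outliers(scores))              # [30], since 30 > 8.5 + 1.5(4) = 14.5
```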
Measures of Center
Mode - most frequent
Mean - average of the data (nonresistant - outliers affect greatly)
Median - middle point of data (resistant - outliers barely affect)
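To see what resistant and nonresistant mean in practice, here is a small sketch using Python's standard statistics module. The data values are hypothetical; the point is only to compare each measure before and after one extreme value is added.

```python
from statistics import mean, median, mode

data = [3, 4, 4, 5, 6]            # hypothetical data
with_outlier = data + [60]        # same data plus one extreme value

# Mean is nonresistant: one outlier pulls it far from the bulk of the data.
print(mean(data), mean(with_outlier))       # 4.4 vs about 13.67

# Median is resistant: it barely moves.
print(median(data), median(with_outlier))   # 4 vs 4.5

# Mode is the most frequent value and is unchanged here.
print(mode(data), mode(with_outlier))       # 4 vs 4
```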
Measures of Spread
Range - largest value minus smallest value (nonresistant)
Interquartile Range (IQR) - Q3 minus Q1, the spread of the middle 50% of the data (resistant)
Standard Deviation - roughly the typical distance of the data values from the mean (nonresistant)
Variance - standard deviation squared
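The measures of spread above can be sketched in Python as follows. The function names and data are illustrative only. The IQR helper assumes Q1 and Q3 are the medians of the lower and upper halves, and stdev/variance from the statistics module are the sample versions (dividing by n - 1).

```python
import math
from statistics import median, stdev, variance

def data_range(data):
    """Largest value minus smallest value."""
    return max(data) - min(data)

def iqr(data):
    """Q3 - Q1, with quartiles taken as medians of the two halves (an assumption)."""
    s = sorted(data)
    half = len(s) // 2
    return median(s[-half:]) - median(s[:half])

plain = [2, 4, 5, 6, 7, 8, 9, 10]    # hypothetical data
extreme = [2, 4, 5, 6, 7, 8, 9, 30]  # same data with the maximum replaced by an outlier

print(data_range(plain), data_range(extreme))  # 8 vs 28: range is nonresistant
print(iqr(plain), iqr(extreme))                # 4.0 vs 4.0: IQR is resistant
print(stdev(plain) < stdev(extreme))           # True: standard deviation grows with the outlier

# Variance is the standard deviation squared (up to floating-point rounding).
print(math.isclose(variance(plain), stdev(plain) ** 2))
```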
Comparing Distributions Numerically Using 5 Number Summary
5 number summary - minimum, Q1, median, Q3, maximum
1) explain why the 5 number summary fits best (mean and standard deviation are nonresistant measurements)
2) 2 SOCS (one for each graph)
3) write outlier calculation
4) make 2 comparisons with 5 number summary (don't compare minimum to minimum, Q1 to Q1, median to median, etc.)
5) compare variability and utilize IQR
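As a rough aid for the steps above, this Python sketch computes the 5 number summary and IQR for two distributions so they can be set side by side. The function name five_number_summary and both data sets are hypothetical, and the quartiles again assume the median-of-halves rule. The written comparison in context is still up to you.

```python
from statistics import median

def five_number_summary(data):
    """Return (minimum, Q1, median, Q3, maximum).

    Assumption: Q1 and Q3 are the medians of the lower and upper halves.
    """
    s = sorted(data)
    half = len(s) // 2
    return (s[0], median(s[:half]), median(s), median(s[-half:]), s[-1])

class_a = [2, 4, 5, 6, 7, 8, 9, 30]      # hypothetical data
class_b = [1, 3, 5, 7, 9, 11, 13, 15]    # hypothetical data

for name, data in (("A", class_a), ("B", class_b)):
    low, q1, med, q3, high = five_number_summary(data)
    print(name, (low, q1, med, q3, high), "IQR =", q3 - q1)
# A (2, 4.5, 6.5, 8.5, 30) IQR = 4.0
# B (1, 4.0, 8.0, 12.0, 15) IQR = 8.0
```

Here class B has the larger IQR (8 versus 4), so its middle 50% is more variable, even though class A has the larger range because of its high outlier.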