SOCS

Describing graphs of quantitative data 

SOCS Acronym

Shape - relatively symmetric, skewed left/right

Outlier - extremely low or high data points in the distribution

Center - median of the distribution

Spread - range of the distribution 

Types of Skew

Computing Outliers

Low outlier: any value < Q1 - 1.5(IQR)

High outlier: any value > Q3 + 1.5(1QR)

Measures of Center

Mode - most frequent

Mean - average of the data (nonresistant - outliers affect greatly)

Median - middle point of data (resistant - outliers barely affect)

Measures of Spread

Range - largest - smallest (nonresistant) 

Interquartile Range (IQR) - middle 50% of the data (resistant) 

Standard Deviation - on average, how much data varies from the mean (nonresistant)

Variance - standard deviation squared 

Comparing Distributions Numerically Using 5 Number Summary

5 number summary - minimum, Q1, median, Q3, maximum 

1) explain why 5 number summary best fits (nonresistant measurements) 

2) 2 SOCS (one for each graph)

3) write outlier calculation 

4) make 2 comparisons with 5 number summary (don't compare minimum to minimum, Q1 to Q1, median to median, etc.)

5) compare variability and utilize IQR