Mathematics
Grade 8
15 min
Identify an outlier
Identify an outlier
Tutorial Preview
1
Introduction & Learning Objectives
Learning Objectives
Define what an outlier is in a data set.
Calculate the median, first quartile (Q1), and third quartile (Q3) of a given data set.
Compute the Interquartile Range (IQR) for a data set.
Apply the IQR rule to determine the lower and upper bounds for identifying outliers.
Identify specific data points that are outliers using the calculated bounds.
Explain the potential impact of outliers on data analysis.
Ever seen a number in a list that just doesn't seem to fit in with the rest? 🤔 It's like a basketball player who's suddenly 7 feet tall in a team of 5-footers!
In this lesson, you'll learn how to spot these 'unusual' numbers, called outliers, in any data set. Understanding outliers is important because they can sometimes skew o...
2
Key Concepts & Vocabulary
TermDefinitionExample
OutlierA data point that is significantly different from other data points in a set. It lies an abnormal distance from other values.In the data set {10, 12, 11, 100, 13}, the number 100 is likely an outlier because it's much larger than the other numbers.
Data SetA collection of related numerical or categorical information.The ages of students in a class: {13, 14, 13, 15, 14} is a data set.
MedianThe middle value of an ordered data set. If there's an even number of data points, it's the average of the two middle values.For {2, 5, 8, 10, 12}, the median is 8. For {2, 5, 8, 10}, the median is (5+8)/2 = 6.5.
First Quartile (Q1)The median of the lower half of an ordered data set. It represents the 25th percentile.For {1, 2, 3, 4, 5, 6, 7}, the median is 4....
3
Core Formulas
Interquartile Range (IQR) Formula
$IQR = Q_3 - Q_1$
This formula calculates the spread of the middle 50% of your data, which is crucial for setting the outlier boundaries.
Lower Bound for Outliers
$Lower Bound = Q_1 - 1.5 \times IQR$
Any data point that is less than this calculated value is considered a lower outlier.
Upper Bound for Outliers
$Upper Bound = Q_3 + 1.5 \times IQR$
Any data point that is greater than this calculated value is considered an upper outlier.
Outlier Identification Rule
A data point $x$ is an outlier if $x < Lower Bound$ or $x > Upper Bound$.
This rule combines the bounds to definitively identify any data point that falls outside the acceptable range.
5 more steps in this tutorial
Sign up free to access the complete tutorial with worked examples and practice.
Sign Up Free to ContinueSample Practice Questions
Challenging
Consider the data set {10, 20, 22, 25, 28, 30}. What is the smallest integer that could be added to this set to be classified as an upper outlier?
A.45
B.46
C.41
D.40
Challenging
The data set {5, 80, 82, 85, 88, 90, 95} represents quiz scores. The score '5' is an outlier. If this outlier is removed, what is the primary effect on the Interquartile Range (IQR) and the Range?
A.Both the IQR and the Range will increase.
B.The IQR will stay the same, but the Range will decrease significantly.
C.The IQR will decrease, and the Range will decrease significantly.
D.The Range will stay the same, but the IQR will decrease.
Challenging
A student's work to find outliers in {3, 20, 22, 24, 25, 40} is shown: Step 1: Median=(22+24)/2=23. Step 2: Lower half {3, 20, 22}, Q1=20. Step 3: Upper half {24, 25, 40}, Q3=25. Step 4: IQR=25-20=5. Step 5: Upper Bound = 25 + 1.5*5 = 32.5. Step 6: 40 is an outlier. In which step did the student make their first mistake?
A.Step 2: Q1 is incorrect.
B.Step 3: Q3 is incorrect.
C.Step 4: The IQR calculation is wrong.
D.There are no mistakes in the student's work.
Want to practice and check your answers?
Sign up to access all questions with instant feedback, explanations, and progress tracking.
Start Practicing Free