Contributor: Lynn Ellis. Lesson ID: 13757
To begin to understand a data set, statisticians look at center, spread, and shape. This lesson will explore variance and standard deviation as measures of spread.
He thought he could cross because it was only a foot and a half deep on average.
Clearly knowing the average depth of the river didn't give that statistician enough information.
Maybe there is a sharp drop-off in the middle of the river and then a very shallow sandbar. Consider these two scenarios:
The average depth of the river is 1.5 feet.
However, with a depth of 3.5 feet at one point in the river, the statistician will get pretty wet. That's because the data are fairly spread out.
Again, the average depth of the river is 1.5 feet.
However, he won't get as wet because there is much less spread to the data.
We need more than just a measure of average, called mean in statistics, to have enough information about a set of data.
This is where variance and standard deviation come in. They are two measures of variation, closely tied together as you will see, that give information about the spread of data around the mean.
Begin with the idea of variation in data as shown in Variability (Statistics) from Lydia Flynn:
With that introduction to the idea of variability, let's take it a step further.
We can talk about variability by talking about the deviations of each data point, but for a large data set, that's a lot of numbers.
What would be the most helpful is a single number that puts all of those deviations together. We have one! It's called the standard deviation.
Simply put, the standard deviation represents a kind of average of all the deviations within the data set.
Here are the steps to calculating a standard deviation from a data set:
Make sure you have those steps written in your notes, and then watch Finding a Sample Standard Deviation from Parabola Magic:
Let's go back to our statistician crossing the river.
In the first scenario, where the depths were much more spread out, the standard deviation is 1.43 feet and the variance is 2.04.
In the second scenario, the standard deviation is 0.40 and the variance is 0.16.
If he had the information about the standard deviation and variance of the data, the statistician would have sufficient information to avoid unexpected depths.
Now that you know what a standard deviation is and how to calculate one, go to the Got It! section to practice your new skills.