Normal Distribution
   

   

 Math Help -> Statistics -> Normal Distribution 

Contents of this section:

Here's what you'll find in this section:

Normal Distribution, Sampling Distribution of a Statistic
bulletThe Normal Curve

The Normal Curve

The normal distribution (also called the Gaussian distribution or the famous ``bell-shaped'' curve) is the most important and widely used distribution in statistics. The normal distribution is characterized by two parameters, tex2html_wrap_inline2651 and tex2html_wrap_inline2693 , namely, the mean and variance of the population having the normal distribution. We will use the notation tex2html_wrap_inline3473 to denote a random variable which is distributed normally with mean tex2html_wrap_inline2651 and variance tex2html_wrap_inline2693 .

 
bulletHow Mean and Variance Relate to Curve
bullet

How Mean and Variance Relate to Curve

For the normal distribution, the mean, median, and mode are all equal to value where the curve is highest. The mean tex2html_wrap_inline2651 is sometimes called a location parameter because increasing (decreasing) it shifts the entire normal curve right (left). Likewise, tex2html_wrap_inline2697 is called a scale parameter because increasing (decreasing) it causes the curve to be wider (narrower). See the ``Z, t, Chi-square, F'' concept lab for examples. The value of tex2html_wrap_inline2697 is the distance from tex2html_wrap_inline2651 to the inflection points of the normal curve. (The inflection points are the locations where the curve changes from turning downward to turning upward.) Note that the total area under any continuous curve is equal to one.

In the figure below, we have drawn the standard normal (or Z) curve, that is, the normal curve for tex2html_wrap_inline3489 and tex2html_wrap_inline3491 . We have also placed on the plot the percent of the area under the curve between -3 and -2 (2.15%), between -2 and -1 (13.59%), and so on. Note how the curve is symmetric about 0, how around 68% of the area is between -1 and 1, and approximately 95.44% between -2 and 2. (Recall the empirical rule from Week 2.)

The plot of any normal curve will look basically the same as this figure except that the numbers on the horizontal axis instead of being centered at 0 will be centered at tex2html_wrap_inline2651 and the other numbers will be at multiples of tex2html_wrap_inline2697 away from tex2html_wrap_inline2651 . For example, the 1 would be replaced at tex2html_wrap_inline3511 , the 2 by tex2html_wrap_inline3513 , the -3 by tex2html_wrap_inline3517 and so on. The IQ histogram back in Week 1 gives an example of this where the curve is centered at tex2html_wrap_inline2651 = 100 and the labels on the horizontal axis are at multiples of tex2html_wrap_inline2697 = 15 away from 100.

 

 

Since there are an infinite number of normal distributions (there is one for any choice of tex2html_wrap_inline2651 and tex2html_wrap_inline2693 ), we will standardize all our normal distribution problems to a tex2html_wrap_inline3527 standard normal distribution. The notation we will use is tex2html_wrap_inline3529 . So for any tex2html_wrap_inline3473 ,

displaymath3533

where tex2html_wrap_inline3529 .

 

bulletFinding Areas and Quantiles for Z Curve
bullet

Finding Areas and Quantiles for Z Curve

Our method for finding probabilities for normal distribution problems will be as follows:
bulletConvert any normal distribution problem to a standard normal.
bulletLook up the probability in the standard normal (or Z) table (or use StataQuest).

EXAMPLE:\ Let X be the blood platelet count (measured in thousands per cc of blood) in humans, and suppose tex2html_wrap_inline3545 . What is the probability that an individual has a platelet count between 185.4 and 360.2? It is a good idea to always draw a picture. The solution is as follows. We have

displaymath3547

is given by

displaymath3549

which, in turn is P(-1.96<Z<1.96), which is

displaymath3553

 

We might also ask what the value of X is equivalent to the 99% percentile (99% of all individuals will have a platelet count below this value). To solve this we reverse the process used above:
bulletLook up the Z value corresponding to our probability in the standard normal table.
bulletConvert this value of Z to X by tex2html_wrap_inline3563 .
From the Z table, we find that z=2.33 has area to the left of 0.99. Now using the equation for Z we get

displaymath3571

which means that

displaymath3573

Thus, someone with a blood-platelet count of 376.72 has a higher count than 99% of the population.

 

 

Internet References

 

Related pages in this website

 

 

The webmaster and author of the Math Help site is Graeme McRae.
     [home]  [email]  [search]  [Links to Math Sites]  [Whiteboard]