Giáo trình

# Introductory Statistics

Mathematics and Statistics

## The Uniform Distribution

Tác giả: OpenStaxCollege

The uniform distribution is a continuous probability distribution and is concerned with events that are equally likely to occur. When working out problems that have a uniform distribution, be careful to note if the data is inclusive or exclusive.

The data in [link] are 55 smiling times, in seconds, of an eight-week-old baby.

 10.4 19.6 18.8 13.9 17.8 16.8 21.6 17.9 12.5 11.1 4.9 12.8 14.8 22.8 20 15.9 16.3 13.4 17.1 14.5 19 22.8 1.3 0.7 8.9 11.9 10.9 7.3 5.9 3.7 17.9 19.2 9.8 5.8 6.9 2.6 5.8 21.7 11.8 3.4 2.1 4.5 6.3 10.7 8.9 9.4 9.4 7.6 10 3.3 6.7 7.8 11.6 13.8 18.6

The sample mean = 11.49 and the sample standard deviation = 6.23.

We will assume that the smiling times, in seconds, follow a uniform distribution between zero and 23 seconds, inclusive. This means that any smiling time from zero to and including 23 seconds is equally likely. The histogram that could be constructed from the sample is an empirical distribution that closely matches the theoretical uniform distribution.

Let X = length, in seconds, of an eight-week-old baby's smile.

The notation for the uniform distribution is

X ~ U(a, b) where a = the lowest value of x and b = the highest value of x.

The probability density function is f(x) = $\frac{1}{b-a}$ for axb.

For this example, X ~ U(0, 23) and f(x) = $\frac{1}{23-0}$ for 0 ≤ X ≤ 23.

Formulas for the theoretical mean and standard deviation are

$\mu =\frac{a+b}{2}$ and $\sigma =\sqrt{\frac{{\left(b-a\right)}^{2}}{12}}$

For this problem, the theoretical mean and standard deviation are

μ = = 11.50 seconds and σ = = 6.64 seconds.

Notice that the theoretical mean and standard deviation are close to the sample mean and standard deviation in this example.

a. Refer to [link]. What is the probability that a randomly chosen eight-week-old baby smiles between two and 18 seconds?

a. Find P(2 < x < 18).

P(2 < x < 18) = (base)(height) = (18 – 2)$( 1 23 )$ = $( 16 23 )$.

b. Find the 90th percentile for an eight-week-old baby's smiling time.

b. Ninety percent of the smiling times fall below the 90th percentile, k, so P(x < k) = 0.90

$P(x

$(base)(height)=0.90$

$(k−0)( 1 23 )=0.90$

$k=( 23 )( 0.90 )=20.7$

c. Find the probability that a random eight-week-old baby smiles more than 12 seconds KNOWING that the baby smiles MORE THAN EIGHT SECONDS.

c. This probability question is a conditional. You are asked to find the probability that an eight-week-old baby smiles more than 12 seconds when you already know the baby has smiled for more than eight seconds.

Find P(x > 12|x > 8) There are two ways to do the problem. For the first way, use the fact that this is a conditional and changes the sample space. The graph illustrates the new sample space. You already know the baby smiled more than eight seconds.

Write a new f(x): f(x) = = $1 15$

for 8 < x < 23

P(x > 12|x > 8) = (23 − 12)$( 1 15 )$ = $( 11 15 )$

For the second way, use the conditional formula from Probability Topics with the original distribution X ~ U (0, 23):

P(A|B) =

For this problem, A is (x > 12) and B is (x > 8).

So, P(x > 12|x > 8) =

The amount of time, in minutes, that a person must wait for a bus is uniformly distributed between zero and 15 minutes, inclusive.

a. What is the probability that a person waits fewer than 12.5 minutes?

a. Let X = the number of minutes a person must wait for a bus. a = 0 and b = 15. X ~ U(0, 15). Write the probability density function. f (x) = = $1 15$ for 0 ≤ x ≤ 15.

Find P (x < 12.5). Draw a graph.

$P(x

The probability a person waits less than 12.5 minutes is 0.8333.

b. On the average, how long must a person wait? Find the mean, μ, and the standard deviation, σ.

b. μ = = = 7.5. On the average, a person must wait 7.5 minutes.

σ = $\sqrt{\frac{\left(b-a{\right)}^{2}}{12}}=\sqrt{\frac{\left(\mathrm{15}-0{\right)}^{2}}{12}}$ = 4.3. The Standard deviation is 4.3 minutes.

c. Ninety percent of the time, the time a person must wait falls below what value? This asks for the 90th percentile.

c. Find the 90th percentile. Draw a graph. Let k = the 90th percentile.

$P\left(x

$0.90=\left(k\right)\left(\frac{1}{15}\right)$

$k=\left(0.90\right)\left(15\right)=13.5$

k is sometimes called a critical value.

The 90th percentile is 13.5 minutes. Ninety percent of the time, a person must wait at most 13.5 minutes.

Suppose the time it takes a nine-year old to eat a donut is between 0.5 and 4 minutes, inclusive. Let X = the time, in minutes, it takes a nine-year old child to eat a donut. Then X ~ U (0.5, 4).

b. Find the probability that a different nine-year old child eats a donut in more than two minutes given that the child has already been eating the donut for more than 1.5 minutes.

The second question has a conditional probability. You are asked to find the probability that a nine-year old child eats a donut in more than two minutes given that the child has already been eating the donut for more than 1.5 minutes. Solve the problem two different ways (see [link]). You must reduce the sample space. First way: Since you know the child has already been eating the donut for more than 1.5 minutes, you are no longer starting at a = 0.5 minutes. Your starting point is 1.5 minutes.

Write a new f(x):

f(x) = $\frac{1}{4-1.5}$ = $\frac{2}{5}$ for 1.5 ≤ x ≤ 4.

Find P(x > 2|x > 1.5). Draw a graph.

P(x > 2|x > 1.5) = (base)(new height) = (4 − 2)$\left(\frac{2}{5}\right)$= ?

b. $\frac{4}{5}$

The probability that a nine-year old child eats a donut in more than two minutes given that the child has already been eating the donut for more than 1.5 minutes is $\frac{4}{5}$.

Second way: Draw the original graph for X ~ U (0.5, 4). Use the conditional formula

P(x > 2|x > 1.5) =

Ace Heating and Air Conditioning Service finds that the amount of time a repairman needs to fix a furnace is uniformly distributed between 1.5 and four hours. Let x = the time needed to fix a furnace. Then x ~ U (1.5, 4).

# Chapter Review

If X has a uniform distribution where a < x < b or axb, then X takes on values between a and b (may include a and b). All values x are equally likely. We write XU(a, b). The mean of X is $\mu =\frac{a+b}{2}$. The standard deviation of X is $\sigma =\sqrt{\frac{{\left(b-a\right)}^{2}}{12}}$. The probability density function of X is $f\left(x\right)=\frac{1}{b-a}$ for axb. The cumulative distribution function of X is P(Xx) = $\frac{x-a}{b-a}$. X is continuous.

The probability P(c < X < d) may be found by computing the area under f(x), between c and d. Since the corresponding area is a rectangle, the area may be found simply by multiplying the width and the height.

# Formula Review

X = a real number between a and b (in some instances, X can take on the values a and b). a = smallest X; b = largest X

X ~ U (a, b)

The mean is $\mu =\frac{a+b}{2}$

The standard deviation is

Probability density function: $f\left(x\right)=\frac{1}{b-a}$ for $a\le X\le b$

Area to the Left of x: P(X < x) = (xa)$\left(\frac{1}{b-a}\right)$

Area to the Right of x: P(X > x) = (bx)$\left(\frac{1}{b-a}\right)$

Area Between c and d: P(c < x < d) = (base)(height) = (dc)$\left(\frac{1}{b-a}\right)$

Uniform: X ~ U(a, b) where a < x < b

• pdf: $f\left(x\right)=\frac{1}{b-a}$ for a ≤ x ≤ b
• cdf: P(Xx) = $\frac{x-a}{b-a}$
• mean µ = $\frac{a+b}{2}$
• standard deviation σ $=\sqrt{\frac{{\left(b-a\right)}^{2}}{12}}$
• P(c < X < d) = (dc)$\left(\frac{1}{b–a}\right)$

# References

McDougall, John A. The McDougall Program for Maximum Weight Loss. Plume, 1995.

Use the following information to answer the next ten questions. The data that follow are the square footage (in 1,000 feet squared) of 28 homes.

 1.5 2.4 3.6 2.6 1.6 2.4 2 3.5 2.5 1.8 2.4 2.5 3.5 4 2.6 1.6 2.2 1.8 3.8 2.5 1.5 2.8 1.8 4.5 1.9 1.9 3.1 1.6

The sample mean = 2.50 and the sample standard deviation = 0.8302.

The distribution can be written as X ~ U(1.5, 4.5).

What type of distribution is this?

In this distribution, outcomes are equally likely. What does this mean?

It means that the value of x is just as likely to be any number between 1.5 and 4.5.

What is the height of f(x) for the continuous probability distribution?

What are the constraints for the values of x?

1.5 ≤ x ≤ 4.5

Graph P(2 < x < 3).

What is P(2 < x < 3)?

0.3333

What is P(x < 3.5| x < 4)?

What is P(x = 1.5)?

zero

What is the 90th percentile of square footage for homes?

Find the probability that a randomly selected home has more than 3,000 square feet given that you already know the house has more than 2,000 square feet.

0.6

Use the following information to answer the next eight exercises. A distribution is given as X ~ U(0, 12).

What is a? What does it represent?

What is b? What does it represent?

b is 12, and it represents the highest value of x.

What is the probability density function?

What is the theoretical mean?

six

What is the theoretical standard deviation?

Draw the graph of the distribution for P(x > 9).

Find P(x > 9).

Find the 40th percentile.

4.8

Use the following information to answer the next eleven exercises. The age of cars in the staff parking lot of a suburban college is uniformly distributed from six months (0.5 years) to 9.5 years.

What is being measured here?

In words, define the random variable X.

X = The age (in years) of cars in the staff parking lot

Are the data discrete or continuous?

The interval of values for x is ______.

0.5 to 9.5

The distribution for X is ______.

Write the probability density function.

f(x) = $\frac{1}{9}$ where x is between 0.5 and 9.5, inclusive.

Graph the probability distribution.

1. Sketch the graph of the probability distribution.
2. Identify the following values:
1. Lowest value for $\overline{x}$: _______
2. Highest value for $\overline{x}$: _______
3. Height of the rectangle: _______
4. Label for x-axis (words): _______
5. Label for y-axis (words): _______

Find the average age of the cars in the lot.

μ = 5

Find the probability that a randomly chosen car in the lot was less than four years old.

1. Sketch the graph, and shade the area of interest.
2. Find the probability. P(x < 4) = _______

Considering only the cars less than 7.5 years old, find the probability that a randomly chosen car in the lot was less than four years old.

1. Sketch the graph, shade the area of interest.
2. Find the probability. P(x < 4|x < 7.5) = _______
1. Check student’s solution.
2. $\frac{3.5}{7}$

What has changed in the previous two problems that made the solutions different?

Find the third quartile of ages of cars in the lot. This means you will have to find the value such that $\frac{3}{4}$, or 75%, of the cars are at most (less than or equal to) that age.

1. Sketch the graph, and shade the area of interest.
2. Find the value k such that P(x < k) = 0.75.
3. The third quartile is _______
1. Check student's solution.
2. k = 7.25
3. 7.25

# Homework

For each probability and percentile problem, draw the picture.

Births are approximately uniformly distributed between the 52 weeks of the year. They can be said to follow a uniform distribution from one to 53 (spread of 52 weeks).

1. X ~ _________
2. Graph the probability distribution.
3. f(x) = _________
4. μ = _________
5. σ = _________
6. Find the probability that a person is born at the exact moment week 19 starts. That is, find P(x = 19) = _________
7. P(2 < x < 31) = _________
8. Find the probability that a person is born after week 40.
9. P(12 < x|x < 28) = _________
10. Find the 70th percentile.
11. Find the minimum for the upper quarter.

A random number generator picks a number from one to nine in a uniform manner.

1. X ~ _________
2. Graph the probability distribution.
3. f(x) = _________
4. μ = _________
5. σ = _________
6. P(3.5 < x < 7.25) = _________
7. P(x > 5.67)
8. P(x > 5|x > 3) = _________
9. Find the 90th percentile.
1. X ~ U(1, 9)
2. Check student’s solution.
3. $f\left(x\right)=\frac{1}{8}$ where $1\le x\le 9$
4. five
5. 2.3
6. $\frac{15}{32}$
7. $333 800$
8. $2 3$
9. 8.2

According to a study by Dr. John McDougall of his live-in weight loss program at St. Helena Hospital, the people who follow his program lose between six and 15 pounds a month until they approach trim body weight. Let’s suppose that the weight loss is uniformly distributed. We are interested in the weight loss of a randomly selected individual following the program for one month.

1. Define the random variable. X = _________
2. X ~ _________
3. Graph the probability distribution.
4. f(x) = _________
5. μ = _________
6. σ = _________
7. Find the probability that the individual lost more than ten pounds in a month.
8. Suppose it is known that the individual lost more than ten pounds in a month. Find the probability that he lost less than 12 pounds in the month.
9. P(7 < x < 13|x > 9) = __________. State this in a probability question, similarly to parts g and h, draw the picture, and find the probability.

A subway train on the Red Line arrives every eight minutes during rush hour. We are interested in the length of time a commuter must wait for a train to arrive. The time follows a uniform distribution.

1. Define the random variable. X = _______
2. X ~ _______
3. Graph the probability distribution.
4. f(x) = _______
5. μ = _______
6. σ = _______
7. Find the probability that the commuter waits less than one minute.
8. Find the probability that the commuter waits between three and four minutes.
9. Sixty percent of commuters wait more than how long for the train? State this in a probability question, similarly to parts g and h, draw the picture, and find the probability.
1. X represents the length of time a commuter must wait for a train to arrive on the Red Line.
2. X ~ U(0, 8)
3. $f\left(x\right)=\frac{1}{8}$ where ≤ x ≤ 8
4. four
5. 2.31
6. $\frac{1}{8}$
7. $\frac{1}{8}$
8. 3.2

The age of a first grader on September 1 at Garden Elementary School is uniformly distributed from 5.8 to 6.8 years. We randomly select one first grader from the class.

1. Define the random variable. X = _________
2. X ~ _________
3. Graph the probability distribution.
4. f(x) = _________
5. μ = _________
6. σ = _________
7. Find the probability that she is over 6.5 years old.
8. Find the probability that she is between four and six years old.
9. Find the 70th percentile for the age of first graders on September 1 at Garden Elementary School.

Use the following information to answer the next three exercises. The Sky Train from the terminal to the rental–car and long–term parking center is supposed to arrive every eight minutes. The waiting times for the train are known to follow a uniform distribution.

What is the average waiting time (in minutes)?

1. zero
2. two
3. three
4. four

d

Find the 30th percentile for the waiting times (in minutes).

1. two
2. 2.4
3. 2.75
4. three

The probability of waiting more than seven minutes given a person has waited more than four minutes is?

1. 0.125
2. 0.25
3. 0.5
4. 0.75

b

The time (in minutes) until the next bus departs a major bus depot follows a distribution with f(x) = $\frac{1}{20}$ where x goes from 25 to 45 minutes.

1. Define the random variable. X = ________
2. X ~ ________
3. Graph the probability distribution.
4. The distribution is ______________ (name of distribution). It is _____________ (discrete or continuous).
5. μ = ________
6. σ = ________
7. Find the probability that the time is at most 30 minutes. Sketch and label a graph of the distribution. Shade the area of interest. Write the answer in a probability statement.
8. Find the probability that the time is between 30 and 40 minutes. Sketch and label a graph of the distribution. Shade the area of interest. Write the answer in a probability statement.
9. P(25 < x < 55) = _________. State this in a probability statement, similarly to parts g and h, draw the picture, and find the probability.
10. Find the 90th percentile. This means that 90% of the time, the time is less than _____ minutes.
11. Find the 75th percentile. In a complete sentence, state what this means. (See part j.)
12. Find the probability that the time is more than 40 minutes given (or knowing that) it is at least 30 minutes.

Suppose that the value of a stock varies each day from \$16 to \$25 with a uniform distribution.

1. Find the probability that the value of the stock is more than \$19.
2. Find the probability that the value of the stock is between \$19 and \$22.
3. Find the upper quartile - 25% of all days the stock is above what value? Draw the graph.
4. Given that the stock is greater than \$18, find the probability that the stock is more than \$21.
1. The probability density function of X is $\frac{1}{25-16}=\frac{1}{9}$.
P(X > 19) = (25 – 19) $\left(\frac{1}{9}\right)$ = $\frac{6}{9}$ = $\frac{2}{3}$.
2. P(19 < X < 22) = (22 – 19) $\left(\frac{1}{9}\right)$ = $\frac{3}{9}$ = $\frac{1}{3}$.
3. The area must be 0.25, and 0.25 = (width)$\left(\frac{1}{9}\right)$, so width = (0.25)(9) = 2.25. Thus, the value is 25 – 2.25 = 22.75.
4. This is a conditional probability question. P(x > 21| x > 18). You can do this two ways:
• Draw the graph where a is now 18 and b is still 25. The height is $\frac{1}{\left(25-18\right)}$ = $\frac{1}{7}$
So, P(x > 21|x > 18) = (25 – 21)$\left(\frac{1}{7}\right)$ = 4/7.
• Use the formula: P(x > 21|x > 18) =
= $\frac{P\left(x>21\right)}{P\left(x>18\right)}$ = $\frac{\left(25-21\right)}{\left(25-18\right)}$ = $\frac{4}{7}$.

A fireworks show is designed so that the time between fireworks is between one and five seconds, and follows a uniform distribution.

1. Find the average time between fireworks.
2. Find probability that the time between fireworks is greater than four seconds.

The number of miles driven by a truck driver falls between 300 and 700, and follows a uniform distribution.

1. Find the probability that the truck driver goes more than 650 miles in a day.
2. Find the probability that the truck drivers goes between 400 and 650 miles in a day.
3. At least how many miles does the truck driver travel on the furthest 10% of days?
1. P(X > 650) = $\frac{700-650}{700-300}=\frac{500}{400}=\frac{1}{8}$ = 0.125.
2. P(400 < X < 650) = $700−650 700−300 = 250 400$ = 0.625
3. 0.10 = $\frac{\text{width}}{\text{700}-\text{300}}$, so width = 400(0.10) = 40. Since 700 – 40 = 660, the drivers travel at least 660 miles on the furthest 10% of days.