Page 246 - Statistics for Dummies
P. 246
230
Part IV: Guesstimating and Hypothesizing with Confidence
I mean dropping below 30 or so) the second case is when the population
standard deviation, σ, is not known, and you have to estimate it using the
sample standard deviation, s. In both cases, you have less reliable informa-
tion on which to base your conclusions, so you have to pay a penalty for this
by using a distribution with more variability in the tails than a Z-distribution
has. Enter the t-distribution. (See Chapter 10 for all things t-distribution,
including its relationship with the Z.)
A hypothesis test for a population mean that involves the t -distribution is
called a t-test. The formula for the test statistic in this case is:
, where t is a value from the t-distribution with n–1 degrees
n-1
of freedom.
Note it is just like the test statistic for the large sample and/or normal distri-
bution case (see the section “Testing One Population Mean”), except σ is not
known, so you substitute the sample standard deviation, s, instead, and use a
t-value rather than a z-value.
Because the t-distribution has fatter tails than the Z-distribution, you get a
larger p-value from the t-distribution than one that the standard normal (Z-)
distribution would have given you for the same test statistic. A bigger p-value
means less chance of rejecting H . Having less data and/or not knowing the
o
population standard deviation should create a higher burden of proof.
Putting the t-test to work
Suppose a delivery company claims they deliver their packages in 2 days on
average, and you suspect it’s longer than that. The hypotheses are H : μ = 2
o
versus H : μ > 2. To test this claim, you take a random sample of 10 packages
a
and record their delivery times. You find the sample mean is days,
and the sample standard deviation is 0.35 days. (Because the population
standard deviation, σ, is unknown, you estimate it with s, the sample stan-
dard deviation.) This is a job for the t-test.
Because the sample size is small (n =10 is much less than 30) and the popula-
tion standard deviation is not known, your test statistic has a t-distribution. Its
degrees of freedom is 10 – 1 = 9. The formula for the test statistic (referred to
as the t-value) is:
To calculate the p-value, you look in the row in the t-table (in the appendix)
for df = 9. Your test statistic (2.71) falls between two values in the row for
df = 9 in the t-table: 2.26 and 2.82 (rounding to two decimal places). To calculate
3/25/11 8:14 PM
22_9780470911082-ch15.indd 230 3/25/11 8:14 PM
22_9780470911082-ch15.indd 230