Page 80 - Intermediate Statistics for Dummies
P. 80
07_045206 ch03.qxd 2/1/07 9:46 AM Page 59
Chapter 3: Building Confidence and Testing Models
Determining strength of
evidence with a p-value
If you want to know whether your data has the brawn to stand up against Ho,
you want to figure out the p-value and compare it to a prespecified cutoff, α
(typically 0.05). The p-value is a measure of the strength of your evidence
against Ho. You can calculate the p-value by doing the following:
1. Calculate the test statistic. See the preceding section for more info
on this.
2. Look up the test statistic on the appropriate table (such as the t-table,
A-1 in the Appendix).
3. Find the percentage of values on the table that fall beyond your test
statistic. This percentage is the p-value.
Suppose you’re conducting a hypothesis test and have already decided you 59
will reject Ho at level α = 0.05. You collect your data and find the test statistic
(see preceding section). If your test statistic is extremely high or extremely
low compared to other values on the table (whatever that table is), then you
reject Ho.
For example, say the cutoff value for rejecting Ho at a level α = 0.05 is 1.645,
where you’re testing for the mean of one population. If you get a test statistic
of 1.7, you reject Ho. If you get a test statistic of 2.7, you really reject Ho. That
is, you have more evidence against Ho with a test statistic of 2.7 than with a
test statistic of 1.7. The two p-values of 1.7 and 2.7 are what statisticians call
marginally significant and highly significant results respectively, to use proper
terms.
Your friend, α, is the cutoff for your p-value — and the star of this chapter.
(α is typically set at 0.05 — sometimes 0.10.) If your p-value is less than your
predetermined value of α, reject Ho, because you have sufficient evidence
against it. If your p-value is greater than or equal to α, you can’t reject Ho.
For example, if your p-value is 0.002, then your test statistic is so far away
from Ho that the chance of getting this result only by chance is only 2 out
of 1,000. So, you conclude that Ho is very likely to be false. However, if your
p-value turns out to be 0.30, then this result happens 30 percent of the time
anyway, so you see no red flags there, and you can’t reject Ho. You don’t have
enough evidence against it. It doesn’t mean Ho is true, but you don’t have the
evidence to say it’s false — a subtle, but important, difference.
When I compare the p-value to the α (the cutoff value), I like to think of a foot-
ball analogy, assuming that Ho is “the opposing team can’t make a touch-
down.” The burden is on the other team to show enough evidence to reject