Page 147 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R

P. 147

4.4 Inference on Two Populations 127

correlated and their correlation is high, one may contemplate the possibility of
discarding one of the variables, since a highly correlated variable only conveys
redundant information.
Let ρ represent the true value of the Pearson correlation mentioned in section
2.3.4. The correlation test is formalised as:

H 0: ρ = 0, H 1: ρ ≠ 0, for a two-sided test.

For a one-sided test the alternative hypothesis is:

H 1: ρ > 0 or ρ < 0.

Let r represent the sample Pearson correlation when the null hypothesis is
verified and the sample size is n. Furthermore, assume that the random variables
are normally distributed. Then, the (r.v. corresponding to the) following test
statistic:

n − 2
t = r , 4.6
*
−
1 r 2

has a Student’s t distribution with n – 2 degrees of freedom.
The Pearson correlation test can be performed as part of the computation of
correlations with SPSS and STATISTICA. It can also be performed using the
Correlation Test sheet of Tools.xls (see Appendix F) or the
Probability Calculator; Correlations of STATISTICA (see also
Commands 4.2).

Example 4.5
Q: Consider the variables PMax and T80 of the meteorological dataset ( Meteo )
for the “moderate” category of precipitation (PClass = 2) as defined in 2.1.2. We
then have n = 16 measurements of the maximum precipitation and the maximum
temperature during 1980, respectively. Is there evidence, at α = 0.05, of a negative
correlation between these two variables?
A: The distributions of PMax and T80 for “moderate” precipitation are reasonably
well approximated by the normal distribution (see section 5.1). The sample
correlation is r = –0.53. Thus, the test statistic is:

*
r = –0.53, n = 16 ⇒ t = –2.33.

*
Since t 14 . 0 , 05 = − 1. 76 , the value of t falls in the critical region ] –∞, –1.76];
therefore, the null hypothesis is rejected, i.e., there is evidence of a negative
correlation between PMax and T80 at that level of significance. Note that the
*
observed significance of t is 0.0176, below α.

142 143 144 145 146 147 148 149 150 151 152