Page 240 - Probability and Statistical Inference
P. 240

4. Functions of Random Variables and Sampling Distribution  217

                              Francis Galton introduced a numerical measure, r, which he termed “re-
                           version” in a lecture at the Royal Statistical Society on February 9, 1877 and
                           later called “regression”. The term “cor-relation” or “correlation” probably
                           appeared first in Galton’s paper to the Royal Statistical Society on December
                           5, 1888. At that time, “correlation” was defined in terms of deviations from
                           the median instead of the mean. Karl Pearson gave the definition and calcula-
                           tion of correlation as in (4.6.7) in 1897. In 1898, Pearson and his collabora-
                           tors discovered that the standard deviation of r happened to be
                           when n was large. “Student” derived the “probable error of a correlation
                           coefficient” in 1908. Soper (1913) gave large sample approximations for the
                           mean and variance of r which were better than those proposed earlier by
                           Pearson. Refer to DasGupta (1980) for some of the historical details.
                              The unsolved problem of finding the exact pdf of r for normal variates
                           came to R. A. Fisher’s attention via Soper’s 1913 paper. The pdf of r was
                           published in the year 1915 by Fisher for all values of ρ ∈ (–1, 1). Fisher, at the
                           age of 25, brilliantly exploited the n-dimensional geometry to come up with
                           the solution, reputedly within one week. Fisher’s genius immediately came
                           into limelight. Following the publication of Fisher’s results, however, Karl
                           Pearson set up a major cooperative study of the correlation. One will notice
                           that in the team formed for this cooperative project [Soper et al. (1917)]
                           studying the distribution of the sample correlation coefficient, the young Fisher
                           was not included. This happened in spite of the fact that Fisher was right
                           there and he already earned quite some fame. Fisher felt hurt as he was left
                           out of this project. One thing led to another. R. A. Fisher and Karl Pearson
                           continued criticizing each other even more as each held on to his philosophi-
                           cal stand.
                              We will merely state the pdf of r when ρ = 0. This pdf is given by




                           where                                    for n ≥ 3. Using (4.6.9) and
                           some simple transformation techniques, one can easily derive the following
                           result:






                           The verification of the claim in (4.6.10) is left as the Exercise 4.6.9. Fisher
                           (1915) also gave the exact pdf of r in the form of an infinite power series for
                           all values of ρ ∈ (–1, 0) ∪ (0, 1).
   235   236   237   238   239   240   241   242   243   244   245