Page 32 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 32

1.4 Probabilities and Distributions   11


           Probability, since for the first time, mathematical grounds were established and the
           application of probability to statistics was presented. The notion of probability is
           originally associated with the notion of frequency of occurrence of one out of k
           events in a sequence of trials, in which each of the events can occur by pure
           chance.
              Let us assume a sample dataset, of size n, described by a discrete variable, X.
           Assume further that there are k distinct values x i of X each one occurring n i times.
           We define:

           –  Absolute frequency of x i:    n i ;
                                                         n            k
                                                                        n .
           –  Relative frequency (or simply frequency of x i):     f  =  i  with  n =  ∑ i
                                                      i
                                                          n           = i 1
              In the classic frequency interpretation, probability is considered a limit, for large
           n,  of the relative frequency of an event:  P i  ≡ P (X  = x i ) =  lim n → ∞  f i  [ ∈  ] 1 , 0  . In
           Appendix  A,  a more rigorous  definition  of  probability  is presented, as well as
           properties of the convergence of such a limit to the probability of the event (Law of
           Large Numbers), and the justification for computing  (XP  =  x i  )  as the  ratio of the
                                                                     “
                                                                ”
           number of favourable events over the number of possible events  when the event
           composition  of the random experiment is  known  beforehand.  For instance, the
           probability of obtaining two heads when tossing two coins is ¼ since only one out
           of the four possible events (head-head, head-tail, tail-head, tail-tail) is favourable.
           As exemplified in Appendix A, one often computes probabilities of events in this
           way, using enumerative and combinatorial techniques.
              The values of  P i constitute the  probability function  values of the random
           variable X, denoted  P(X). In the case the discrete random variable is an ordinal
           variable the accumulated sum of  P i is called the  distribution function, denoted
           F(X). Bar graphs are often used to display the values of probability and distribution
           functions of discrete variables.
              Let us again consider the classification data of Example 1.2, and assume that the
           frequencies  of the classifications are  correct estimates of the  respective
           probabilities. We will then have the  probability and distribution  functions
           represented in Table 1.5 and Figure 1.5. Note that the probabilities add up to 1
           (total certainty) which is the largest value  of the monotonic increasing function
           F(X).

           Table 1.5. Probability and distribution functions for Example 1.2, assuming that
           the frequencies are correct estimates of the probabilities.
                x i        Probability Function P(X)  Distribution Function F(X)
                1                   0.06                       0.06
                2                   0.20                       0.26
                3                   0.24                       0.50
                4                   0.30                       0.80
                5                   0.20                       1.00
   27   28   29   30   31   32   33   34   35   36   37