Page 23 - Applied Statistics Using SPSS, STATISTICA, MATLAB and R
P. 23

2        1 Introduction


              In the case of the body fall there is a law that allows the exact computation of
           one  of the variables  h or  t (for given  h 0 and  g) as a function of the other one.
           Moreover,  if we repeat  the body-fall  experiment under identical conditions,  we
           consistently obtain the same results,  within the precision of the measurements.
           These are the attributes  of deterministic  data:  the  same data will be obtained,
           within the  precision  of the  measurements, under repeated experiments in well-
           defined conditions.
              Imagine now  that we were  dealing  with  Stock Exchange data, such  as, for
           instance, the daily share value throughout one year of a given company. For such
           data there is no known law to describe how the share value evolves along the year.
           Furthermore, the possibility of experiment repetition with identical results does not
           apply here. We are, thus, in presence of what is called random data.
              Classical examples of random data are:

              −  Thermal noise generated in electrical resistances, antennae, etc.;
              −  Brownian motion of tiny particles in a fluid;
              −  Weather variables;
              −  Financial variables such as Stock Exchange share values;
              −  Gambling game outcomes (dice, cards, roulette, etc.);
              −  Conscript height at military inspection.

              In  none of these examples can a precise mathematical law describe the data.
           Also, there is no possibility of obtaining the same data in repeated experiments,
           performed under similar conditions. This is mainly due  to the  fact that several
           unforeseeable or immeasurable causes play a role in the generation of such data.
           For instance, in the case of the Brownian motion, we find that, after a certain time,
           the trajectories followed by several particles that have departed from exactly the
           same point, are completely different among them. Moreover it is found that such
           differences largely exceed the precision of the measurements.
              When dealing with a random dataset, especially if it relates to the temporal
           evolution of some variable, it is often convenient to consider such dataset as one
           realization (or one instance) of a set (or ensemble) consisting of a possibly infinite
           number of realizations of  a generating process. This  is the so-called  random
           process (or  stochastic process, from the Greek “stochastikos” = method or
           phenomenon composed of random parts). Thus:

              −  The wandering voltage signal  one  can  measure in an  open electrical
                 resistance is an instance of a thermal noise process (with an ensemble of
                 infinitely many continuous signals);
              −  The succession of face values when tossing n times a die is an instance of a
                 die tossing process (with an ensemble of finitely many discrete sequences).
              −  The trajectory of a tiny particle in a  fluid is an instance of a Brownian
                 process (with an ensemble of infinitely many continuous trajectories);
   18   19   20   21   22   23   24   25   26   27   28