Page 55 - Geochemical Anomaly and Mineral Prospectivity Mapping in GIS
P. 55

54                                                              Chapter 3

























             Fig. 3-3. Histogram and EDA graphics (density  trace, jittered one-dimensional scatterplot,
             boxplot) depicting the empirical density distribution of the soil Fe data in Fig. 3-1.


             often used together with a histogram. That is because the visual impression one obtains
             about univariate data behaviour and structure from a histogram alone is influenced by
             the choice of number of classes for constructing a histogram. The combination of the
             three EDA graphics with a histogram provides better insight to univariate data behaviour
             and structure than from a histogram alone. Unlike a histogram, the three EDA graphics
             can readily indicate any ‘abnormalities’ in a univariate data set.
                A density trace is similar to a histogram, but it describes the empirical density
             distribution of univariate data in a much more realistic manner and its shape does not
             change significantly with change in the number of classes. The appropriate number of
             classes for constructing a  histogram can be  controlled  by constructing a jittered  one-
             dimensional scattergram, in which the univariate data are plotted at random positions
             across a narrow band (usually with a range of 0-1) orthogonal to the univariate data axis.
             Because a jittered one-dimensional scattergram is not based on classes of univariate data,
             it provides additional pieces information about data (i.e.,  structure,  behaviour, local
             densities, gaps, outliers) that should be portrayed by a histogram and density trace. A
             boxplot displays  information about the characteristics (e.g., central tendency, spread,
             etc.) of the empirical density distribution of a univariate data set, such that it is probably
             the most useful of the EDA graphical tools applied in geochemical data analysis and so it
             is given special attention here.

             Boxplot and 5-number statistics of univariate data

                A  boxplot is created by  first ordering  data values  from the  minimum  to the
             maximum, or vice versa (Fig. 3-4). The median value is then determined by counting
   50   51   52   53   54   55   56   57   58   59   60