Page 58 - Geochemical Anomaly and Mineral Prospectivity Mapping in GIS
P. 58

Exploratory Analysis of Geochemical Anomalies                         57

           Classification of uni-element geochemical data
              Based on a boxplot, an exploration uni-element geochemical data set can usually be
           divided into five robust classes (Fig. 3-4): (1) minimum–LW; (2) LW–LH; (3) LH–UH;
           (4)  UH–UW; and (5)  UW–maximum. The  UIF is usually considered the threshold
           separating background values and anomalies (e.g., Bounessah and Atkin, 2003; Reimann
           et al., 2005), although the  UOF can also be used as the threshold (e.g.,  Yusta et al.,
           1998). However, an estimated value representing the UIF (equation (3.4)) may not be
           actually part of an exploration uni-element geochemical data set, so that outliers beyond
           the UW represent anomalies. Thus, data values in the UH–UW class (at most 25% of a
           data set) can be considered high background, data values in the LH–UH class (at most
           50% of a data set) are background, data values in the LW–LH class (at most 25% of a
           data set) are low background and data values in the minimum–LW class are extremely
           low background.
              Aside  from the boxplot-defined threshold  (e.g.,  UIF or  UW), a threshold can  be
           defined from  the EDA statistics as  median+2MAD. The  MAD is the median absolute
           deviation, which is estimated as the median of absolute deviations of all data values from
           the data median (Tukey, 1977):

            MAD =  median [ X −  median (X i  ] )                              (3.8)
                           i

           where the values in brackets are absolute differences between values X i and median of
           such values.  The  MAD is analogous to the  SDEV in classical statistics, so the EDA
           median+2MAD threshold is also analogous to the classical mean+2SDEV threshold.

           Standardisation of classified uni-element geochemical data
              When dealing with individual uni-element geochemical data sets showing presence
           of multiple populations (e.g., as shown in Fig. 3-3), analysis of only the whole of a uni-
           element geochemical data set is inadequate for recognition of anomalies that may be
           associated with individual populations in the data. It is imperative to subdivide a uni-
           element geochemical data set into subsets representing the various populations present.
           The empirical data distribution of a geochemical data set as depicted in a boxplot or in a
           cumulative probability plot (Tennant and White, 1959; Sinclair, 1974) and in equivalent
           Q-Q (quantile-quantile)  or Normal  Q-Q plots (Figs. 3-1  and 3-2)  can be useful  in
           graphical examination of  multiple populations and in defining breaks or inflection
           points, at  which to subdivide a  uni-element geochemical data set into subsets
           representing those populations. Alternatively, if populations  present in a uni-element
           geochemical data set are considered to be strongly related to certain geogenic variables
           (e.g., lithology) that have also been recorded  during the geochemical sampling, then
           individual uni-element geochemical data sets may be subdivided into subsets according
           to such variables.
              For example, Fig. 3-5A shows boxplots of subsets of the soil Fe data according to
           rock type at the sample sites. Comparing and contrasting these boxplots of subsets of the
   53   54   55   56   57   58   59   60   61   62   63