Page 214 - Geochemical Remote Sensing of The Sub-Surface
P. 214

Light hydrocarbons for petroleum and gas prospecting                  191

           concentrations,  regardless  of location.  A  magnitude  threshold,  or series  of thresholds,  is
           chosen  and  those  locations  with  values  above  this  threshold  (anomalously-high
           concentrations)  are  identified  on  a  map.  The  second  technique  focuses  on  the  spatial
           clustering  of anomalous  stations.  This  is  accomplished  by  the  identification  of regions
           where  the  number  of  stations  with  magnitudes  above  a  threshold  is  statistically
           significant.
              The  traditional  method  of identifying  a  magnitude  threshold  has  been  accomplished
           by a variety of techniques.  These  include:  (1) the mean plus two standard deviations of a
           normally-distributed   data  set;  (2)  arbitrarily  selecting  the  90 th  percentile  or  95 th
           percentile,  etc., of the data; (3) identifying the inflection point on a cumulative  frequency
           plot that deviates from a straight line (Sinclair,  1976).
              In  the  opinion  of  the  authors  it  is  dangerous  to  select  any  hard-and-fast  rule  for
           defining an anomalous population,  although the  approach  of Sinclair  (1976)  is the  most
           appropriate  for a mixed mode data  set.  Sample populations  should be  normal,  or at least
           log normal, for many of the statistical tests to be valid, and bias in sample sites should be
           avoided  if  possible.  Ideally  a  training  set  made  up  of  a  data  subset  with  known
           hydrocarbon potential  should be employed.  This gives a means to tie-in  data to a known
           feature,  whether  it  be  a  source  bed,  a  reservoir  or  a  barren  area.  Once  the  results  are
           available,  a first step  is to construct  histograms to  determine  the  spread  of the  data.  The
           data  can  then  be  plotted  on  cumulative  frequency  plots  to  determine  the  different
           populations.  Scatter  plots  of key  components,  such  as  methane  versus  propane,  or  iso-
           butane  versus  normal  butane,  often  yield  multiple  trends  for multiple  populations  in  the
           data.  Pearson  correlation  analysis also  yields useful  information  on  the  "cleanliness"  of
           the  data,  with  single  populations  generally  showing  a  high  degree  of  inter-correlation.
           Filtering  or  screening  the  data  according  to  composition  prior  to  applying  statistics  is
           also an effective means of determining areas of favourable potential.
              The  method  of  identifying  regions  of  anomalously-high  leakage  by  clustering
           (Dickinson  and  Matthews,  1993)  is  accomplished  by  first  identifying  a  magnitude
           threshold  and  a  search  area.  The  magnitude  threshold,  which  is  somewhat  arbitrary,  is
           used  to  transform  the  distribution  of magnitudes  into  a  binomial  population  (above  the
           threshold  "heads"  and  below  the  threshold  "tails").  The  size  of  the  search  area  (the
           "cell")  is such that it includes 20 or more sample stations regardless of its location within
           the surveyed area.  Once these parameters are chosen,  the cell  is placed at one position on
           the  map,  usually  in  one  comer,  and  the  percentage  of  heads  and  the  total  number  of
           stations within the cell are recorded. The cell is then translated to a new location and the
           same  parameters  are  recorded.  This  process  is  repeated  until  the  entire  survey area  has
           been  examined.  Because  the  properties  of  the  binomial  distribution  are  well  known,
           statistical  tests  of the  chance  of a particular cell  having a particular  percentage  of heads
           can  be  made  and  probability  maps  contoured.  Thus,  regions  of  anomalously-high
           frequency  of  magnitudes  above  the  threshold  can  be  identified,  and  their  chance  of
           arising due to random processes,  instead  of focused leakage,  can be estimated.  There  is,
   209   210   211   212   213   214   215   216   217   218   219