Page 59 - Petrology of Sedimentary Rocks
P. 59

The  value   t  is  found   in  a  table   with   the  levels   to  be  selected   (5%,   IO%,  20%,  etc.)
      plotted   against   the  “degrees   of  freedom”   (which   in  this   case   simply   means   the  number
     of  values   minus   I);  the   value   of  t  to  be  used   lies   at  the   intersection   of  our   chosen
     confidence   level   and  the  degrees   of  freedom.   For   example,   with   21  values,   at  the  5%
      (.05)   level,   t  =  2.09,   as  shown   in  t  table   on  page  59.


                                      Populations   and  Probability


            In  the   tests   that   follow   it  is  essential   that   we  understand   the   two   concepts   of
     “populations”   and  “probability.”

            Population   is  the   term   used   for   the   data   we  are  sampling;   if  we  are  measuring
     porosities,   the   “population”   that   we   are   sampling   is  the   vast   number   of   porosities
     present   in  every   cubic   inch   of  that   formation,   of  which   we  test   what   we  assume   are  a
     representative   few;   if  we  are  measuring   the  mica   content   of  a  bed,   the  “population”   is
     the  percent   of  mica   in  each   minute   part   of  that   bed;  if  we  fill   a  jar  with   500  black   and
      1,000   red  beans   and  proceed   blindfolded   to  pull   out   50  beans,   then   we  hope   that   our
     sample   of  50  is  representative   of  the  true   “population”--   i.e.,  the   1,500  total   beans.   The
     population   is  the   vast   amount   of  numerical   data   available,   of  which   we  take   only   a
     smal  I  sample.


      In  many   statistical   tests,   we  try   to  answer   the   question,   “what   are   the  odds   that   we
     could   have   obtained   as  great   a  difference   or  greater   by  chance   sampling   of  the  same
     population?”    This   gives   us  an  insight   whether   or  not  there   is  a  real   difference   in  the
     properties   of  the   two   formations   we  have   sampled,   or  whether   we  could   have   gotten
     just   as  large   a  difference   in,   say   chert   content,   by  chance   sampling   of   a  single
     formation   (a  “homogenous”   population).   Let’s   think   about   it  this  way.   We  have   a  square
     jar   and  a  round   jar,   each   filled   with   a  certain   mixture   of  1,000  black   and  white   beans.
     The   round   jar   has  a  population   with   a  certain   number   of  blacks,   the   square   jar   has  a
     different   proportion   of  blacks.   You  are  now  blindfolded   and  asked   to  pull  50  beans   from
     one  of  the  jars;   let’s  say  you  got  30  blacks   and  20  whites.   Now,   still   blindfolded,   you  are
     asked   to  reach   again   into  a  jar  (you  still   don’t   know   which   one  you  are  reaching   into)   and
     pick   50  more   beans.   This   time   you  get  22  blacks   and  28  whites.   Statistics   enables   us  to
     anwer   the   question,   “on  the  second   drawing,   did   I  take   from   the  same   jar   as  I  did  the
     first   time;   what   is  the   probability   that   I  drew   both   sets  of  beans   from   the   same   jar?”
     Or  phrased   differently,   “what   are   the  odds   that   I  drew   both   samples   of  50  from   the
     same   population   (i.e.,   the   same   jar?)”   This   is  essentially   what   we  do  when   we  obtain
     numerical   data   from   rocks.   We  take   only   a  small   sample   of   the   numerical   data
     available   in  the   formation,   then   sample   another   formation   and   see   if   there   is  any
     difference   between   the   two   formations--and   difference   in  the   population   of  beans   in
     the  jar,   so  to  speak.

           Probability.    Many    statistical   tables   have   a  critical   value   called   “P”   as  an
     important   part.   P  stands   for   probability--in   most   cases,   P,  represented   as  a  percent,
     stands   for   the   probability   of  a  certain   event   happening.   In  the  jar   experiment   above,
     after   running   through   the   computations   of  the   statistical   test,   we   enter   a  table   and
     come   out  with   a  certain   value   for   P.   In  this  case  let  us  say  P  came   out  to  be  .I0  (10%).
     This   means   that   if  we  had   repeatedly   sampled   the   same   jar,   only   once   in  every   ten
     times   would   we  have   gotten   as  large,   or  larger,   a  difference   as  we  did  on  the  two  draws.
     Or,   there   is  only   one   chance   in  ten   that   we   have   sampled   the   same   population.
     Although   it  is  not  technically   correct   to  say  it  this  way,   we  may  state   the  corollary   that
     there   is  a  90%  chance   that   we  have   sampled   two   different   jars.   Most   statisticians   do
     not  accept   an  experiment   as  significant   unless   P  reaches   the  5%  level   (.05  on  the  table,
     pages   59  and  60).



                                                       53
   54   55   56   57   58   59   60   61   62   63   64