Page 99 - Modern Analytical Chemistry
P. 99

1400-CH04  9/8/99  3:54 PM  Page 82





              82     Modern Analytical Chemistry


                                              making the confidence interval too broad, or you may conclude that the analyte’s
                                              concentration is too small to detect accurately.*
                                                  A second example is also informative. When samples are obtained from a nor-
                                              mally distributed population, their values must be random. If results for several
                                              samples show a regular pattern or trend, then the samples cannot be normally dis-
                                              tributed. This may reflect the fact that the underlying population is not normally
                                              distributed, or it may indicate the presence of a time-dependent determinate error.
                                              For example, if we randomly select 20 pennies and find that the mass of each penny
                                              exceeds that of the preceding penny, we might suspect that the balance on which
                                              the pennies are being weighed is drifting out of calibration.


                                               4 E Statistical Analysis of Data

                                              In the previous section we noted that the result of an analysis is best expressed as a
                                              confidence interval. For example, a 95% confidence interval for the mean of five re-
                                              sults gives the range in which we expect to find the mean for 95% of all samples of
                                              equal size, drawn from the same population. Alternatively, and in the absence of de-
                                              terminate errors, the 95% confidence interval indicates the range of values in which
                                              we expect to find the population’s true mean.
                                                  The probabilistic nature of a confidence interval provides an opportunity to ask
                                              and answer questions comparing a sample’s mean or variance to either the accepted
                                              values for its population or similar values obtained for other samples. For example,
                                              confidence intervals can be used to answer questions such as “Does a newly devel-
                                              oped method for the analysis of cholesterol in blood give results that are signifi-
                                              cantly different from those obtained when using a standard method?” or “Is there a
                                              significant variation in the chemical composition of rainwater collected at different
                                              sites downwind from a coalburning utility plant?” In this section we introduce a
                          Values
                                              general approach to the statistical analysis of data. Specific statistical methods of
              (a)
                                              analysis are covered in Section 4F.

                                              4 E.1 Significance Testing
                                              Let’s consider the following problem. Two sets of blood samples have been collected
                                              from a patient receiving medication to lower her concentration of blood glucose.
                                              One set of samples was drawn immediately before the medication was adminis-
                          Values              tered; the second set was taken several hours later. The samples are analyzed and
              (b)                             their respective means and variances reported. How do we decide if the medication
                                              was successful in lowering the patient’s concentration of blood glucose?
                                                  One way to answer this question is to construct probability distribution curves
                                              for each sample and to compare the curves with each other. Three possible out-
                                              comes are shown in Figure 4.9. In Figure 4.9a, the probability distribution curves
                                              are completely separated, strongly suggesting that the samples are significantly dif-
                                              ferent. In Figure 4.9b, the probability distributions for the two samples are highly
                                              overlapped, suggesting that any difference between the samples is insignificant. Fig-
                          Values
                                              ure 4.9c, however, presents a dilemma. Although the means for the two samples ap-
              (c)
                                              pear to be different, the probability distributions overlap to an extent that a signifi-
              Figure 4.9                      cant number of possible outcomes could belong to either distribution. In this case
              Three examples of possible relationships  we can, at best, only make a statement about the probability that the samples are
              between the probability distributions for two
              populations. (a) Completely separate  significantly different.
              distributions; (b) Distributions with a great
              deal of overlap; (c) Distributions with some
              overlap.                        *The topic of detection limits is discussed at the end of this chapter.
   94   95   96   97   98   99   100   101   102   103   104