Page 296 - Six Sigma Demystified
P. 296

276        Six SigMa  DemystifieD


                        for the process represented by the bottom set of control charts because the
                        process is out of control. By definition, then, the data are from multiple process
                        distributions.
                          Thus, if the process is out of control, then, by definition, a single distribution
                        cannot be fit to the data. Therefore, always use a control chart to determine
                        statistical control before attempting to fit a distribution (or determine capabil-
                        ity) for the data. Once statistical control is established, use goodness-of-fit tests
                        (described earlier) to determine if an assumed distribution provides a reason-
                        able approximation. Remember that a histogram provides only part of the pic-
                        ture and can never be used to assess process stability.



                 Hypothesis Testing


                        Hypothesis testing refers to a general class of problems where we seek to com-
                        pare, at a stated degree of confidence, a sample statistic against a standard value
                        or a statistic from another sample. Hypothesis testing is used in regression anal-
                        ysis and designed experiments to determine if factors included in the analysis
                        contribute to the variation observed in the data (as discussed under “Regression
                        Analysis” below). Hypothesis testing is a tool of statistical inference, where we
                        use sample statistics (such as a sample average  X  or a sample standard devia-
                        tion s) to infer properties of a population (such as its mean µ or standard devia-
                        tion σ).
                        When to Use


                        Key assumptions include

                          1. The population is both constant (it does not change over time) and homo-
                             geneous (a given sample is representative of the sample as a whole).
                          2. Samples are randomly collected from the population and representative
                             of the population under investigation. In surveys, low response rates typi-
                             cally would provide extreme value estimates (i.e., the subpopulation of
                             people who have strong opinions one way or the other).
                          3. The hypothesis tests shown below and their associated α and β risks, de-
                             pend on the normality of the population. If the population is significantly
                             nonnormal, then the tests are not meaningful, and nonparametric tests
                             should be used. Goodness-of-fit tests are used to test this assumption. Non-
                             parametric tests can be used if the populations are significantly nonnormal.
   291   292   293   294   295   296   297   298   299   300   301