Page 26 - Mechanical Engineers' Handbook (Volume 2)
P. 26

3 Statistics in the Measurement Process  15


                                         n                      t(v,0.10) a      t/ n
                                         2          1            6.31            3.65
                                         3          2            2.92            1.46
                                         5          4            2.13            0.87

                                         a From a t-statistic table.  9
                           Thus a sample of three tires is sufficient to establish the tire life within  10% at a 90%
                           level of confidence.


            3.7  Goodness of Fit
                           Statistical methods can be used to fit a curve to a given data set. In general, the least-squares
                           principle is used to minimize the sum of the squares of deviations away from the curve to
                           be fitted. The deviations from an assumed curve y   ƒ(x) are due to errors in y,in x,or in
                           both y and x. In most cases the errors in the independent variable x are much smaller than
                           the dependent variable y. Therefore, only the errors in y are considered for the least-squares
                           curve. The goodness of fit of an assumed curve is defined by the correlation coefficient r,
                           where
                                                      (y   y) 2          (y   y) 2
                                                                           i
                                              r       (y   y) 2       1     (y   y) 2
                                                        i
                                                                           i
                                                         ˆ   2 y,x
                                                     1    ˆ   2 y                               (31)
                           where  (y   y) 2    total variation (variation about mean)
                                   i
                                        2
                                 (y   y)   unexplained variation (variation about regression)
                                   i
                                 (y   y) 2    explained variation (variation based on assumed regression equation)
                                       ˆ   y    estimated population standard deviation of y variable
                                      ˆ   y,x    standard error of estimate of y on x
                              When the correlation coefficient r is zero, the data cannot be explained by the assumed
                           curve. However, when r is close to  1, the variation of y with respect to x can be explained
                           by the assumed curve and a good correlation is indicated between the variables x and y.
                              The probabilistic statement for the goodness-of-fit test is given by
                                                     P[r    r]   	   1                          (32)
                                                       calc
                           where r calc  is calculated from Eq. (31) and the null and alternate hypotheses are as follows:

                              H : No correlation of assumed regression equation with data.
                                0
                              H : Correlation of regression equation with data.
                                1
                           The goodness-of-fit for a straight line is determined by comparing r calc  with the value of r
                           obtained at n   2 degrees of freedom at a selected confidence level   from tables. If r calc
                           r, the null hypothesis is rejected and a significant fit of the data within the confidence level
                           specified is inferred. However, if r calc    r, the null hypothesis cannot be rejected and no
                           correlation of the curve fit with the data is inferred at the chosen confidence level.

                           Example 5  Goodness-of-Fit Test. The given x–y data were fitted to a curve y   a   bx
                           by the method of least-squares linear regression. Determine the goodness of fit at a 5%
                           significance level (95% confidence level):
   21   22   23   24   25   26   27   28   29   30   31