Page 255 - Computational Statistics Handbook with MATLAB
P. 255

Chapter 7: Data Partitioning                                    243


                             the importance of looking at scatterplots, because it is obvious from the plots
                             that the relationships between the variables are not similar. The scatterplots
                             are shown in Figure 7.3.
                                % Here is another example.
                                % We have 4 data sets with essentially the same
                                % correlation coefficient.
                                % The scatterplots look very different.
                                % When this file is loaded, you get four sets
                                % of x and y variables.
                                load anscombe
                                % Do the scatterplots.
                                subplot(2,2,1),plot(x1,y1,'k*');
                                subplot(2,2,2),plot(x2,y2,'k*');
                                subplot(2,2,3),plot(x3,y3,'k*');
                                subplot(2,2,4),plot(x4,y4,'k*');
                             We now determine the jackknife estimate of bias and standard error for  ρ ˆ
                             using csjack.

                                % Note that 'corr' is something we wrote.
                                [b1,se1,jv1] = csjack([x1,y1],'corr');
                                [b2,se2,jv2] = csjack([x2,y2],'corr');
                                [b3,se3,jv3] = csjack([x3,y3],'corr');
                                [b4,se4,jv4] = csjack([x4,y4],'corr');
                             The jackknife estimates of bias are:
                                b1 = -0.0052
                                b2 =  0.0008
                                b3 =  0.1514
                                b4 =  NaN
                             The jackknife estimates of the standard error are:
                                se1 = 0.1054
                                se2 = 0.1026
                                se3 = 0.1730
                                se4 = NaN
                             Note that the jackknife procedure does not work for the fourth data set,
                             because when we leave out the last data point, the correlation coefficient is
                             undefined for the remaining points.


                              The jackknife method is also described in the literature using pseudo-val-
                             ues. The jackknife pseudo-values are given by


                                              )                i – ()
                                                                          ,
                                                                             ,
                                                     –
                                             T i =  nT ( n –  1)T    i =  1 … n  ,         (7.12)
                            © 2002 by Chapman & Hall/CRC
   250   251   252   253   254   255   256   257   258   259   260