Page 191 - Computational Statistics Handbook with MATLAB
P. 191

178                        Computational Statistics Handbook with MATLAB


                                Zp2 = Z*proj2;
                                figure
                                plot(Zp1(:,1),Zp1(:,2),'k.'),title('Structure 1')
                                xlabel('\alpha^*'),ylabel('\beta^*')
                                figure
                                plot(Zp2(:,1),Zp2(:,2),'k.'),title('Structure 2')
                                xlabel('\alpha^*'),ylabel('\beta^*')
                             The results are shown in Figure 5.45 and Figure 5.46, where we see that pro-
                             jection pursuit did find two structures. The first structure has a projection
                             pursuit index of 2.67, and the second structure has an index equal to 0.572.




                                   u
                                   T
                                    ou
                                   T
                             Grand
                             Grand
                             Grand T  To ou  r r r
                             Grand
                                    r
                                    ou
                             The grand tour of Asimov [1985] is an interactive visualization technique that
                             enables the analyst to look for interesting structure embedded in multi-
                             dimensional data. The idea is to project the d-dimensional data to a plane and
                             to rotate the plane through all possible angles, searching for structure in the
                             data. As with projection pursuit, structure is defined as departure from nor-
                             mality, such as clusters, spirals, linear relationships, etc.
                              In this procedure, we first determine a plane, project the data onto it, and
                             then view it as a 2-D scatterplot. This process is repeated for a sequence of
                             planes. If the sequence of planes is smooth (in the sense that the orientation
                             of the plane changes slowly), then the result is a movie that shows the data
                             points moving in a continuous manner. Asimov [1985] describes two meth-
                             ods for conducting a grand tour, called the torus algorithm and the random
                             interpolation algorithm. Neither of these methods is ideal. With the torus
                             method we may end up spending too much time in certain regions, and it is
                             computationally intensive. The random interpolation method is better com-
                             putationally, but cannot be reversed easily (to recover the projection) unless
                             the set of random numbers used to generate the tour is retained. Thus, this
                             method requires a lot of computer storage. Because of these limitations, we
                             describe the pseudo grand tour described in Wegman and Shen [1993].
                              One of the important aspects of the torus grand tour is the need for a con-
                             tinuous space-filling path through the manifold of planes. This requirement
                             satisfies the condition that the tour will visit all possible orientations of the
                             projection plane. Here, we do not follow a space-filling curve, so this will be
                             called a pseudo grand tour. In spite of this, the pseudo grand tour has many
                             benefits:

                                • It can be calculated easily;
                                • It does not spend a lot of time in any one region;
                                • It still visits an ample set of orientations; and
                                • It is easily reversible.


                            © 2002 by Chapman & Hall/CRC
   186   187   188   189   190   191   192   193   194   195   196