76 3 Data Clustering

where P(A) is the proportion of times the k judges agree and P(E) is the proportion
of times that we would expect the k judges to agree by chance. If there is complete
agreement among the judges, then κ = 1 (P(A) = 1, P(E) = 0). If there is no agreement
among the judges other than what would be expected by chance, then κ = 0
(P(A)=P(E)). The values of P(A) and P(E) are computed as follows:
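
As an illustration of how these two proportions can be obtained in practice, the following minimal Python sketch uses the form commonly given for the kappa coefficient with k judges. It assumes that the ratings are summarised in a table of counts, where entry (i, j) is the number of judges assigning object i to category j; the function name `kappa_from_counts`, the variable names and the toy data are hypothetical and are not the Rocks data.

```python
def kappa_from_counts(counts):
    """Return (P(A), P(E), kappa) for a table of counts.

    counts[i][j] = number of judges that assigned object i to category j.
    Every row must sum to the same number of judges k (k >= 2).
    Assumed form (not taken from the text):
      p_j  = sum_i counts[i][j] / (n * k)
      P(E) = sum_j p_j ** 2
      P(A) = sum_ij counts[i][j] ** 2 / (n * k * (k - 1)) - 1 / (k - 1)
    """
    n = len(counts)                 # number of objects
    k = sum(counts[0])              # number of judges
    if k < 2 or any(sum(row) != k for row in counts):
        raise ValueError("each object must be rated by the same k >= 2 judges")
    m = len(counts[0])              # number of categories

    # Overall proportion of assignments falling in each category.
    p = [sum(row[j] for row in counts) / (n * k) for j in range(m)]
    p_e = sum(pj ** 2 for pj in p)

    # Proportion of agreeing judge pairs, averaged over the objects.
    sum_sq = sum(c * c for row in counts for c in row)
    p_a = sum_sq / (n * k * (k - 1)) - 1.0 / (k - 1)

    # Undefined (division by zero) if all assignments fall in one category.
    kappa = (p_a - p_e) / (1.0 - p_e)
    return p_a, p_e, kappa


# Toy data (hypothetical): 4 objects, 3 judges, 2 categories.
toy_counts = [[3, 0], [2, 1], [0, 3], [3, 0]]
p_a, p_e, kappa = kappa_from_counts(toy_counts)
print(f"P(A) = {p_a:.3f}, P(E) = {p_e:.3f}, kappa = {kappa:.3f}")
```

The count-table layout was chosen because it does not require knowing which judge gave which rating, only how many judges chose each category for each object.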

For the Rocks example these quantities are computed as P(A)=0.971 and
P(E)=0.418, resulting in a high value of κ, κ = 0.95. In order to test the significance
of κ, the following statistic, approximately normally distributed for large n with
zero mean and unit standard deviation, is used:
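
For reference, one form of such a statistic found in the nonparametric statistics literature on the kappa coefficient is sketched below. It is an assumption that this matches the expression intended here; n (number of objects), k (number of judges) and p_j (overall proportion of assignments to category j) are notation introduced for this sketch.

```latex
z = \frac{\kappa}{\sqrt{\operatorname{var}(\kappa)}}, \qquad
\operatorname{var}(\kappa) \approx \frac{2}{n\,k\,(k-1)} \cdot
\frac{P(E) - (2k-3)\,[P(E)]^{2} + 2(k-2)\sum_{j} p_j^{3}}{[\,1 - P(E)\,]^{2}}
```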

The value of z = 9.5 is obtained for the present example, allowing us to conclude
that the agreement is significantly high at the α = 0.01 significance level (z_0.01 = 2.32).
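
The arithmetic of this example can be checked with a few lines of Python. This is only a sketch: it assumes the usual chance-corrected form κ = (P(A) − P(E)) / (1 − P(E)), takes the values P(A), P(E) and z quoted above as given, and uses a one-sided test against the standard normal distribution. The variable names are illustrative.

```python
from statistics import NormalDist

# Values quoted in the text for the Rocks example.
p_a = 0.971   # observed proportion of agreement, P(A)
p_e = 0.418   # proportion of agreement expected by chance, P(E)
z = 9.5       # reported value of the test statistic

# Chance-corrected agreement (assumed form of the kappa coefficient).
kappa = (p_a - p_e) / (1.0 - p_e)

# One-sided critical value of the standard normal at alpha = 0.01.
alpha = 0.01
z_crit = NormalDist().inv_cdf(1.0 - alpha)

# The agreement is declared significant when z exceeds the critical value.
significant = z > z_crit

print(f"kappa = {kappa:.3f}")
print(f"critical value = {z_crit:.3f}, z = {z}, significant: {significant}")
```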

