Page 22 -

P. 22

04-fore-xix-xxii-9780123814791
3:32
2011/6/1
#3
Page xxi
HAN

Foreword to Second Edition

We are deluged by data—scientiﬁc data, medical data, demographic data, ﬁnancial data,
and marketing data. People have no time to look at this data. Human attention has
become the precious resource. So, we must ﬁnd ways to automatically analyze the
data, to automatically classify it, to automatically summarize it, to automatically dis-
cover and characterize trends in it, and to automatically ﬂag anomalies. This is one
of the most active and exciting areas of the database research community. Researchers
in areas including statistics, visualization, artiﬁcial intelligence, and machine learning
are contributing to this ﬁeld. The breadth of the ﬁeld makes it difﬁcult to grasp the
extraordinary progress over the last few decades.
Six years ago, Jiawei Han’s and Micheline Kamber’s seminal textbook organized and
presented Data Mining. It heralded a golden age of innovation in the ﬁeld. This revision
of their book reﬂects that progress; more than half of the references and historical notes
are to recent work. The ﬁeld has matured with many new and improved algorithms, and
has broadened to include many more datatypes: streams, sequences, graphs, time-series,
geospatial, audio, images, and video. We are certainly not at the end of the golden age—
indeed research and commercial interest in data mining continues to grow—but we are
all fortunate to have this modern compendium.
The book gives quick introductions to database and data mining concepts with
particular emphasis on data analysis. It then covers in a chapter-by-chapter tour the
concepts and techniques that underlie classiﬁcation, prediction, association, and clus-
tering. These topics are presented with examples, a tour of the best algorithms for each
problem class, and with pragmatic rules of thumb about when to apply each technique.
The Socratic presentation style is both very readable and very informative. I certainly
learned a lot from reading the ﬁrst edition and got re-educated and updated in reading
the second edition.
Jiawei Han and Micheline Kamber have been leading contributors to data mining
research. This is the text they use with their students to bring them up to speed on

xxi

17 18 19 20 21 22 23 24 25 26 27