ABSTRACT

Knowledge discovery and data mining (KDD) emerged as a rapidly growing interdisciplinary eld that merges together databases, statistics, machine learning, and related areas to discover and extract valuable knowledge in large volumes of data. With the rapid computerization in the past two decades, almost all organizations have collected huge amounts of data in their databases. These organizations need to understand their data or to discover useful knowledge as patterns or models from their data. Meeting this increasing need in the digitalized society, KDD has been becoming an attractive science and technology in both theory and practice. This chapter will provide basic concepts and methods of KDD as well as its typical applications. It starts by providing an overview of data, information, and

CONTENTS

4.1 Introduction ..................................................................................................58 4.2 Knowledge Discovery and Data Mining .................................................. 59

4.2.1 Denition and Examples ................................................................ 59 4.2.2 Knowledge Discovery Process ....................................................... 61 4.2.3 Model Selection in Knowledge Discovery ...................................63