ABSTRACT

Data mining is the art and science of discovering and extracting potentially useful, unexpected, and understandable knowledge from large data sets. Data mining is also referred to as information discovery, information harvesting, data archaeology, induction of knowledge from databases, and knowledge extraction. The process of data mining consists of determining certain comprehensible attributes and their values from large amounts of mostly unsupervised data in some application domain. Data mining and knowledge discovery have several applications. Some of these are in medicine and health care, genomic data, weather prediction, sensor data, electronic commerce like personal profile marketing, security, fraud detection, multimedia documents, and several more scientific and engineering applications. Data mining consists of characterizing collected data by fitting data to certain models. Data mining techniques which use association rules discover useful and interesting associations among members of large data sets. Algorithms which implement sequential data mining generally deal with categorical patterns.