ABSTRACT

Executive summary: Success in software development as a business relies on a combination of processes (methodologies), technologies employed (toolsets), and domain knowledge related to the nature and uses of the software being developed. Legacy data mining is partially technology and partially application domain specific—it refers to “knowledge discovery” derived from databases and data warehouses. Data mining, in general, is about finding information by identifying patterns, associations, and relationships within large sets of data. Some of the raw data exists in readily accessible form within relational databases and data warehouses, while other data may exist in a less readily accessible form within legacy application systems. Data mining has grown rapidly and continues to do so. An awareness of data mining approaches and potential uses may be relevant when developing many different kinds of software systems.