ABSTRACT

Introduction

Data Acquisition and Integration is a name given to the set of applications that populate a data warehouse (Figure 6.1). This process consists of three main functions.

Extract: Otherwise known as Data Acquisition, this function reaches into a source system to retrieve data. The data yielded by this function is known as Source Data.

Transform: The first half of Data Integration, this function inspects, cleanses, and conforms Source Data to the needs of a data warehouse. The data yielded by this function is known as Load Data.

Load: The second half of Data Integration, this function updates a data warehouse using the data provided in the Load Data.
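The three functions above can be sketched as a small pipeline. This is a minimal illustration only, not the design the chapter prescribes: it assumes both the source system and the data warehouse are in-memory SQLite databases, and the table names (orders, fact_orders), the column names, and the cleansing rules are all hypothetical.

```python
import sqlite3


def extract(source_conn):
    """Extract (Data Acquisition): retrieve rows from the source system.

    The returned rows play the role of Source Data.
    """
    return source_conn.execute("SELECT id, name, amount FROM orders").fetchall()


def transform(source_data):
    """Transform: inspect, cleanse, and conform Source Data.

    The returned rows play the role of Load Data. The rules here are
    illustrative assumptions, not rules taken from the text.
    """
    load_data = []
    for row_id, name, amount in source_data:
        # Inspect: skip rows that are missing required values.
        if name is None or amount is None:
            continue
        # Cleanse and conform: trim and upper-case names, store amounts as numbers.
        load_data.append((row_id, name.strip().upper(), round(float(amount), 2)))
    return load_data


def load(warehouse_conn, load_data):
    """Load: update the data warehouse using the Load Data."""
    warehouse_conn.executemany(
        "INSERT OR REPLACE INTO fact_orders (order_id, customer, amount) "
        "VALUES (?, ?, ?)",
        load_data,
    )
    warehouse_conn.commit()


# Hypothetical source system with a few untidy rows.
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE orders (id INTEGER, name TEXT, amount TEXT)")
source.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "  acme ", "10.50"), (2, None, "3.00"), (3, "Globex", "7")],
)

# Hypothetical data warehouse table.
warehouse = sqlite3.connect(":memory:")
warehouse.execute(
    "CREATE TABLE fact_orders ("
    "order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
)

source_data = extract(source)        # Source Data
load_data = transform(source_data)   # Load Data
load(warehouse, load_data)

rows = warehouse.execute(
    "SELECT order_id, customer, amount FROM fact_orders ORDER BY order_id"
).fetchall()
print(rows)
```

In this sketch the row with a missing name is rejected during Transform, so only two of the three source rows reach the warehouse. Keeping each function separate mirrors the hand-off described above: Extract yields Source Data, Transform yields Load Data, and Load applies that data to the warehouse.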