ABSTRACT

The ETL workflows that you developed for the Alpha version were completely dependent upon knowing the structure and content of the source datasets that you were loading. In Chapter 7, you finished filling in some of the pieces of the design that weren’t included in the fast-moving Alpha version, including the definition and content of the Metadata dimension on which this Beta version will be based. The approach to analyzing your source data and modeling the needed metadata for data sourcing in Chapter 8 has removed that knowledge from our view by creating a standardized mapping for every piece of source data to its warehouse target location. Chapter 8 ended with the defining of the queries needed to actually extract data from the source, those queries being the last places in your ETL subsystem where knowledge of the sources will be needed in the workflows.