ABSTRACT

This chapter introduces the concept of data reuse, a powerful data mining effect of the GenIQ Model. Data reuse is appending new variables, found when building a GenIQ Model, to the original dataset. The chapter provides two illustrations of data reuse as a powerful data mining technique. The GenIQ Model computer code is the GenIQ Profit Model equation. Data-reused variables, as found in GenIQ modeling, come about by the inherent mechanism of genetic programming (GP). GP starts out with an initial random set, a genetic population, of, say, 100 genetic models, defined by a set of predictor variables, numeric variables, and a set of functions. The chapter modifies the definition of data reuse to let the data mining prowess of GenIQ enhance the results of an already-built statistical regression model. It defines data reuse as appending new variables, found when building a GenIQ or any model, to the original dataset.