Variable selection is intended to select the “best” subset of predictors. Several reasons for wanting to do this follow:

1. We want to explain the data in the simplest way. Redundant predictors should be removed. The principle of Occam’s Razor states that among several plausible explanations for a phenomenon, the simplest is best. Applied to regression analysis, this implies that the smallest model that fits the data is best.