ABSTRACT

We used the function cutree() to obtain a clustering formed by 30 groups of variables. We then checked how many variables (genes) belong to each cluster. Based on this clustering we can create sets of predictors by randomly selecting one variable from each cluster. The reasoning is that members of the same cluster will be similar to each other and thus somehow redundant.