Statistical Indices for Computational and Data Driven Class Discovery in Microarray Data

doi:10.1201/9781420086850-19

Chapter

Statistical Indices for Computational and Data Driven Class Discovery in Microarray Data

ABSTRACT

The problem of discovering new taxonomies (classiﬁcations of objects according to some natural relationships) from data has received considerable attention in the statistics and machine learning community. In this chapter, we are concerned with a particular type of taxonomy discovery, namely, cluster analysis, the discovery of distinct and nonoverlapping subpopulations within a larger population, the member items of each subpopulation sharing some common features or properties deemed relevant in the problem domain of study. This type of unsupervised analysis is of particular signiﬁcance in the emerging ﬁeld of functional genomics and microarray data analysis.