ABSTRACT

In the last chapter, we described the supervised learning setting where we had observations with given class labels. In these cases, we know that the data come from different groups or classes, and we know how many groups are represented by the data. However, in many situations, we do not know the class membership for our observations at the start of our analysis. These are called unsupervised learning or clustering applications.