Weighting Method for Feature Selection in K-Means

doi:10.1201/9781584888796-19

ABSTRACT

The k-means type of clustering algorithms [13, 16] are widely used in realworld applications such as marketing research [12] and data mining due to their eﬃciency in processing large datasets. One unavoidable task of using k-means in real applications is to determine a set of features (or attributes). A common practice is to select features based on business domain knowledge and data exploration. This manual approach is diﬃcult to use, time consuming, and frequently cannot make a right selection. An automated method is needed to solve the feature selection problem in k-means.