ABSTRACT

Clustering is an important unsupervised machine learning method in data mining. For massive data sets, the time and space complexity of existing clustering algorithms becomes a bottleneck, which motivates research into parallelizing clustering algorithms.