ABSTRACT

This chapter discusses the clustering techniques that have been applied to a wide variety of image data, including their application to visual words learning. It explains the clustering algorithms used for video and audio data in tasks such as video summarization, video event detection, video story clustering, and music summarization. The chapter also discusses clustering of multimodal data, primarily image and text data. Visual words learning, which involves vector quantization, is among the earliest adaptations of clustering algorithms to multimedia applications. Video clustering algorithms have also been applied to the analysis and detection of abnormal or suspicious events in many surveillance applications. Story-level clustering of videos is an indispensable ingredient in discovering evolving stories organized around different event themes. One prominent emerging multimedia application involves the interaction between multimedia data and fast-developing social networks.