ABSTRACT

Automatic video summarization requires characterizing video content and a means to rank or select a small subset of the video as summary output. Accurate characterization calls for techniques that incorporate audio, image, and language features from the video. Multimodal analysis combines these different features to achieve improved results over analysis of a single mode of data. The feature descriptions from individual modalities may be combined or used on their own to select video for summarization.