Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
529976 | Journal of Visual Communication and Image Representation | 2012 | 12 Pages |
Key frame extraction is an important technique in video summarization, browsing, searching and understanding. In this paper, we propose a novel approach to extract the most attractive key frames by using a saliency-based visual attention model that bridges the gap between semantic interpretation of the video and low-level features. First, dynamic and static conspicuity maps are constructed based on motion, color and texture features. Then, by introducing suppression factor and motion priority schemes, the conspicuity maps are fused into a saliency map that includes only true attention regions to produce attention curve. Finally, after time-constraint cluster algorithm grouping frames with similar content, the frames with maximum saliency value are selected as key-frames. Experimental results demonstrate the effectiveness of our approach for video summarization by retrieving the meaningful key frames.
► Dynamic and static conspicuity maps are constructed based on low-level features. ► Fusion model based on suppression factor and motion priority schemes is proposed. ► Time-constraint cluster algorithm is designed to group frames. ► High quality semantic key frames are extracted by using visual attention model.