Article ID Journal Published Year Pages File Type
532458 Journal of Visual Communication and Image Representation 2014 11 Pages PDF
Abstract

•An event classification framework using temporal context of multimodal features.•Link low-level features and high-level semantics by mid-level interval features.•Explore full temporal relations among mid-level features from multiple modalities.•Propose a co-occurrence symbol transformation method for HMM classification.•Provide more promising baseball event classification results.

Semantic high-level event recognition of videos is one of most interesting issues for multimedia searching and indexing. Since low-level features are semantically distinct from high-level events, a hierarchical video analysis framework is needed, i.e., using mid-level features to provide clear linkages between low-level audio-visual features and high-level semantics. Therefore, this paper presents a framework for video event classification using temporal context of mid-level interval-based multimodal features. In the framework, a co-occurrence symbol transformation method is proposed to explore full temporal relations among multiple modalities in probabilistic HMM event classification. The results of our experiments on baseball video event classification demonstrate the superiority of the proposed approach.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, ,