Article ID: 528362
Journal: Image and Vision Computing
Published Year: 2016
Pages: 11
File Type: PDF
Abstract

• TFK: a kernel framework for comparing sequences of arbitrary length.
• Some complex activities are defined by the order of their sub-actions.
• The new kernel framework improves results in complex activity recognition.
• Combining several levels of granularity in temporal divisions reduces clutter.

This work addresses the challenging task of activity recognition in unconstrained videos. Standard methods encode low-level video features using Fisher Vectors or Bag of Features. However, these approaches represent every sequence as a single fixed-dimensional vector that lacks any long-term temporal information, which may be important for recognition, especially of complex activities. This work proposes a novel framework with two main technical novelties: first, a video encoding method that maintains the temporal structure of sequences, and second, a Time Flexible Kernel that allows comparison of sequences of different lengths and random alignment. Results on challenging benchmarks and comparison with previous work demonstrate the applicability and value of our framework.
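The general idea of comparing sequences of different lengths while keeping temporal order can be illustrated with a small sketch. This is not the paper's Time Flexible Kernel, whose definition is not given in the abstract. It assumes, purely for illustration, that each video is split into contiguous segments, that each segment is summarized by the mean of its frame features, and that pairs of segments are weighted by a Gaussian on the difference of their normalized temporal positions. The names `segment_means`, `time_weighted_kernel`, and the parameter `sigma` are hypothetical.

```python
import math


def segment_means(frames, n_segments):
    """Split per-frame feature vectors into contiguous segments.

    Returns a list of (normalized_center, mean_vector) pairs, so the
    temporal position of each segment is kept alongside its features.
    Assumes `frames` is a non-empty list of equal-length lists of floats.
    """
    n = len(frames)
    n_segments = min(n_segments, n)
    dim = len(frames[0])
    segments = []
    for s in range(n_segments):
        start = s * n // n_segments
        end = (s + 1) * n // n_segments
        chunk = frames[start:end]
        mean = [sum(f[d] for f in chunk) / len(chunk) for d in range(dim)]
        center = (start + end) / (2 * n)  # position in [0, 1]
        segments.append((center, mean))
    return segments


def time_weighted_kernel(seq_a, seq_b, sigma=0.2):
    """Illustrative similarity between two segment lists of any length.

    Every pair of segments contributes a linear kernel on the features,
    weighted by a Gaussian on the temporal distance (an assumption made
    for this sketch, not taken from the paper).
    """
    total = 0.0
    for t_a, x_a in seq_a:
        for t_b, x_b in seq_b:
            weight = math.exp(-((t_a - t_b) ** 2) / (2 * sigma ** 2))
            total += weight * sum(p * q for p, q in zip(x_a, x_b))
    return total / (len(seq_a) * len(seq_b))


# Two videos with the same sub-action order but different lengths,
# and one video with the sub-actions in reverse order.
video_a = segment_means([[1.0, 0.0]] * 4 + [[0.0, 1.0]] * 4, 2)
video_b = segment_means([[1.0, 0.0]] * 3 + [[0.0, 1.0]] * 3, 2)
video_c = segment_means([[0.0, 1.0]] * 3 + [[1.0, 0.0]] * 3, 2)
```

Under these assumptions, `time_weighted_kernel(video_a, video_b)` is larger than `time_weighted_kernel(video_a, video_c)`, even though all three videos contain the same sub-actions. A single order-free vector per video, such as the mean over all frames, could not distinguish `video_b` from `video_c`.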

