A Time Flexible Kernel framework for video-based activity recognition

Article ID	Journal	Published Year	Pages	File Type
528362	Image and Vision Computing	2016	11 Pages	PDF

Abstract

• TFK: a kernel framework between arbitrary length sequences.
• Some complex activities are defined by the order of sub-actions.
• The new kernel framework improves results in complex activity recognition.
• Combination of several levels of granularity in temporal divisions reduces clutter.

This work addresses the challenging task of activity recognition in unconstrained videos. Standard methods encode low-level video features using Fisher Vectors or Bag of Features. However, these approaches reduce every sequence to a single fixed-dimensional vector that lacks long-term temporal information, which may be important for recognition, especially of complex activities. This work proposes a framework with two main technical novelties: first, a video encoding method that maintains the temporal structure of sequences, and second, a Time Flexible Kernel that allows comparison of sequences of different lengths and random alignment. Results on challenging benchmarks and comparison to previous work demonstrate the applicability and value of our framework.
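To make the idea of comparing sequences of different lengths concrete, the following is a minimal sketch of a time-aware sequence kernel. It is not the TFK formulation from the paper, which the abstract does not spell out. The function names, the Gaussian weighting on normalized segment positions, the `sigma` parameter, and the linear base kernel are all assumptions chosen for illustration.

```python
import math


def time_weighted_sequence_kernel(xs, ys, sigma=0.2):
    """Illustrative kernel between two sequences of segment descriptors.

    xs, ys: lists of equal-dimension descriptor vectors, one per temporal
    segment; the two lists may have different lengths.
    sigma: assumed width of the temporal weighting (hypothetical parameter).
    """
    n, m = len(xs), len(ys)
    total = 0.0
    for i, x in enumerate(xs):
        # Relative position of segment i within its sequence, in (0, 1).
        ti = (i + 0.5) / n
        for j, y in enumerate(ys):
            tj = (j + 0.5) / m
            # Segments at similar relative times get a weight close to 1.
            w = math.exp(-((ti - tj) ** 2) / (2.0 * sigma ** 2))
            # Linear base kernel between descriptors (an assumption).
            base = sum(a * b for a, b in zip(x, y))
            total += w * base
    return total


def normalized_kernel(xs, ys, sigma=0.2):
    """Cosine-style normalization so that k(x, x) = 1 for non-zero x."""
    kxy = time_weighted_sequence_kernel(xs, ys, sigma)
    kxx = time_weighted_sequence_kernel(xs, xs, sigma)
    kyy = time_weighted_sequence_kernel(ys, ys, sigma)
    return kxy / math.sqrt(kxx * kyy)


# Four sub-actions in order, each described by a toy one-hot descriptor.
seq_a = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
# A longer sequence with the same sub-actions in the same order.
seq_b = [[1, 0, 0, 0], [1, 0, 0, 0], [0, 1, 0, 0],
         [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]

same_order = normalized_kernel(seq_a, seq_b)
reversed_order = normalized_kernel(seq_a, seq_b[::-1])
```

In this toy setup, `same_order` comes out larger than `reversed_order`, which a single pooled vector per video could not distinguish, since both orderings contain the same sub-actions.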


Keywords

Activity recognition; Kernel methods