Article ID Journal Published Year Pages File Type
10322220 Expert Systems with Applications 2015 14 Pages PDF
Abstract
Developing an expert text detection system for video indexing and retrieving is a challenging task due to low resolution, complex background, non-illumination and movement of text present in a video. Besides, text detection is vital for several real time applications, such as license plate recognition, assisting a blind person and other surveillance applications. In this paper, we introduce a new descriptor called Histogram Oriented Moments (HOM) for text detection in video, which is invariant to rotation, scaling, font, and font size variations. The HOM finds orientations with the second order geometrical moments for each sliding window (overlapped block) of the input frame. The proposed method performs histogram operations on the orientations of each window to identify the dominant orientation (as a representative). Then, a new hypothesis is defined based on the dominant orientations of a connected component as the numbers of orientations, which point towards centroid of the connected components are larger than the number of dominant orientations which point away from the centroid of the connected components. The components that satisfy the above hypothesis are considered as text candidates, or else as non-text candidates. Further, to detect a moving text- we explore optical flow properties, such as velocity of text candidates to estimate the motions between temporal frames. The components which move with constant velocity and uniform direction are considered as text candidates otherwise non-text candidates. We demonstrate the proposed method's dominance over state of the art methods by testing on benchmark database, namely ICDAR 2013 and our own video datasets in terms of recall, precision and F-measure.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, , ,