Article ID Journal Published Year Pages File Type
532309 Pattern Recognition 2013 13 Pages PDF
Abstract

We discuss the use of histogram of oriented gradients (HOG) descriptors as an effective tool for text description and recognition. Specifically, we propose a HOG-based texture descriptor (T-HOG) that uses a partition of the image into overlapping horizontal cells with gradual boundaries, to characterize single-line texts in outdoor scenes. The input of our algorithm is a rectangular image presumed to contain a single line of text in Roman-like characters. The output is a relatively short descriptor that provides an effective input to an SVM classifier. Extensive experiments show that the T-HOG is more accurate than Dalal and Triggs's original HOG-based classifier, for any descriptor size. In addition, we show that the T-HOG is an effective tool for text/non-text discrimination and can be used in various text detection applications. In particular, combining T-HOG with a permissive bottom-up text detector is shown to outperform state-of-the-art text detection systems in two major publicly available databases.

► We discuss the use of the histogram of oriented gradients for text classification. ► We propose a novel HOG-based texture descriptor nicknamed T-HOG. ► The T-HOG allows to characterize single-line texts in outdoor scenes and video frames. ► We show that the T-HOG is an effective tool for text/non-text discrimination. ► We show that the T-HOG can be used in various text detection applications.

Related Topics
Physical Sciences and Engineering Computer Science Computer Vision and Pattern Recognition
Authors
, , , , ,