Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
6940760 | Pattern Recognition Letters | 2018 | 10 Pages |
Abstract
Automatic segmentation-free and training-free word spotting is a challenging task due to the large intra-class variability of handwritten shapes and the need to process the whole document image. In this work, a novel unsupervised hierarchical handwriting representation is introduced, where the spherical k-means algorithm is used to learn a hierarchy of features for representing document images. A matching system is then employed for word spotting, which consists of two stages: (1) a fast pre-selection stage applying a sliding-window approach over compressed document image representations, and (2) a re-ranking stage based on a discriminative description that encodes the spatial layout of local features. The proposed approach is evaluated using three well-known benchmark datasets, the Lord Byron (LB), the George Washington (GW) and the IAM datasets. Results show our method to yield competitive performance compared to state-of-the-art approaches for segmentation-free and training-free word spotting. In addition, since our proposed framework has a low computational and memory complexity, it can be applied to large datasets.
Keywords
Related Topics
Physical Sciences and Engineering
Computer Science
Computer Vision and Pattern Recognition
Authors
Mohamed Mhiri, Sherif Abuelwafa, Christian Desrosiers, Mohamed Cheriet,