Article ID Journal Published Year Pages File Type
515587 Information Processing & Management 2008 12 Pages PDF
Abstract

In this paper, we propose a text matching method for document image retrieval without any language model. Two word images are first normalized to an appropriate size and image features are extracted using the local crowdedness method. Similarity between the two features is then measured by calculating a Hausdorff distance. We performed three experiments. The first experiment proves the effectiveness of the proposed method for text matching, and the other two experiments verify the language independence and font size independence of the proposed method.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
, , ,