کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
533374 870109 2012 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Synthesizing queries for handwritten word image retrieval
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Synthesizing queries for handwritten word image retrieval
چکیده انگلیسی

We propose a method to perform text searches on handwritten word image databases when no ground-truth data is available to learn models or select example queries. The approach proceeds by synthesizing multiple images of the query string using different computer fonts. While this idea has been successfully applied to printed documents in the past, its application to the handwritten domain is not straightforward. Indeed, the domain mismatch between queries (synthetic) and database images (handwritten) leads to poor accuracy.Our solution is to represent the queries with robust features and use a model that explicitly accounts for the domain mismatch. While the model is trained using synthetic images, its generative process produces samples according to the distribution of handwritten features. Furthermore, we propose an unsupervised method to perform font selection which has a significant impact on accuracy. Font selection is formulated as finding an optimal weighted mixture of fonts that best approximates the distribution of handwritten low-level features. Experiments demonstrate that the proposed method is an effective way to perform queries without using any human annotated example in any part of the process.


► We focus on the problem of text searches on handwritten document collections.
► We propose to synthesize images of the query word in a variety of computer fonts.
► Model learned on-the fly with these images -domain asymmetry is explicitly modeled.
► Unsupervised, efficient and keyword-independent font weighting approach.
► No human-annotated/selected data is needed (i.e method uses 0 prototype images).

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 45, Issue 9, September 2012, Pages 3270–3276
نویسندگان
, ,