کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
530349 869760 2014 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Segmentation-free word spotting with exemplar SVMs
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Segmentation-free word spotting with exemplar SVMs
چکیده انگلیسی


• An unsupervised segmentation-free word spotting method is proposed.
• A sliding window approach is used to locate document regions similar to the query image.
• Exemplar SVM is used to improve the query representation.
• PQ compression allows one to deal with large collections of documents with a minimum loss in accuracy.
• Reranking and query expansion can be applied in word spotting to improve performance.

In this paper we propose an unsupervised segmentation-free method for word spotting in document images. Documents are represented with a grid of HOG descriptors, and a sliding-window approach is used to locate the document regions that are most similar to the query. We use the Exemplar SVM framework to produce a better representation of the query in an unsupervised way. Then, we use a more discriminative representation based on Fisher Vector to rerank the best regions retrieved, and the most promising ones are used to expand the Exemplar SVM training set and improve the query representation. Finally, the document descriptors are precomputed and compressed with Product Quantization. This offers two advantages: first, a large number of documents can be kept in RAM memory at the same time. Second, the sliding window becomes significantly faster since distances between quantized HOG descriptors can be precomputed. Our results significantly outperform other segmentation-free methods in the literature, both in accuracy and in speed and memory usage.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 47, Issue 12, December 2014, Pages 3967–3978
نویسندگان
, , , ,