کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4969878 1449979 2017 23 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A survey of document image word spotting techniques
ترجمه فارسی عنوان
نظرسنجی از تکنیک های تصحیح کلمه سند تصویر
کلمات کلیدی
علامت گذاری به کلمه بازیابی، نمایه سازی سند، امکانات، نمایندگی، بازخورد مربوطه
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
چکیده انگلیسی
Vast collections of documents available in image format need to be indexed for information retrieval purposes. In this framework, word spotting is an alternative solution to optical character recognition (OCR), which is rather inefficient for recognizing text of degraded quality and unknown fonts usually appearing in printed text, or writing style variations in handwritten documents. Over the past decade there has been a growing interest in addressing document indexing using word spotting which is reflected by the continuously increasing number of approaches. However, there exist very few comprehensive studies which analyze the various aspects of a word spotting system. This work aims to review the recent approaches as well as fill the gaps in several topics with respect to the related works. The nature of texts and inherent challenges addressed by word spotting methods are thoroughly examined. After presenting the core steps which compose a word spotting system, we investigate the use of retrieval enhancement techniques based on relevance feedback which improve the retrieved results. Finally, we present the datasets which are widely used for word spotting, we describe the evaluation standards and measures applied for performance assessment and discuss the results achieved by the state of the art.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition - Volume 68, August 2017, Pages 310-332
نویسندگان
, , , ,