کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
528533 869581 2013 18 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Integrating multiple character proposals for robust scene text extraction
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
Integrating multiple character proposals for robust scene text extraction
چکیده انگلیسی


• Proposed system separates text regions from images under unconstrained environment.
• Generalized clustering utilizes properties of scene text to detect text boundaries.
• Multiple image segmentations provide various interpretations on text regions.
• Two-step CRF approach models properties and relationship of text in graph structure.
• Character proposals are generated and integrated to find proper character regions.

Text contained in scene images provides the semantic context of the images. For that reason, robust extraction of text regions is essential for successful scene text understanding. However, separating text pixels from scene images still remains as a challenging issue because of uncontrolled lighting conditions and complex backgrounds. In this paper, we propose a two-stage conditional random field (TCRF) approach to robustly extract text regions from the scene images. The proposed approach models the spatial and hierarchical structures of the scene text, and it finds text regions based on the scene text model. In the first stage, the system generates multiple character proposals for the given image by using multiple image segmentations and a local CRF model. In the second stage, the system selectively integrates the generated character proposals to determine proper character regions by using a holistic CRF model. Through the TCRF approach, we cast the scene text separation problem as a probabilistic labeling problem, which yields the optimal label configuration of pixels that maximizes the conditional probability of the given image. Experimental results indicate that our framework exhibits good performance in the case of the public databases.

Figure optionsDownload high-quality image (370 K)Download as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Image and Vision Computing - Volume 31, Issue 11, November 2013, Pages 823–840
نویسندگان
, ,