کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
525740 869020 2015 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
New Gradient-Spatial-Structural Features for video script identification
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
New Gradient-Spatial-Structural Features for video script identification
چکیده انگلیسی


• Dominant pixel selection by exploring gradient information with histogram.
• Proposing Gradient-Spatial-Features for text candidate selection.
• Proposing Gradient-Structural-Features in new way for text candidate selection.
• Determining weights based on templates for correct classification.
• Integrating Gradient-Spatial-Structural-Features in novel way for classification.

Multi-script identification helps in automatically selecting an appropriate OCR when video has several scripts; however, script identification in video frames is challenging because low resolution and complex background of video often cause disconnections or the loss of text information. This paper presents a novel idea that integrates the Gradient-Spatial-Features (GSpF) and the Gradient-Structural-Features (GStF) at block level based on an error factor and the weights of the features to identify six video scripts, namely, Arabic, Chinese, English, Japanese, Korean and Tamil. Horizontal and vertical gradient values are first computed for each text block to increase the contrast of text pixels. Then the method divides the horizontal and the vertical gradient blocks into two equal parts at the centroid in the horizontal direction. Histogram operation on each part is performed to select dominant text pixels from respective subparts of the horizontal and the vertical gradient blocks, which results in text components. After extracting GSpF and GStF from the text components, we finally propose to integrate the spatial and the structural features based on end points, intersection points, junction points and straightness of the skeleton of text components in a novel way to identify the scripts. The method is evaluated on 970 video frames of six scripts which involves font, font size or contrast variations, and is compared with an existing method in terms of classification rate. Experimental results show that the proposed method achieves 83.0% average classification rate for video script identification. The method is also evaluated by testing on noisy images and scanned low resolution documents, illustrating the robustness and the extensibility of the proposed Gradient-Spatial-Structural Features.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Vision and Image Understanding - Volume 130, January 2015, Pages 35–53
نویسندگان
, , , , ,