کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
495101 862815 2015 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Rough-fuzzy clustering and multiresolution image analysis for text-graphics segmentation
ترجمه فارسی عنوان
تجزیه و تحلیل خوشه ای فازی و تجزیه و تحلیل تصویر چند منظوره برای تقسیم بندی متن گرافیکی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• A new method is proposed for text-graphics segmentation.
• M-band wavelet packet is used to extract scale-space features for document image.
• Unsupervised feature selection method is proposed to select relevant and non-redundant features.
• Rough-fuzzy clustering is used to address uncertainty problem of document segmentation.
• The approach is invariant under font size of text, scanning resolution and type of layout.

This paper presents a segmentation method, integrating judiciously the merits of rough-fuzzy computing and multiresolution image analysis technique, for documents having both text and graphics regions. It assumes that the text and non-text or graphics regions of a given document are considered to have different textural properties. The M-band wavelet packet analysis and rough-fuzzy-possibilistic c-means are used for text-graphics segmentation problem. The M-band wavelet packet is used to extract the scale-space features, which offers a huge range of possibilities of scale-space features for document image and is able to zoom it onto narrow band high frequency components. A scale-space feature vector is thus derived, taken at different scales for each pixel in an image. However, the decomposition scheme employing M-band wavelet packet leads to a large number of redundant features. In this regard, an unsupervised feature selection method is introduced to select a set of relevant and non-redundant features for text-graphics segmentation problem. Finally, the rough-fuzzy-possibilistic c-means algorithm is used to address the uncertainty problem of document segmentation. The whole approach is invariant under the font size, line orientation, and script of the text. The performance of the proposed technique, along with a comparison with related approaches, is demonstrated on a set of real life document images.

Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 30, May 2015, Pages 705–721
نویسندگان
, ,