کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4943141 1437621 2017 39 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A robust system for document layout analysis using multilevel homogeneity structure
ترجمه فارسی عنوان
یک سیستم قوی برای تجزیه و تحلیل طرح سند با استفاده از ساختار همگن چند سطحی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
One of the difficulties in the understanding of document images is document layout analysis, which is the first step in document image modeling. In this paper, a robust system for which a multilevel-homogeneity structure is used in accordance with a hybrid methodology is proposed to deal with this problem. Our system consists of the following three main stages: classification, segmentation, and refinement and labeling. Different from other page segmentation methods, the proposed system includes an efficient algorithm to detect table regions in document images. Besides, to create an effective application, the proposed system is designed to work with a variety of document languages. The proposed method was tested with the ICDAR2015 competition (RDCL-2015) and three other published datasets in different languages. The results of these tests show that the accuracy of proposed system is superior to the previous methods.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 85, 1 November 2017, Pages 99-113
نویسندگان
, , , , , ,