کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
729800 | 1461526 | 2015 | 12 صفحه PDF | دانلود رایگان |
• We investigate problems related to conversion of paper and electronic documents into standard electronic form.
• We analyze the steps for the creation of an electronic medical databases.
• We propose an automatic system useful for extraction and collection of data contained in medical reports.
• We evaluate the performance of the proposed system.
This paper illustrates an automatic document processing system for the extraction of data contained in medical laboratory results printed on paper. The final goal of the research is to automate the collection of medical data and to enable an efficient management and dissemination of the information. The following processing steps of the system are described in detail: image preprocessing; layout analysis for the identification of the tables contained in the document; extraction and classification of the laboratory results. Among the many features of the system there are the use of an open source OCR engine, as a basis of further processing, and the storage in XML format of the data retrieved, for ease of sharing. The knowledge base used to guide the data extraction is also explained. The proposed approach has been tested on several document formats and performance analyzed.
Journal: Measurement - Volume 61, February 2015, Pages 88–99