کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
518262 | 867571 | 2011 | 9 صفحه PDF | دانلود رایگان |
Automatic summarization has been proposed to help manage the results of biomedical information retrieval systems. Semantic MEDLINE, for example, summarizes semantic predications representing assertions in MEDLINE citations. Results are presented as a graph which maintains links to the original citations. Graphs summarizing more than 500 citations are hard to read and navigate, however. We exploit graph theory for focusing these large graphs. The method is based on degree centrality, which measures connectedness in a graph. Four categories of clinical concepts related to treatment of disease were identified and presented as a summary of input text. A baseline was created using term frequency of occurrence. The system was evaluated on summaries for treatment of five diseases compared to a reference standard produced manually by two physicians. The results showed that recall for system results was 72%, precision was 73%, and F-score was 0.72. The system F-score was considerably higher than that for the baseline (0.47).
The graph illustrates a therapy-oriented summary for Parkinson’s disease. The method uses degree centrality to identify major comorbidities, relevant body parts, drugs and procedures for this disease.Figure optionsDownload as PowerPoint slideHighlights
► Graph theory is exploited to extend a semantic abstraction method for summarizing multiple biomedical texts.
► Degree centrality is effective in selecting information crucial for summarizing research on treatment of disease.
► The system performs significantly better than a frequency-based method in identifying salient information.
Journal: Journal of Biomedical Informatics - Volume 44, Issue 5, October 2011, Pages 830–838