Article code: 468768
Journal code: 698254
Publication year: 2015
English article: 13 pages, PDF
Full text: free download
English title of the ISI article
Enhancing medical named entity recognition with an extended segment representation technique
Keywords
Biomedical text mining, information extraction, unstructured medical records, natural language processing, medical text annotation
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science (General)
Abstract

Objective: The objective of this paper is to formulate an extended segment representation (SR) technique to enhance named entity recognition (NER) in medical applications.

Methods: An extension to the IOBES (Inside/Outside/Begin/End/Single) SR technique is formulated. In the proposed extension, a new class is assigned to words that do not belong to a named entity (NE) in one context but appear as an NE in other contexts. Ambiguity in such cases can negatively affect the results of classification-based NER techniques. Assigning a separate class to words that can potentially cause ambiguity in NER allows a classifier to detect NEs more accurately, thereby increasing classification accuracy.

Results: The proposed SR technique is evaluated using the i2b2 2010 medical challenge data set with eight different classifiers. Each classifier is trained separately to extract three different medical NEs, namely treatment, problem, and test. Across the three experiments, the extended SR technique improves the average F1-measure for seven of the eight classifiers. The kNN classifier shows an average reduction of 0.18% across the three experiments, while the C4.5 classifier records an average improvement of 9.33%.
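
The labeling idea described in the Methods paragraph can be illustrated with a minimal sketch. It is not the authors' implementation: the label name `"A"` for the new class, the helper names, and the rule used to decide which words are ambiguous (any word seen inside an entity anywhere in the corpus) are assumptions made for illustration, since the abstract does not specify them. The two sentences are invented toy data for a single entity type (problem).

```python
from typing import List, Set, Tuple

Span = Tuple[int, int]  # (start, end) token indices, end exclusive
Sentence = Tuple[List[str], List[Span]]


def iobes_tags(n_tokens: int, spans: List[Span]) -> List[str]:
    """Standard IOBES labels for one sentence given its entity spans."""
    tags = ["O"] * n_tokens
    for start, end in spans:
        if end - start == 1:
            tags[start] = "S"          # single-token entity
        else:
            tags[start] = "B"          # first token of the entity
            tags[end - 1] = "E"        # last token of the entity
            for i in range(start + 1, end - 1):
                tags[i] = "I"          # tokens strictly inside
    return tags


def entity_vocabulary(corpus: List[Sentence]) -> Set[str]:
    """Assumed rule: collect every word that occurs inside some entity."""
    vocab: Set[str] = set()
    for tokens, spans in corpus:
        for start, end in spans:
            vocab.update(tok.lower() for tok in tokens[start:end])
    return vocab


def extended_tags(tokens: List[str], spans: List[Span], vocab: Set[str],
                  ambiguous_label: str = "A") -> List[str]:
    """IOBES plus one extra class (hypothetical label "A") for words that
    are outside an entity here but appear inside an entity elsewhere."""
    base = iobes_tags(len(tokens), spans)
    return [
        ambiguous_label if tag == "O" and tok.lower() in vocab else tag
        for tok, tag in zip(tokens, base)
    ]


# Toy corpus: "chest pain" is a problem entity in the first sentence,
# while "Pain" in the second sentence is not annotated as a problem.
corpus: List[Sentence] = [
    (["Patient", "denies", "chest", "pain"], [(2, 4)]),
    (["Pain", "medication", "was", "given"], []),
]
vocab = entity_vocabulary(corpus)
for tokens, spans in corpus:
    print(list(zip(tokens, extended_tags(tokens, spans, vocab))))
```

Under plain IOBES, "Pain" in the second sentence would share the `O` class with words such as "was" and "given". In this sketch it receives its own class, which is the kind of separation the abstract credits with reducing ambiguity for the classifier.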

Publisher
Database: Elsevier - ScienceDirect
Journal: Computer Methods and Programs in Biomedicine - Volume 119, Issue 2, April 2015, Pages 88–100