Article code: 493241
Journal code: 721685
Publication year: 2012
English article: 8 pages, PDF
English title of the ISI article
Multiobjective Optimization for Biomedical Named Entity Recognition and Classification
Related topics
Engineering and Basic Sciences, Computer Engineering, Computer Science (General)
English abstract

Named Entity Recognition and Classification (NERC) is one of the most fundamental and important tasks in biomedical information extraction. Biomedical named entities (NEs) include mentions of proteins, genes, DNA, RNA, etc., which generally have complex structures and are difficult to recognize. We have developed a large number of features for identifying NEs in biomedical texts. Two robust and diverse classification methods, Conditional Random Field (CRF) and Support Vector Machine (SVM), are used to build a number of models that depend on the various representations of the feature set and/or feature templates. Finally, the outputs of these different classifiers are combined using a multiobjective weighted voting approach. We hypothesize that the reliability of each classifier's predictions differs among the various output classes. Thus, in an ensemble system, it is necessary to determine the appropriate weight of vote for each output class in each classifier. Here, a multiobjective genetic algorithm is used to determine appropriate vote weights for combining the outputs of the classifiers. The developed technique is evaluated on the JNLPBA 2004 benchmark dataset and yields overall recall, precision and F-measure values of 74.10%, 77.58% and 75.80%, respectively.
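
The per-class weighted voting described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `weighted_vote` and `f_measure`, the tag names, and the weight values are all hypothetical, and the weights are hard-coded here rather than found by a multiobjective genetic algorithm as in the paper. The sketch only shows how a separate weight for each (classifier, class) pair can change the outcome relative to a plain majority vote, and how the reported F-measure follows from the reported recall and precision under the standard balanced F-measure formula.

```python
from collections import defaultdict


def weighted_vote(predictions, weights):
    """Combine one token's labels from several classifiers.

    predictions: list of labels, predictions[i] is classifier i's output.
    weights: list of dicts, weights[i][label] is the (assumed) vote weight
             of classifier i for that label; missing entries count as 0.
    """
    scores = defaultdict(float)
    for i, label in enumerate(predictions):
        scores[label] += weights[i].get(label, 0.0)
    # Iterate labels in sorted order so that ties resolve deterministically.
    return max(sorted(scores), key=lambda lab: scores[lab])


def f_measure(recall, precision):
    """Balanced F-measure: harmonic mean of recall and precision."""
    return 2 * recall * precision / (recall + precision)


# Hypothetical weights: classifier 0 is trusted on "B-DNA",
# classifiers 1 and 2 are only weakly trusted on "O".
example_weights = [
    {"B-DNA": 0.9, "O": 0.2},
    {"B-DNA": 0.1, "O": 0.3},
    {"B-DNA": 0.1, "O": 0.3},
]

# Majority vote would pick "O"; the weighted vote picks "B-DNA" (0.9 > 0.6).
combined = weighted_vote(["B-DNA", "O", "O"], example_weights)
```

With these assumed weights, a single reliable classifier outvotes two less reliable ones for that class, which is the behaviour the class-specific weights are meant to allow.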

Publisher
Database: Elsevier - ScienceDirect
Journal: Procedia Technology - Volume 6, 2012, Pages 206-213