کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1181394 962932 2010 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Automatic feature subset selection for decision tree-based ensemble methods in the prediction of bioactivity
موضوعات مرتبط
مهندسی و علوم پایه شیمی شیمی آنالیزی یا شیمی تجزیه
پیش نمایش صفحه اول مقاله
Automatic feature subset selection for decision tree-based ensemble methods in the prediction of bioactivity
چکیده انگلیسی

In the structure–activity relationship (SAR) study, a learning algorithm is usually faced with the problem of selecting a compact subset of descriptors related to the property of interest, while ignoring the rest. This paper presents a new method of molecular descriptor selection utilizing three commonly used decision tree (DT)-based ensemble methods coupled with a backward elimination strategy (BES). Our proposed method eliminates descriptor redundancy automatically and searches for more compact descriptor subset tailored to DT-based ensemble methods. Six real SAR datasets related to different categorical bioactivities of compounds are used to evaluate the proposed method. The results obtained in this study indicate that DT-based ensemble methods coupled with BES, especially boosting tree model, yield better classification performance for compounds related to ADMET.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Chemometrics and Intelligent Laboratory Systems - Volume 103, Issue 2, 15 October 2010, Pages 129–136
نویسندگان
, , , , ,