کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
1181394 | 962932 | 2010 | 8 صفحه PDF | دانلود رایگان |
![عکس صفحه اول مقاله: Automatic feature subset selection for decision tree-based ensemble methods in the prediction of bioactivity Automatic feature subset selection for decision tree-based ensemble methods in the prediction of bioactivity](/preview/png/1181394.png)
In the structure–activity relationship (SAR) study, a learning algorithm is usually faced with the problem of selecting a compact subset of descriptors related to the property of interest, while ignoring the rest. This paper presents a new method of molecular descriptor selection utilizing three commonly used decision tree (DT)-based ensemble methods coupled with a backward elimination strategy (BES). Our proposed method eliminates descriptor redundancy automatically and searches for more compact descriptor subset tailored to DT-based ensemble methods. Six real SAR datasets related to different categorical bioactivities of compounds are used to evaluate the proposed method. The results obtained in this study indicate that DT-based ensemble methods coupled with BES, especially boosting tree model, yield better classification performance for compounds related to ADMET.
Journal: Chemometrics and Intelligent Laboratory Systems - Volume 103, Issue 2, 15 October 2010, Pages 129–136