کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
468344 698220 2014 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Drug/nondrug classification using Support Vector Machines with various feature selection strategies
ترجمه فارسی عنوان
دسته بندی مواد مخدر / نابارور با استفاده از ماشین های بردار پشتیبانی با استراتژی های مختلف انتخاب ویژگی
کلمات کلیدی
پشتیبانی از ماشین های بردار توصیفگرهای مولکولی، انتخاب ویژگی، کشف مواد مخدر، فراگیری ماشین
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی


• Our aim was to classify small molecule compounds as drugs and nondrugs using Support Vector Machines (SVM).
• We used three feature selection methods to improve the performance of the SVM classifier.
• Our findings revealed that data pre-processing and feature selection enhance the performance of the classifiers.
• Our study and findings will contribute to improvement of our understanding of the early-phase drug design.

In conjunction with the advance in computer technology, virtual screening of small molecules has been started to use in drug discovery. Since there are thousands of compounds in early-phase of drug discovery, a fast classification method, which can distinguish between active and inactive molecules, can be used for screening large compound collections. In this study, we used Support Vector Machines (SVM) for this type of classification task. SVM is a powerful classification tool that is becoming increasingly popular in various machine-learning applications. The data sets consist of 631 compounds for training set and 216 compounds for a separate test set. In data pre-processing step, the Pearson's correlation coefficient used as a filter to eliminate redundant features. After application of the correlation filter, a single SVM has been applied to this reduced data set. Moreover, we have investigated the performance of SVM with different feature selection strategies, including SVM–Recursive Feature Elimination, Wrapper Method and Subset Selection. All feature selection methods generally represent better performance than a single SVM while Subset Selection outperforms other feature selection methods. We have tested SVM as a classification tool in a real-life drug discovery problem and our results revealed that it could be a useful method for classification task in early-phase of drug discovery.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computer Methods and Programs in Biomedicine - Volume 117, Issue 2, November 2014, Pages 51–60
نویسندگان
, , ,