کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
5760016 1623789 2017 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Extracting features from protein sequences to improve deep extreme learning machine for protein fold recognition
ترجمه فارسی عنوان
ویژگی های استخراج از توالی پروتئین برای بهبود دستگاه عمیق یادگیری عمیق برای شناسایی پروتئین
کلمات کلیدی
شناسایی دوران پروتئین، دستگاه یادگیری شدید توصیف کننده پروتئین، استخراج ویژگی،
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
چکیده انگلیسی
Protein fold recognition is an important problem in bioinformatics to predict three-dimensional structure of a protein. One of the most challenging tasks in protein fold recognition problem is the extraction of efficient features from the amino-acid sequences to obtain better classifiers. In this paper, we have proposed six descriptors to extract features from protein sequences. These descriptors are applied in the first stage of a three-stage framework PCA-DELM-LDA to extract feature vectors from the amino-acid sequences. Principal Component Analysis PCA has been implemented to reduce the number of extracted features. The extracted feature vectors have been used with original features to improve the performance of the Deep Extreme Learning Machine DELM in the second stage. Four new features have been extracted from the second stage and used in the third stage by Linear Discriminant Analysis LDA to classify the instances into 27 folds. The proposed framework is implemented on the independent and combined feature sets in SCOP datasets. The experimental results show that extracted feature vectors in the first stage could improve the performance of DELM in extracting new useful features in second stage.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Theoretical Biology - Volume 421, 21 May 2017, Pages 1-15
نویسندگان
, ,