کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4496049 1623833 2015 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
VR-BFDT: A variance reduction based binary fuzzy decision tree induction method for protein function prediction
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
پیش نمایش صفحه اول مقاله
VR-BFDT: A variance reduction based binary fuzzy decision tree induction method for protein function prediction
چکیده انگلیسی


• Protein multi-function prediction.
• Decision boundary fuzzification.
• Label variance reduction as splitting criterion.
• Hierarchical multi-label classification.
• Variance reduction based binary fuzzy decision tree induction.

In protein function prediction (PFP) problem, the goal is to predict function of numerous well-sequenced known proteins whose function is not still known precisely. PFP is one of the special and complex problems in machine learning domain in which a protein (regarded as instance) may have more than one function simultaneously. Furthermore, the functions (regarded as classes) are dependent and also are organized in a hierarchical structure in the form of a tree or directed acyclic graph. One of the common learning methods proposed for solving this problem is decision trees in which, by partitioning data into sharp boundaries sets, small changes in the attribute values of a new instance may cause incorrect change in predicted label of the instance and finally misclassification. In this paper, a Variance Reduction based Binary Fuzzy Decision Tree (VR-BFDT) algorithm is proposed to predict functions of the proteins. This algorithm just fuzzifies the decision boundaries instead of converting the numeric attributes into fuzzy linguistic terms. It has the ability of assigning multiple functions to each protein simultaneously and preserves the hierarchy consistency between functional classes. It uses the label variance reduction as splitting criterion to select the best “attribute-value” at each node of the decision tree. The experimental results show that the overall performance of the proposed algorithm is promising.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Theoretical Biology - Volume 377, 21 July 2015, Pages 10–24
نویسندگان
, ,