کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
8408263 1545069 2018 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A Review of Matched-pairs Feature Selection Methods for Gene Expression Data Analysis
ترجمه فارسی عنوان
یک بررسی از روش انتخاب ویژگی های متقاطع برای تجزیه و تحلیل داده های بیان ژن
کلمات کلیدی
انتخاب ممتاز همراه، طراحی موردی کنونی اطلاعات جفت شده، بیان ژن،
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی بیوتکنولوژی یا زیست‌فناوری
چکیده انگلیسی
With the rapid accumulation of gene expression data from various technologies, e.g., microarray, RNA-sequencing (RNA-seq), and single-cell RNA-seq, it is necessary to carry out dimensional reduction and feature (signature genes) selection in support of making sense out of such high dimensional data. These computational methods significantly facilitate further data analysis and interpretation, such as gene function enrichment analysis, cancer biomarker detection, and drug targeting identification in precision medicine. Although numerous methods have been developed for feature selection in bioinformatics, it is still a challenge to choose the appropriate methods for a specific problem and seek for the most reasonable ranking features. Meanwhile, the paired gene expression data under matched case-control design (MCCD) is becoming increasingly popular, which has often been used in multi-omics integration studies and may increase feature selection efficiency by offsetting similar distributions of confounding features. The appropriate feature selection methods specifically designed for the paired data, which is named as matched-pairs feature selection (MPFS), however, have not been maturely developed in parallel. In this review, we compare the performance of 10 feature-selection methods (eight MPFS methods and two traditional unpaired methods) on two real datasets by applied three classification methods, and analyze the algorithm complexity of these methods through the running of their programs. This review aims to induce and comprehensively present the MPFS in such a way that readers can easily understand its characteristics and get a clue in selecting the appropriate methods for their analyses.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational and Structural Biotechnology Journal - Volume 16, 2018, Pages 88-97
نویسندگان
, , , , ,