کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
495322 862823 2014 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Implicit feature selection for omics data phenotype discrimination
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
پیش نمایش صفحه اول مقاله
Implicit feature selection for omics data phenotype discrimination
چکیده انگلیسی

A novel feature selection method: meta-data approximation (MDA) is proposed for high dimensional omics data. Different from conventional approaches, MDA is an implicit feature selection algorithm that does not have an explicit feature number decrease or dimension reduction for input data. On the other hand, MDA calculates corresponding same dimensional meta-data to approximate original data through our proposed hierarchical feature decomposition. We demonstrate MDA's advantages in disease phenotype discrimination for different high dimensional omics profiles by comparing it with state-of-the-arts. Our results suggest that high dimensional omics data are actually linearly separable under meta-data approximation. Finally, a MDA-induced biomarker discovery algorithm is proposed to further demonstrate omics data’ linear separability by identifying meaningful biomarkers. Our work not only suggests a new alternative to overcome the weaknesses of the traditional feature selection methods, but also provides new techniques and insights in translational bioinformatics. To the best of our knowledge, we firstly proposed the concept and method of implicit feature selection, which will also have positive impacts on machine learning fields.

Separating 72 normal, 78 HCC, and 51 cirrhotic samples in Cirrhosis3 data (23,846 × 201) by using their three top-ranked biomarkers. Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 20, July 2014, Pages 70–82
نویسندگان
,