کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1162965 1490921 2016 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Variable importance analysis based on rank aggregation with applications in metabolomics for biomarker discovery
ترجمه فارسی عنوان
تجزیه و تحلیل اهمیت متغیر بر اساس تجمع رتبه با برنامه های کاربردی در متابولومیک برای کشف بیومارکر
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه شیمی شیمی آنالیزی یا شیمی تجزیه
چکیده انگلیسی


• Present a problem of inconsistency between variable ranking methods for biomarker discovery in metabolomics study.
• Rank aggregation is used to merge individual ranking lists into a single “super”-list reflective of the overall preference.
• Rank aggregation has better performance when compared with using all variables and penalized method.

Biomarker discovery is one important goal in metabolomics, which is typically modeled as selecting the most discriminating metabolites for classification and often referred to as variable importance analysis or variable selection. Until now, a number of variable importance analysis methods to discover biomarkers in the metabolomics studies have been proposed. However, different methods are mostly likely to generate different variable ranking results due to their different principles. Each method generates a variable ranking list just as an expert presents an opinion. The problem of inconsistency between different variable ranking methods is often ignored. To address this problem, a simple and ideal solution is that every ranking should be taken into account. In this study, a strategy, called rank aggregation, was employed. It is an indispensable tool for merging individual ranking lists into a single “super”-list reflective of the overall preference or importance within the population. This “super”-list is regarded as the final ranking for biomarker discovery. Finally, it was used for biomarkers discovery and selecting the best variable subset with the highest predictive classification accuracy. Nine methods were used, including three univariate filtering and six multivariate methods. When applied to two metabolic datasets (Childhood overweight dataset and Tubulointerstitial lesions dataset), the results show that the performance of rank aggregation has improved greatly with higher prediction accuracy compared with using all variables. Moreover, it is also better than penalized method, least absolute shrinkage and selectionator operator (LASSO), with higher prediction accuracy or less number of selected variables which are more interpretable.

Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Analytica Chimica Acta - Volume 911, 10 March 2016, Pages 27–34
نویسندگان
, , , , ,