کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
494552 862799 2016 19 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Pipelining the ranking techniques for microarray data classification: A case study
ترجمه فارسی عنوان
مسیر سازی اطلاعات تکنیک های رتبه بندی برای طبقه بندی داده های میکروارگیر: یک مطالعه موردی
کلمات کلیدی
داده های میکروآرایه؛ انتخاب ویژگی؛ ویژگی رتبه بندی تکنیک؛ تقسیم بندی؛ آزمون آماری
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• This is a technique for feature selection and classification of microarray databases.
• Here rather than choosing single ranking method number of pipeline of ranking methods are used.
• Using different classifier a stable pipeline for feature selection is chosen.

Identification of relevant genes from microarray data is an apparent need in many applications. For such identification different ranking techniques with different evaluation criterion are used, which usually assign different ranks to the same gene. As a result, different techniques identify different gene subsets, which may not be the set of significant genes. To overcome such problems, in this study pipelining the ranking techniques is suggested. In each stage of pipeline, few of the lower ranked features are eliminated and at the end a relatively good subset of feature is preserved. However, the order in which the ranking techniques are used in the pipeline is important to ensure that the significant genes are preserved in the final subset. For this experimental study, twenty four unique pipeline models are generated out of four gene ranking strategies. These pipelines are tested with seven different microarray databases to find the suitable pipeline for such task. Further the gene subset obtained is tested with four classifiers and four performance metrics are evaluated. No single pipeline dominates other pipelines in performance; therefore a grading system is applied to the results of these pipelines to find out a consistent model. The finding of grading system that a pipeline model is significant is also established by Nemenyi post-hoc hypothetical test. Performance of this pipeline model is compared with four ranking techniques, though its performance is not superior always but majority of time it yields better results and can be suggested as a consistent model. However it requires more computational time in comparison to single ranking techniques.

Layout of the pipelined rank based microarray data classification.Figure optionsDownload as PowerPoint slide

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Applied Soft Computing - Volume 48, November 2016, Pages 298–316
نویسندگان
, ,