کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
571009 1446522 2016 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Scalable Information Gain Variant on Spark Cluster for Rapid Quantification of Microarray
ترجمه فارسی عنوان
اطلاعات مقیاس پذیر می تواند گویای خوشه های جرقه را برای اندازه گیری سریع میکروارایا فراهم کند
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر علوم کامپیوتر (عمومی)
چکیده انگلیسی

Microarray technology is one of the emerging technologies in the field of genetic research, which many researchers often use to monitor expression levels of genes in a given organism. Microarray experiments have wide range of applications in health care sector. The colossal amount of raw gene expression data often leads to computational and analytical challenges including feature selection and classification of the dataset into correct group or class. In this paper, mutual information feature selection method based on spark framework (sf-MIFS) is proposed to determine the pertinent features. After completion of feature selection process, various classifiers i.e., Logistic Regression (sf-LoR) and Naive Bayes (sf-NB) based on Spark framework has been applied to classify the microarray datasets. A detailed comparative analysis in terms of execution time and accuracy is enumerated on the proposed feature selection and classifier methodologies, based on Spark framework and conventional system respectively.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Computer Science - Volume 93, 2016, Pages 292–298
نویسندگان
, , , ,