کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
504845 864442 2015 12 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A probabilistic approach for automated discovery of perturbed genes using expression data from microarray or RNA-Seq
ترجمه فارسی عنوان
یک روش احتمالاتی برای کشف خودکار ژن‌های آشفته با استفاده از داده های بیان از میکروآرایه یا RNA-Seq
کلمات کلیدی
کشف بیومارکر؛ مدل سازی احتمالی؛ تجزیه و تحلیل شبکه؛ Microarrays؛ RNA-Seq؛ ژنومیکس؛ سرطان پستان؛ بیماری پیچیده؛ تنومند؛ سیگنالینگ و شبکه نظارتی
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر
چکیده انگلیسی


• Automated analysis of high throughput data for disease gene and biomarker discovery.
• Network based analysis using probability of change for gene expression.
• Analysis of coherent path level changes in regulation in response to complex disease.
• Biomarkers with high precision and accuracy of classification.

BackgroundIn complex diseases, alterations of multiple molecular and cellular components in response to perturbations are indicative of disease physiology. While expression level of genes from high-throughput analysis can vary among patients, the common path among disease progression suggests that the underlying cellular sub-processes involving associated genes follow similar fates. Motivated by the interconnected nature of sub-processes, we have developed an automated methodology that combines ideas from biological networks, statistical models, and game theory, to probe connected cellular processes. The core concept in our approach uses probability of change (POC) to indicate the probability that a gene’s expression level has changed between two conditions. POC facilitates the definition of change at the neighborhood, pathway, and network levels and enables evaluation of the influence of diseases on the expression. The ‘connected’ disease-related genes (DRG) identified display coherent and concomitant differential expression levels along paths.ResultsRNA-Seq and microarray breast cancer subtyping expression data sets were used to identify DRG between subtypes. A machine-learning algorithm was trained for subtype discrimination using the DRG, and the training yielded a set of biomarkers. The discriminative power of the biomarkers was tested using an unseen data set. Biomarkers identified overlaps with disease-specific identified genes, and we were able to classify disease subtypes with 100% and 80% agreement with PAM50, for microarray and RNA-Seq data set respectively.ConclusionsWe present an automated probabilistic approach that offers unbiased and reproducible results, thus complementing existing methods in DRG and biomarker discovery for complex diseases.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computers in Biology and Medicine - Volume 67, 1 December 2015, Pages 29–40
نویسندگان
, ,