کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4499888 1624010 2015 13 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
From genome-scale data to models of infectious disease: A Bayesian network-based strategy to drive model development
ترجمه فارسی عنوان
از داده های ژنوم مقیاس به مدل های بیماری عفونی: استراتژی مبتنی بر شبکه بیس برای رانندگی مدل توسعه
کلمات کلیدی
استنتاج شبکه بیسین، تجزیه و تحلیل داده ها در مقیاس بزرگ، توسعه مدل، بیماری های عفونی، مالاریا
موضوعات مرتبط
علوم زیستی و بیوفناوری علوم کشاورزی و بیولوژیک علوم کشاورزی و بیولوژیک (عمومی)
چکیده انگلیسی


• A generalized workflow for driving model development based on large-scale datasets.
• Network inference for omic-scale datasets with few observations.
• Robust Bayesian network learning from few samples using resampling and permutation.

High-throughput, genome-scale data present a unique opportunity to link host to pathogen on a molecular level. Forging such connections will help drive the development of mathematical models to better understand and predict both pathogen behavior and the epidemiology of infectious diseases, including malaria. However, the datasets that can aid in identifying these links and models are vast and not amenable to simple, reductionist, and univariate analyses. These datasets require data mining in order to identify the truly important measurements that best describe clinical and molecular observations. Moreover, these datasets typically have relatively few samples due to experimental limitations (particularly for human studies or in vivo animal experiments), making data mining extremely difficult. Here, after first providing a brief overview of common strategies for data reduction and identification of relationships between variables for inclusion in mathematical models, we present a new generalized strategy for performing these data reduction and relationship inference tasks. Our approach emphasizes the importance of robustness when using data to drive model development, particularly when using genome-scale, small-sample in vivo data. We identify the use of appropriate feature reduction combined with data permutations and subsampling strategies as being critical to enable increasingly robust results from network inference using high-dimensional, low-observation data.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Mathematical Biosciences - Volume 270, Part B, December 2015, Pages 156–168
نویسندگان
, , , , ,