کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
8258410 1534605 2018 11 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Distinguishing three subtypes of hematopoietic cells based on gene expression profiles using a support vector machine
ترجمه فارسی عنوان
مشخص کردن سه زیر تیپ سلول های خونگرم بر اساس پروفایل های بیان ژن با استفاده از دستگاه بردار پشتیبانی
کلمات کلیدی
هماتوپیزیسم، سلول های بنیادی هماتوپوئیت، ماشین بردار پشتیبانی، حداقل بهینه سازی بهینه حداقل معادله حداکثر ارتباط،
موضوعات مرتبط
علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی سالمندی
چکیده انگلیسی
Hematopoiesis is a complicated process involving a series of biological sub-processes that lead to the formation of various blood components. A widely accepted model of early hematopoiesis proceeds from long-term hematopoietic stem cells (LT-HSCs) to multipotent progenitors (MPPs) and then to lineage-committed progenitors. However, the molecular mechanisms of early hematopoiesis have not been fully characterized. In this study, we applied a computational strategy to identify the gene expression signatures distinguishing three types of closely related hematopoietic cells collected in recent studies: (1) hematopoietic stem cell/multipotent progenitor cells; (2) LT-HSCs; and (3) hematopoietic progenitor cells. Each cell in these cell types was represented by its gene expression profile among a total number of 20,475 genes. The expression features were analyzed by a Monte-Carlo Feature Selection (MCFS) method, resulting in a feature list. Then, the incremental feature selection (IFS) and a support vector machine (SVM) optimized with a sequential minimum optimization (SMO) algorithm were employed to access the optimal classifier with the highest Matthews correlation coefficient (MCC) value of 0.889, in which 6698 features were used to represent cells. In addition, through an updated program of MCFS method, seventeen decision rules can be obtained, which can classify the three cell types with an overall accuracy of 0.812. Using a literature review, both the rules and the top features used for building the optimal classifier were confirmed to be commonly used or potential biological markers for distinguishing the three cell types of HSPCs. This article is part of a Special Issue entitled: Accelerating Precision Medicine through Genetic and Genomic Big Data Analysis edited by Yudong Cai & Tao Huang.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease - Volume 1864, Issue 6, Part B, June 2018, Pages 2255-2265
نویسندگان
, , , , ,