دانلود رایگان مقاله: انتخاب متغیر در مجموعه داده های ناهمگن: یک مدل ترکیبی خطی ضعیف با استفاده از برنامه های کاربردی برای مطالعات ارتباطی ژنوم

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
8339986	1541186	2018	19 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Variable selection in heterogeneous datasets: A truncated-rank sparse linear mixed model with applications to genome-wide association studies

ترجمه فارسی عنوان

انتخاب متغیر در مجموعه داده های ناهمگن: یک مدل ترکیبی خطی ضعیف با استفاده از برنامه های کاربردی برای مطالعات ارتباطی ژنوم

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

انتخاب متغیر، بررسی ارتباط ژنوم، مدل مخلوط، ناهمگونی، تصحیح متضاد،

Variable selection - انتخاب متغیر Mixed model - مدل مخلوط Genome-wide association study - مطالعه مرتبط با ژنوم Heterogeneity - ناهمگونی

موضوعات مرتبط

علوم زیستی و بیوفناوری بیوشیمی، ژنتیک و زیست شناسی مولکولی زیست شیمی

پیش نمایش مقاله

انتخاب متغیر در مجموعه داده های ناهمگن: یک مدل ترکیبی خطی ضعیف با استفاده از برنامه های کاربردی برای مطالعات ارتباطی ژنوم

چکیده انگلیسی

A fundamental and important challenge in modern datasets of ever increasing dimensionality is variable selection, which has taken on renewed interest recently due to the growth of biological and medical datasets with complex, non-i.i.d. structures. Naïvely applying classical variable selection methods such as the Lasso to such datasets may lead to a large number of false discoveries. Motivated by genome-wide association studies in genetics, we study the problem of variable selection for datasets arising from multiple subpopulations, when this underlying population structure is unknown to the researcher. We propose a unified framework for sparse variable selection that adaptively corrects for population structure via a low-rank linear mixed model. Most importantly, the proposed method does not require prior knowledge of sample structure in the data and adaptively selects a covariance structure of the correct complexity. Through extensive experiments, we illustrate the effectiveness of this framework over existing methods. Further, we test our method on three different genomic datasets from plants, mice, and human, and discuss the knowledge we discover with our method.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Methods - Volume 145, 1 August 2018, Pages 2-9

نویسندگان

Haohan Wang, Bryon Aragam, Eric P. Xing,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : انتخاب متغیر در مجموعه داده های ناهمگن: یک مدل ترکیبی خطی ضعیف با استفاده از برنامه های کاربردی برای مطالعات ارتباطی ژنوم

دسترسی سریع

ارتباط

English Website