Classification for high-throughput data with an optimal subset of principal components

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
15248	1396	2009	6 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

canonical variate analysis - تجزیه و تحلیل عددی کانونی Principal component analysis - تحلیل مولفه‌های اصلی یا PCA High-throughput data - داده های با توان بالا Classification - طبقه بندی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی شیمی بیو مهندسی (مهندسی زیستی)

پیش نمایش صفحه اول مقاله

Classification for high-throughput data with an optimal subset of principal components

چکیده انگلیسی

High-throughput data have been widely used in biological and medical studies to discover gene and protein functions. Due to the high dimensionality, principal component analysis (PCA) is often involved for data dimension reduction. However, when a few principal components (PCs) are selected for dimension reduction or considered for dimension determination, they are typically ranked by their variances, eigenvalues. However, this approach is not always effective in subsequent multivariate analysis, particularly classification. To maximize information from data with a subset of the components, we apply a different ranking criterion, canonical variate criterion, which considers within- and between-group variance rather than total variance in the classical criterion. Four prevalent classification methods are considered and compared using leave-one-out cross-validation. These methods are illustrated with three real high-throughput data sets, two microarray data sets and a nuclear magnetic resonance spectra data set.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational Biology and Chemistry - Volume 33, Issue 5, October 2009, Pages 408–413

نویسندگان

Joon Jin Song, Yuan Ren, Fenglan Yan,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Classification for high-throughput data with an optimal subset of principal components

دسترسی سریع

ارتباط

English Website