کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
382276 660754 2014 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Noise-free principal component analysis: An efficient dimension reduction technique for high dimensional molecular data
ترجمه فارسی عنوان
تجزیه و تحلیل مولفه اصلی بدون سر و صدا: یک روش کاهش اندازه کارایی برای داده های مولکولی با ابعاد بزرگ
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی


• Principal component analysis (PCA) is one of the powerful dimension reduction techniques widely used in data mining field.
• Data usually contaminated by the noise.
• Noise in the data has effect in computation of PC’s components.
• We use regularization method to filter the diffusion of the noise in PC’s.
• Experimental results shows the power of the new approach.

Principal component analysis (PCA) is one of the powerful dimension reduction techniques widely used in data mining field. PCA tries to project the data into lower dimensional space while preserving the intrinsic information hidden in the data as much as possible. Disadvantage of PCA is that, extracted principal components (PCs) are linear combination of all features, hence PCs are may still contaminated with noise in the data. To address this problem we propose a modified version of PCA called noise free PCA (NFPCA), in which regularization is introduced during the PCs extraction step to mitigate the effect of noise. Potentials of the proposed method is assessed in two important application of high-dimensional molecular data: classification and survival prediction. Multiple publicly available real-world data sets are used for this illustration. Experimental results show that, the NFPCA produce highly informative than the ordinary PCA method. This is largely due to the fact that the NFPCA suppress the effect of noise in the PCs more efficiently with minimum information lost. The NFPCA is a promising alternative to existing PCA approaches not only in terms of highly informative PCs, but also its relatively cheap computational cost.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Expert Systems with Applications - Volume 41, Issue 17, 1 December 2014, Pages 7797–7804
نویسندگان
, ,