Article code | Journal code | Publication year | English article | Full-text version
415450 | 681208 | 2008 | 18-page PDF | Free download
English title of the ISI article
Outlier identification in high dimensions
Related topics
Engineering and Basic Sciences; Computer Engineering; Computational Theory and Mathematics
Abstract

A computationally fast procedure for identifying outliers is presented that is particularly effective in high dimensions. This algorithm utilizes simple properties of principal components to identify outliers in the transformed space, leading to significant computational advantages for high-dimensional data. This approach requires considerably less computational time than existing methods for outlier detection, and is suitable for use on very large data sets. It is also capable of analyzing the data situation commonly found in certain biological applications in which the number of dimensions is several orders of magnitude larger than the number of observations. The performance of this method is illustrated on real and simulated data with dimension ranging in the thousands.
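
The abstract's general idea, looking for outliers in principal-component space rather than in the original variables, can be illustrated with a short sketch. This is not the authors' published procedure: the function names (`mad`, `pca_outlier_flags`), the 99% variance threshold, and the "median plus 3 MAD" cutoff are all assumptions chosen for illustration. It only shows why the approach stays cheap when the number of dimensions far exceeds the number of observations: a thin SVD yields at most as many components as there are observations.

```python
import numpy as np


def mad(a, axis=0):
    """Median absolute deviation, scaled to match the std. dev. under normality."""
    med = np.median(a, axis=axis, keepdims=True)
    return 1.4826 * np.median(np.abs(a - med), axis=axis)


def _safe(scale):
    """Replace zero scales by 1 so constant columns do not cause division by zero."""
    scale = np.asarray(scale, dtype=float).copy()
    scale[scale == 0] = 1.0
    return scale


def pca_outlier_flags(X, var_kept=0.99, k=3.0):
    """Flag rows of X that lie far from the bulk in principal-component space.

    Illustrative only; var_kept and k are assumed tuning values, not taken
    from the paper.
    """
    X = np.asarray(X, dtype=float)
    # Robustly center and scale each variable (median and MAD).
    Z = (X - np.median(X, axis=0)) / _safe(mad(X, axis=0))
    # Thin SVD of the centered data: at most min(n, p) components.
    Zc = Z - Z.mean(axis=0)
    U, s, _ = np.linalg.svd(Zc, full_matrices=False)
    s = s[s > 1e-10 * s[0]]  # drop numerically null directions
    explained = np.cumsum(s**2) / np.sum(s**2)
    q = min(int(np.searchsorted(explained, var_kept)) + 1, len(s))
    scores = U[:, :q] * s[:q]
    # Robustly rescale each component so all contribute on a common scale.
    scores = (scores - np.median(scores, axis=0)) / _safe(mad(scores, axis=0))
    dist = np.linalg.norm(scores, axis=1)
    # Assumed cutoff rule: median distance plus k robust standard deviations.
    cutoff = np.median(dist) + k * mad(dist)
    return dist > cutoff, dist


# Usage: 60 observations in 500 dimensions, first 5 rows shifted.
rng = np.random.default_rng(0)
X = rng.standard_normal((60, 500))
X[:5] += 5.0
flags, dist = pca_outlier_flags(X)
print("flagged rows:", np.flatnonzero(flags))
```

Because the decomposition works on an n-by-p matrix with n much smaller than p, the number of components is bounded by the number of observations, which keeps the distance computation small even for thousands of variables.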

Publisher
Database: Elsevier - ScienceDirect
Journal: Computational Statistics & Data Analysis - Volume 52, Issue 3, 1 January 2008, Pages 1694–1711