Article ID Journal Published Year Pages File Type
6869261 Computational Statistics & Data Analysis 2016 11 Pages PDF
Abstract
Cellwise outliers are likely to occur together with casewise outliers in modern datasets of relatively large dimension. Recent work has shown that traditional robust regression methods may fail when applied to such datasets. We propose a new robust regression procedure to deal with casewise and cellwise outliers. The proposed method, called three-step regression, proceeds as follows: first, it uses a consistent univariate filter, that is, a procedure that flags and eliminates extreme cellwise outliers; second, it applies a robust estimator of multivariate location and scatter to the filtered data to down-weight casewise outliers; third, it computes robust regression coefficients from the estimates obtained in the second step. The three-step estimator is consistent and asymptotically normal at the central model under some assumptions on the tails of the distributions of the continuous covariates. The estimator is extended to handle both continuous and dummy covariates using an iterative algorithm. Extensive simulation results show that the three-step estimator is resilient to cellwise outliers. It also performs well under casewise contamination when compared to traditional high breakdown point estimators.
Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics
Authors
, , ,