| Article ID | Journal | Published Year | Pages | File Type |
|---|---|---|---|---|
| 6869261 | Computational Statistics & Data Analysis | 2016 | 11 Pages |
Abstract
Cellwise outliers are likely to occur together with casewise outliers in modern datasets of relatively large dimension. Recent work has shown that traditional robust regression methods may fail when applied to such datasets. We propose a new robust regression procedure to deal with casewise and cellwise outliers. The proposed method, called three-step regression, proceeds as follows: first, it uses a consistent univariate filter, that is, a procedure that flags and eliminates extreme cellwise outliers; second, it applies a robust estimator of multivariate location and scatter to the filtered data to down-weight casewise outliers; third, it computes robust regression coefficients from the estimates obtained in the second step. The three-step estimator is consistent and asymptotically normal at the central model under some assumptions on the tails of the distributions of the continuous covariates. The estimator is extended to handle both continuous and dummy covariates using an iterative algorithm. Extensive simulation results show that the three-step estimator is resilient to cellwise outliers. It also performs well under casewise contamination when compared to traditional high breakdown point estimators.
Keywords
Related Topics
Physical Sciences and Engineering
Computer Science
Computational Theory and Mathematics
Authors
Andy Leung, Hongyang Zhang, Ruben Zamar,
