Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
802242 | Probabilistic Engineering Mechanics | 2012 | 12 Pages |
•The associated uncertainty of the parameters and residuals is taken into account.•The proposed method requires no information on the outlier distribution model.•The proposed approach provides the probability of outlier.•The size of the dataset is incorporated to determine the probability of outlier.
Outliers are one of the main concerns in statistics. Parametric identification results of ordinary least squares are sensitive to outliers. Many robust estimators have been proposed to overcome this problem but there are still some drawbacks in existing methods. In this paper, a novel probabilistic method is proposed for robust parametric identification and outlier detection in linear regression problems. The crux of this method is to calculate the probability of outlier, which quantifies how probable it is that a data point is an outlier. There are several appealing features of the proposed method. First, not only the optimal values of the parameters and residuals but also the associated uncertainties are taken into account for outlier detection. Second, the size of the dataset is incorporated because it is one of the key variables to determine the probability of obtaining a large-residual data point. Third, the proposed method requires no information on the outlier distribution model. Fourth, the proposed approach provides the probability of outlier. In the illustrative examples, the proposed method is compared with three well-known methods. It turns out that the proposed method is substantially superior and it is capable of robust parametric identification and outlier detection even for very challenging situations.