Article ID Journal Published Year Pages File Type
416507 Computational Statistics & Data Analysis 2012 8 Pages PDF
Abstract

This paper proposes a new method of estimating extreme quantiles of heavy-tailed distributions for massive data. The method utilizes the Peak Over Threshold (POT) method with generalized Pareto distribution (GPD) that is commonly used to estimate extreme quantiles and the parameter estimation of GPD using the empirical distribution function (EDF) and nonlinear least squares (NLS). We first estimate the parameters of GPD using EDF and NLS and then, estimate multiple high quantiles for massive data based on observations over a certain threshold value using the conventional POT. The simulation results demonstrate that our parameter estimation method has a smaller Mean square error (MSE) than other common methods when the shape parameter of GPD is at least 0. The estimated quantiles also show the best performance in terms of root MSE (RMSE) and absolute relative bias (ARB) for heavy-tailed distributions.

► We propose a new parameter estimation method for GPD using EDF and NLS. ► We estimate multiple high quantiles for massive data with this new method. ► In estimating quantiles, we use the conventional POT method. ► Our method gives the smallest MSE for several cases of GPD. ► It shows the best performance in high quantiles for heavy-tailed distributions.

Related Topics
Physical Sciences and Engineering Computer Science Computational Theory and Mathematics
Authors
, ,