Article code: 6338832
Journal code: 1620369
Publication year: 2015
English article: 9-page PDF
Full text: free download
English title of the ISI article
Imputation of missing data in time series for air pollutants
Related topics
Engineering and Basic Sciences > Earth and Planetary Sciences > Atmospheric Science
Abstract


- We propose a method for imputation of missing values in time series.
- Simulations showed adequate goodness-of-fit.
- The findings also suggest good accuracy and precision.
- We implemented the method as an open source R library.

Missing data are a major concern in epidemiological studies of the health effects of environmental air pollutants. This article presents an imputation-based method suitable for multivariate time series data that uses the EM algorithm under the assumption of a normal distribution. Different approaches are considered for filtering the temporal component. A simulation study was performed to assess the validity and performance of the proposed method in comparison with some frequently used methods. Simulations showed that when the amount of missing data was as low as 5%, the complete data analysis yielded satisfactory results regardless of the generating mechanism of the missing data, whereas the validity began to degenerate when the proportion of missing values exceeded 10%. The proposed imputation method exhibited good accuracy and precision in different settings with respect to the patterns of missing observations. Most of the imputations yielded valid results, even when data were missing not at random. The methods proposed in this study are implemented as a package called mtsdi for the statistical software system R.
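
To make the idea of EM-based imputation under normality concrete, the following is a minimal sketch in Python. It is not the mtsdi implementation and does not reproduce the paper's method: it assumes only two series, treats the pairs as independent bivariate normal draws (so the temporal filtering step mentioned in the abstract is omitted), and assumes that only the second series has gaps. The function name `em_impute_bivariate` and the simulated data are hypothetical, introduced purely for illustration.

```python
import random


def em_impute_bivariate(x, y, n_iter=200, tol=1e-10):
    """Fill missing y values (None) by EM, assuming (x, y) is bivariate normal.

    Illustrative assumption: x is fully observed and rows are independent.
    """
    n = len(x)
    observed = [v for v in y if v is not None]
    n_miss = n - len(observed)
    # x is complete, so its mean and variance come straight from the data.
    mu_x = sum(x) / n
    s_xx = sum((v - mu_x) ** 2 for v in x) / n
    # Start from the observed-case moments of y and zero covariance.
    mu_y = sum(observed) / len(observed)
    s_yy = sum((v - mu_y) ** 2 for v in observed) / len(observed)
    s_xy = 0.0
    for _ in range(n_iter):
        # E-step: conditional mean and variance of each missing y given x.
        beta = s_xy / s_xx
        resid_var = s_yy - beta * s_xy
        y_hat = [yi if yi is not None else mu_y + beta * (xi - mu_x)
                 for xi, yi in zip(x, y)]
        # M-step: update the moments from the expected sufficient statistics.
        new_mu_y = sum(y_hat) / n
        new_s_xy = sum((xi - mu_x) * (yi - new_mu_y)
                       for xi, yi in zip(x, y_hat)) / n
        new_s_yy = (sum((yi - new_mu_y) ** 2 for yi in y_hat)
                    + n_miss * resid_var) / n
        change = (abs(new_mu_y - mu_y) + abs(new_s_xy - s_xy)
                  + abs(new_s_yy - s_yy))
        mu_y, s_xy, s_yy = new_mu_y, new_s_xy, new_s_yy
        if change < tol:
            break
    # Final imputation uses the conditional mean under the fitted parameters.
    beta = s_xy / s_xx
    filled = [yi if yi is not None else mu_y + beta * (xi - mu_x)
              for xi, yi in zip(x, y)]
    return filled, (mu_x, mu_y, s_xx, s_xy, s_yy)


# Simulated example (hypothetical data): y depends linearly on x plus noise,
# and roughly 20% of y is removed completely at random.
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(500)]
y_true = [2.0 + 1.5 * xi + rng.gauss(0.0, 0.5) for xi in x]
y_missing = [None if rng.random() < 0.2 else yi for yi in y_true]
filled, params = em_impute_bivariate(x, y_missing)
```

Observed values pass through unchanged, and each gap is replaced by its conditional expectation given the other series. A full treatment in the spirit of the abstract would extend this to more than two series, arbitrary missingness patterns, and a model for the temporal component.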

Publisher
Database: Elsevier - ScienceDirect
Journal: Atmospheric Environment - Volume 102, February 2015, Pages 96-104