کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1147451 1489776 2014 17 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Goodness-of-fit testing-based selection for large-p-small-n problems: A two-stage ranking approach
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات ریاضیات کاربردی
پیش نمایش صفحه اول مقاله
Goodness-of-fit testing-based selection for large-p-small-n problems: A two-stage ranking approach
چکیده انگلیسی


• Two-stage ranking–selection procedures are proposed.
• The procedures are based on goodness-of-fit testing.
• The first-stage ranking takes the correlations of covariates into account.
• A soft threshold cutoff is proposed to select a working model.
• One-step backward screening is proposed to estimate a true model.

In this paper, we investigate two-stage ranking–selection procedures for ultra-high dimensional data in the framework of goodness-of-fit testing. We develop a k-step marginal F-test (MFTk) screening in the first stage. The MFT1 is, as a statistic, equivalent to that used in the sure independence screening (SIS) proposed by Fan and Lv (2008). The MFTk with k≥2k≥2 makes improvement over the MFT1 mainly on better handling correlations among predictors. For selecting a more parsimonious working model in the first stage, we propose a soft threshold cutoff through a sequential goodness-of-fit testing. This avoids some drawbacks of the hard threshold cutoff in Fan and Lv (2008) and the extended BIC used in Wang (2009). In the second stage, we develop one-step backward screening to further remove those insignificant predictors from the model. Further, likewise as the iterative SIS, we provide the iterative versions of the proposed procedures to have more accurate variable selection. Extensive numerical studies and real data analysis are carried out to examine the performance of our proposed procedures.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Statistical Planning and Inference - Volume 145, February 2014, Pages 148–164
نویسندگان
, , ,