کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
1150285 | 957921 | 2006 | 16 صفحه PDF | دانلود رایگان |
This paper connects consistent variable selection with multiple hypotheses testing procedures in the linear regression model Y=Xβ+ɛY=Xβ+ɛ, where the dimension pp of the parameter ββ is allowed to grow with the sample size nn. We view the variable selection problem as one of estimating the index set I0⊆{1,…,p}I0⊆{1,…,p} of the non-zero components of β∈Rpβ∈Rp. Estimation of I0I0 can be further reformulated in terms of testing the hypotheses β1=0,…,βp=0β1=0,…,βp=0. We study here testing via the false discovery rate (FDR) and Bonferroni methods. We show that the set I^⊆{1,…,p} consisting of the indices of rejected hypotheses βi=0βi=0 is a consistent estimator of I0I0, under appropriate conditions on the design matrix XX and the control values used in either procedure. This technique can handle situations where pp is large at a very low computational cost, as no exhaustive search over the space of the 2p2p submodels is required.
Journal: Journal of Statistical Planning and Inference - Volume 136, Issue 12, 1 December 2006, Pages 4349–4364