Article ID | Journal | Published Year | Pages | File Type |
---|---|---|---|---|
10918825 | Radiotherapy and Oncology | 2012 | 7 Pages |
Abstract
Despite considerable spread around the optimal number of selected variables, the bootstrapping method is efficient and accurate for sufficiently large data sets, and guards against overfitting for all simulated cases with the exception of some data sets with a particularly low number of events. An appropriate minimum data set size to obtain a model with high predictive power is approximately 200 patients and more than 32 events. With fewer data samples the true predictive power decreases rapidly, and for larger data set sizes the benefit levels off toward an asymptotic maximum predictive power.
Related Topics
Life Sciences
Biochemistry, Genetics and Molecular Biology
Cancer Research
Authors
Arjen van der Schaaf, Cheng-Jian Xu, Peter van Luijk, Aart A. van't Veld, Johannes A. Langendijk, Cornelis Schilstra,