کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
1082873 950971 2011 8 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Performance of logistic regression modeling: beyond the number of events per variable, the role of data structure
موضوعات مرتبط
علوم پزشکی و سلامت پزشکی و دندانپزشکی سیاست های بهداشت و سلامت عمومی
پیش نمایش صفحه اول مقاله
Performance of logistic regression modeling: beyond the number of events per variable, the role of data structure
چکیده انگلیسی

ObjectiveLogistic regression is commonly used in health research, and it is important to be sure that the parameter estimates can be trusted. A common problem occurs when the outcome has few events; in such a case, parameter estimates may be biased or unreliable. This study examined the relation between correctness of estimation and several data characteristics: number of events per variable (EPV), number of predictors, percentage of predictors that are highly correlated, percentage of predictors that were non-null, size of regression coefficients, and size of correlations.Study DesignSimulation studies.ResultsIn many situations, logistic regression modeling may pose substantial problems even if the number of EPV exceeds 10. Moreover, the number of EPV is not the only element that impacts on the correctness of parameter estimation. High regression coefficients and high correlations between the predictors may cause large problems in the estimation process. Finally, power is generally very low, even at 20 EPV.ConclusionThere is no single rule based on EPV that would guarantee an accurate estimation of logistic regression parameters. Instead, the number of predictors, probable size of the regression coefficients based on previous literature, and correlations among the predictors must be taken into account as guidelines to determine the necessary sample size.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Clinical Epidemiology - Volume 64, Issue 9, September 2011, Pages 993–1000
نویسندگان
, , , , ,