کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
6150678 1596429 2016 29 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Propensity score and proximity matching using random forest
ترجمه فارسی عنوان
نمره تساهل و تطابق نزدیکی با استفاده از جنگل تصادفی
موضوعات مرتبط
علوم پزشکی و سلامت پزشکی و دندانپزشکی پزشکی و دندانپزشکی (عمومی)
چکیده انگلیسی
In order to derive unbiased inference from observational data, matching methods are often applied to produce balanced treatment and control groups in terms of all background variables. Propensity score has been a key component in this research area. However, propensity score based matching methods in the literature have several limitations, such as model mis-specifications, categorical variables with more than two levels, difficulties in handling missing data, and nonlinear relationships. Random forest, averaging outcomes from many decision trees, is nonparametric in nature, straightforward to use, and capable of solving these issues. More importantly, the precision afforded by random forest (Caruana et al., 2008) may provide us with a more accurate and less model dependent estimate of the propensity score. In addition, the proximity matrix, a by-product of the random forest, may naturally serve as a distance measure between observations that can be used in matching. The proposed random forest based matching methods are applied to data from the National Health and Nutrition Examination Survey (NHANES). Our results show that the proposed methods can produce well balanced treatment and control groups. An illustration is also provided that the methods can effectively deal with missing data in covariates.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Contemporary Clinical Trials - Volume 47, March 2016, Pages 85-92
نویسندگان
, , , ,