کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
5121824 1486844 2017 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Predicting data saturation in qualitative surveys with mathematical models from ecological research
ترجمه فارسی عنوان
پیش بینی اشباع داده ها در نظرسنجی های کیفی با استفاده از مدل های ریاضی از تحقیقات بوم شناختی
کلمات کلیدی
اندازهی نمونه، تحقیق کیفی، اشباع داده، سوالات باز نظرسنجی ها و پرسشنامه ها، پرسشنامه های مبتنی بر وب،
موضوعات مرتبط
علوم پزشکی و سلامت پزشکی و دندانپزشکی سیاست های بهداشت و سلامت عمومی
چکیده انگلیسی

ObjectiveSample size in surveys with open-ended questions relies on the principle of data saturation. Determining the point of data saturation is complex because researchers have information on only what they have found. The decision to stop data collection is solely dictated by the judgment and experience of researchers. In this article, we present how mathematical modeling may be used to describe and extrapolate the accumulation of themes during a study to help researchers determine the point of data saturation.Study Design and SettingThe model considers a latent distribution of the probability of elicitation of all themes and infers the accumulation of themes as arising from a mixture of zero-truncated binomial distributions. We illustrate how the model could be used with data from a survey with open-ended questions on the burden of treatment involving 1,053 participants from 34 different countries and with various conditions. The performance of the model in predicting the number of themes to be found with the inclusion of new participants was investigated by Monte Carlo simulations. Then, we tested how the slope of the expected theme accumulation curve could be used as a stopping criterion for data collection in surveys with open-ended questions.ResultsBy doubling the sample size after the inclusion of initial samples of 25 to 200 participants, the model reliably predicted the number of themes to be found. Mean estimation error ranged from 3% to 1% with simulated data and was <2% with data from the study of the burden of treatment. Sequentially calculating the slope of the expected theme accumulation curve for every five new participants included was a feasible approach to balance the benefits of including these new participants in the study. In our simulations, a stopping criterion based on a value of 0.05 for this slope allowed for identifying 97.5% of the themes while limiting the inclusion of participants eliciting nothing new in the study.ConclusionMathematical models adapted from ecological research can accurately predict the point of data saturation in surveys with open-ended questions.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Clinical Epidemiology - Volume 82, February 2017, Pages 71-78.e2
نویسندگان
, , , ,