کد مقاله | کد نشریه | سال انتشار | مقاله انگلیسی | نسخه تمام متن |
---|---|---|---|---|
1148080 | 1489757 | 2015 | 15 صفحه PDF | دانلود رایگان |
• Joint factor and cluster analysis of questionnaires with multiple categorical responses.
• Joint inference on the number of factors and clustering of subjects.
• Clustering borrows strength across subjects, improving estimation of the model parameters.
• We employ Markov chain Monte Carlo techniques, including sampling of missing data.
• Application to educational datasets and uncover hidden relationships between questions and educational concepts.
We develop a modeling framework for joint factor and cluster analysis of datasets where multiple categorical response items are collected on a heterogeneous population of individuals. We introduce a latent factor multinomial probit model and employ prior constructions that allow inference on the number of factors as well as clustering of the subjects into homogeneous groups according to their relevant factors. Clustering, in particular, allows us to borrow strength across subjects, therefore helping in the estimation of the model parameters, particularly when the number of observations is small. We employ Markov chain Monte Carlo techniques and obtain tractable posterior inference for our objectives, including sampling of missing data. We demonstrate the effectiveness of our method on simulated data. We also analyze two real-world educational datasets and show that our method outperforms state-of-the-art methods. In the analysis of the real-world data, we uncover hidden relationships between the questions and the underlying educational concepts, while simultaneously partitioning the students into groups of similar educational mastery.
Journal: Journal of Statistical Planning and Inference - Volume 166, November 2015, Pages 52–66