Exact and Monte Carlo calculations of integrated likelihoods for the latent class model

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
1149083	957862	2010	12 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

EM algorithm - الگوریتم EM Bayesian model selection - انتخاب مدل بیزی Categorical data - داده های طبقه بندی شده Gibbs sampler - نمونه گیبس Importance sampling - نمونه گیری نقاط مهم

موضوعات مرتبط

مهندسی و علوم پایه ریاضیات ریاضیات کاربردی

پیش نمایش صفحه اول مقاله

Exact and Monte Carlo calculations of integrated likelihoods for the latent class model

چکیده انگلیسی

The latent class model or multivariate multinomial mixture is a powerful approach for clustering categorical data. It uses a conditional independence assumption given the latent class to which a statistical unit is belonging. In this paper, we exploit the fact that a fully Bayesian analysis with Jeffreys non-informative prior distributions does not involve technical difficulty to propose an exact expression of the integrated complete-data likelihood, which is known as being a meaningful model selection criterion in a clustering perspective. Similarly, a Monte Carlo approximation of the integrated observed-data likelihood can be obtained in two steps: an exact integration over the parameters is followed by an approximation of the sum over all possible partitions through an importance sampling strategy. Then, the exact and the approximate criteria experimentally compete, respectively, with their standard asymptotic BIC approximations for choosing the number of mixture components. Numerical experiments on simulated data and a biological example highlight that asymptotic criteria are usually dramatically more conservative than the non-asymptotic presented criteria, not only for moderate sample sizes as expected but also for quite large sample sizes. This research highlights that asymptotic standard criteria could often fail to select some interesting structures present in the data.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Journal of Statistical Planning and Inference - Volume 140, Issue 11, November 2010, Pages 2991–3002

نویسندگان

C. Biernacki, G. Celeux, G. Govaert,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

Exact and Monte Carlo calculations of integrated likelihoods for the latent class model

دسترسی سریع

ارتباط

English Website