کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
418050 681605 2008 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
On the equivalence between Non-negative Matrix Factorization and Probabilistic Latent Semantic Indexing
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر نظریه محاسباتی و ریاضیات
پیش نمایش صفحه اول مقاله
On the equivalence between Non-negative Matrix Factorization and Probabilistic Latent Semantic Indexing
چکیده انگلیسی

Non-negative Matrix Factorization (NMF) and Probabilistic Latent Semantic Indexing (PLSI) have been successfully applied to document clustering recently. In this paper, we show that PLSI and NMF (with the I-divergence objective function) optimize the same objective function, although PLSI and NMF are different algorithms as verified by experiments. This provides a theoretical basis for a new hybrid method that runs PLSI and NMF alternatively, each jumping out of the local minima of the other method successively, thus achieving a better final solution. Extensive experiments on five real-life datasets show relations between NMF and PLSI, and indicate that the hybrid method leads to significant improvements over NMF-only or PLSI-only methods. We also show that at first-order approximation, NMF is identical to the χ2χ2-statistic.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Computational Statistics & Data Analysis - Volume 52, Issue 8, 15 April 2008, Pages 3913–3927
نویسندگان
, , ,