Article code: 379008
Journal code: 659250
Publication year: 2010
English article: 14 pages, PDF
Full-text version: free download
English title of the ISI article
Modeling the evolution of associated data
Related topics
Engineering and Basic Sciences > Computer Engineering > Artificial Intelligence
English abstract

Statistical topic models have been proposed for modeling documents and authorship information. However, few previous works have studied the evolution of associated data. In this paper, we investigate how to model trends of changes in document content and author interests simultaneously over time. We propose two models: a bag-of-words based Author–Time–Topic model that extends the state-of-the-art LDA-style topic model and a Hidden Markov Author–Time–Topic model, which can model interdependencies between topics. We use the Gibbs EM algorithm for parameter estimation. We apply these models to two data sets: NIPS papers and Yahoo group posts. Experimental results show that our models can achieve a lower perplexity (− 2.0%–20%) than the baseline LDA and Author–Topic model, when modeling quickly evolving associated data. Experiments also reveal that the proposed models can accurately capture the hot topics in different periods (e.g. “Yao at preseason” in Aug-2004, when the Chinese player Ming Yao became a highlight in the NBA) from the two data sets.
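The abstract compares models by perplexity and reports the gain as a percentage. As a rough illustration of that metric only, and not the authors' evaluation code, the sketch below computes perplexity from per-token log-probabilities and the relative reduction against a baseline. The function names and all probability values are hypothetical and chosen purely for the example.

```python
import math


def perplexity(token_log_probs):
    """Perplexity = exp(-(1/N) * sum of natural-log token probabilities)."""
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_log_probs) / len(token_log_probs))


def relative_reduction(baseline_ppl, model_ppl):
    """Fraction by which model_ppl is lower than baseline_ppl."""
    return (baseline_ppl - model_ppl) / baseline_ppl


# Hypothetical held-out log-probabilities (illustrative values, not from the paper).
baseline_log_probs = [math.log(0.01)] * 4    # e.g. a baseline topic model
proposed_log_probs = [math.log(0.0125)] * 4  # e.g. a time-aware topic model

baseline_ppl = perplexity(baseline_log_probs)
proposed_ppl = perplexity(proposed_log_probs)
reduction = relative_reduction(baseline_ppl, proposed_ppl)

print(f"baseline perplexity: {baseline_ppl:.1f}")
print(f"proposed perplexity: {proposed_ppl:.1f}")
print(f"relative reduction:  {reduction:.1%}")
```

Lower perplexity means the model assigns higher probability to held-out text, so a positive relative reduction favors the proposed model.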

Publisher
Database: Elsevier - ScienceDirect
Journal: Data & Knowledge Engineering - Volume 69, Issue 9, September 2010, Pages 965–978