کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
536585 870563 2010 9 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
A document clustering algorithm for discovering and describing topics
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
A document clustering algorithm for discovering and describing topics
چکیده انگلیسی

In this paper, we introduce a new clustering algorithm for discovering and describing the topics comprised in a text collection. Our proposal relies on both the most probable term pairs generated from the collection and the estimation of the topic homogeneity associated to these pairs. Topics and their descriptions are generated from those term pairs whose support sets are homogeneous enough for representing collection topics. Experimental results obtained over three benchmark text collections demonstrate the effectiveness and utility of this new approach.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 31, Issue 6, 15 April 2010, Pages 502–510
نویسندگان
, , ,