کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
534895 870302 2009 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An online document clustering technique for short web contents
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر چشم انداز کامپیوتر و تشخیص الگو
پیش نمایش صفحه اول مقاله
An online document clustering technique for short web contents
چکیده انگلیسی

Document clustering techniques have been applied in several areas, with the web as one of the most recent and influential. Both general-purpose and text-oriented techniques exist and can be used to cluster a collection of documents in many ways. This work proposes a novel heuristic online document clustering model that can be specialized with a variety of text-oriented similarity measures. An experimental evaluation of the proposed model was conducted in the e-commerce domain. Performances were measured using a clustering-oriented metric based on F-Measure and compared with those obtained by other well-known approaches. The obtained results confirm the validity of the proposed method both for batch scenarios and online scenarios where document collections can grow over time.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Pattern Recognition Letters - Volume 30, Issue 10, 15 July 2009, Pages 870–876
نویسندگان
, , ,