کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
862695 1470796 2012 7 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Performance Evaluation of Semantic Based and Ontology Based Text Document Clustering Techniques
موضوعات مرتبط
مهندسی و علوم پایه سایر رشته های مهندسی مهندسی (عمومی)
پیش نمایش صفحه اول مقاله
Performance Evaluation of Semantic Based and Ontology Based Text Document Clustering Techniques
چکیده انگلیسی

The amount of digital information is created and used is steadily growing along with the development of sophisticated hardware and software. This has increased the need for powerful algorithms that can interpret and extract interesting knowledge from these data. Data mining is a technique that has been successfully exploited for this purpose. Text mining, a category of data mining, considers only digital documents or text. Text Clustering is the process of grouping text or documents such that the document in the same cluster are similar and are dissimilar from the one in other clusters. This paper studies the working of two sophisticated algorithms. The first work is a hybrid method that combines pattern recognition process with semantic driven methods for clustering documents, while the second uses an ontology-based approach to cluster documents. Through experiments, the performance of both the selected algorithms is analyzed in terms of clustering efficiency and speed of clustering.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Procedia Engineering - Volume 30, 2012, Pages 100-106