دانلود رایگان مقاله: سریع و موثر بازیابی اطلاعات مبتنی بر خوشه با استفاده از بسته های مکرر بسته

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
6856406	1437956	2018	32 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Fast and effective cluster-based information retrieval using frequent closed itemsets

ترجمه فارسی عنوان

سریع و موثر بازیابی اطلاعات مبتنی بر خوشه با استفاده از بسته های مکرر بسته

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

بازیابی اطلاعات سند، داده کاوی، مجموعه های بزرگ، رویکردهای مبتنی بر خوشه ای، استخراج مزارع مکرر،

Frequent itemset mining - استخراج مزارع مکرر Data mining - داده‌کاوی

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی

پیش نمایش مقاله

سریع و موثر بازیابی اطلاعات مبتنی بر خوشه با استفاده از بسته های مکرر بسته

چکیده انگلیسی

Document Information retrieval consists of finding the documents in a collection of documents that are the most relevant to a user query. Information retrieval techniques are widely-used by organizations to facilitate the search for information. However, applying traditional information retrieval techniques is time consuming for large document collections. Recently, cluster-based information retrieval approaches have been developed. Although these approaches are often much faster than traditional approaches for processing large document collections, the quality of the documents retrieved by cluster-based approaches is often less than that of traditional approaches. To address this drawback of cluster-based approaches, and improve the performance of information retrieval both in terms of runtime and quality of retrieved documents, this paper proposes a new cluster-based information retrieval approach named ICIR (Intelligent Cluster-based Information Retrieval). The proposed approach combines k-means clustering with frequent closed itemset mining to extract clusters of documents and find frequent terms in each cluster. Patterns discovered in each cluster are then used to select the most relevant document clusters to answer each user query. Four alternative heuristics are proposed to select the most relevant clusters, and two alternative heuristics for choosing documents in the selected clusters. Thus, eight versions of the proposed approach are obtained. To validate the proposed approach, extensive experiments have been carried out on well-known document collections. Results show that the designed approach outperforms traditional and cluster-based information retrieval approaches both in terms of execution time and quality of the returned documents.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 453, July 2018, Pages 154-167

نویسندگان

Youcef Djenouri, Asma Belhadi, Philippe Fournier-Viger, Jerry Chun-Wei Lin,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : سریع و موثر بازیابی اطلاعات مبتنی بر خوشه با استفاده از بسته های مکرر بسته

دسترسی سریع

ارتباط

English Website