دانلود رایگان مقاله: طراحی و ارزیابی یک الگوریتم موازی برای استنتاج سلسله مراتب موضوع

کد مقاله	کد نشریه	سال انتشار	مقاله انگلیسی	نسخه تمام متن
515357	866998	2015	15 صفحه PDF	دانلود رایگان

عنوان انگلیسی مقاله ISI

Design and evaluation of a parallel algorithm for inferring topic hierarchies

ترجمه فارسی عنوان

طراحی و ارزیابی یک الگوریتم موازی برای استنتاج سلسله مراتب موضوع

دانلود مقاله + سفارش ترجمه

دانلود مقاله ISI انگلیسی

رایگان برای ایرانیان

کلمات کلیدی

مدل سازی موضوع؛ خوشه بندی سلسله مراتبی؛ بازیابی اطلاعات؛ الگوریتم موازی؛ محاسبات خوشه ای؛ رابط عبور پیام

Parallel algorithm - الگوریتم‌های موازی Information retrieval - بازیابی اطلاعات Hierarchical clustering - خوشه بندی سلسله مراتبی Message passing interface - رابط عبور پیام Cluster computing - محاسبات خوشه ای Topic modeling - مدل سازی موضوع

موضوعات مرتبط

مهندسی و علوم پایه مهندسی کامپیوتر نرم افزارهای علوم کامپیوتر

پیش نمایش مقاله

طراحی و ارزیابی یک الگوریتم موازی برای استنتاج سلسله مراتب موضوع

چکیده انگلیسی

• We propose a novel parallel Algorithm for inferring topic hierarchies using HLDA.
• We use loosely-coupled parallel tasks that do not require frequent synchronization.
• The parallel Algorithm is well-suited to be run on distributed computing systems.
• The proposed Algorithm achieves a predictive accuracy on par with that of HLDA.
• The parallel Algorithm exhibits a near-linear speed-up and scales well.

The rapid growth of information in the digital world especially on the web, calls for automated methods of organizing the digital information for convenient access and efficient information retrieval. Topic modeling is a branch of machine learning and probabilistic graphical modeling that helps in arranging the web pages according to their topical structure. The topic distribution over a set of documents (web pages) and the affinity of a document toward a specific topic can be revealed using topic modeling. Topic modeling algorithms are typically computationally expensive due to their iterative nature. Recent research efforts have attempted to parallelize specific topic models and are successful in their attempts. These parallel algorithms however have tightly-coupled parallel processes which require frequent synchronization and are also tightly coupled with the underlying topic model which is used for inferring the topic hierarchy. In this paper, we propose a parallel algorithm to infer topic hierarchies from a large scale document corpus. A key feature of the proposed algorithm is that it exploits coarse grained parallelism and the components running in parallel need not synchronize after every iteration, thus the algorithm lends itself to be implemented on a geographically dispersed set of processing elements interconnected through a network. The parallel algorithm realizes a speed up of 53.5 on a 32-node cluster of dual-core workstations and at the same time achieving approximately the same likelihood or predictive accuracy as that of the sequential algorithm, with respect to the performance of Information Retrieval tasks.

ناشر

Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Processing & Management - Volume 51, Issue 5, September 2015, Pages 662–676

نویسندگان

Karthick Seshadri, S Mercy Shalinie, Chidambaram Kollengode,

علوم انسانی و هنر

فنی، مهندسی و علوم پایه

پزشکی و سلامت

بیو تکنولوژی

پذیرش سفارش ترجمه

دانلود رایگان مقاله ISI : طراحی و ارزیابی یک الگوریتم موازی برای استنتاج سلسله مراتب موضوع

دسترسی سریع

ارتباط

English Website