کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
4944565 1438001 2017 37 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Towards big topic modeling
ترجمه فارسی عنوان
به سوی مدل سازی بزرگ موضوع
کلمات کلیدی
مدل سازی موضوع بزرگ، تخصیص صندوق قرض الحسنه، پیچیدگی ارتباطی، معماری چند پردازنده، پخش آنلاین اعتقاد، قانون قدرت،
موضوعات مرتبط
مهندسی و علوم پایه مهندسی کامپیوتر هوش مصنوعی
چکیده انگلیسی
To solve the big topic modeling problem, we need to reduce both the time and space complexities of batch latent Dirichlet allocation (LDA) algorithms. Although parallel LDA algorithms on multi-processor architectures have low time and space complexities, their communication costs among processors often scale linearly with the vocabulary size and the number of topics, leading to a serious scalability problem. To reduce the communication complexity among processors to achieve improved scalability, we propose a novel communication-efficient parallel topic modeling architecture based on a power law, which consumes orders of magnitude less communication time when the number of topics is large. We combine the proposed communication-efficient parallel architecture with the online belief propagation (OBP) algorithm, referred to as POBP, for big topic modeling tasks. Extensive empirical results confirm that POBP has the following advantages for solving the big topic modeling problem when compared with recent state-of-the-art parallel LDA algorithms on multi-processor architectures: (1) high accuracy, (2) high communication efficiency, (3) high speed, and (4) constant memory usage.
ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Information Sciences - Volume 390, June 2017, Pages 15-31
نویسندگان
, , , , ,