کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
974836 1480135 2015 15 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
An online expectation maximization algorithm for exploring general structure in massive networks
ترجمه فارسی عنوان
الگوریتم به حداکثر رساندن انتظارات آنلاین برای بررسی ساختار کلی در شبکه های عظیم
کلمات کلیدی
شبکه های پیچیده اکتشاف ساختار عمومی، الگوریتم به حداکثر رساندن انتظارات آنلاین
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات فیزیک ریاضی
چکیده انگلیسی


• An efficiency online variational EM algorithm (noted as onlineVEM) was designed.
• The objective function of the onlineVEM was proved to be additive.
• Model parameters were estimated by differentiating the objective function.
• The onlineVEM was tested to be robust, accurate and efficient.

Mixture model and stochastic block model (SBM) for structure discovery employ a broad and flexible definition of vertex classes such that they are able to explore a wide variety of structure. Compared to the existing algorithms based on the SBM (their time complexities are O(mc2)O(mc2), where mm and cc are the number of edges and clusters), the algorithms of mixture model are capable of dealing with networks with a large number of communities more efficiently due to their O(mc)O(mc) time complexity. However, the algorithms of mixture model using expectation maximization (EM) technique are still too slow to deal with real million-node networks, since they compute hidden variables on the entire network in each iteration. In this paper, an online variational EM algorithm is designed to improve the efficiency of the EM algorithms. In each iteration, our online algorithm samples a node and estimates its cluster memberships only by its adjacency links, and model parameters are then estimated by the memberships of the sampled node and old model parameters obtained in the previous iteration. The provided online algorithm updates model parameters subsequently by the links of a new sampled node and explores the general structure of massive and growing networks with millions of nodes and hundreds of clusters in hours. Compared to the relevant algorithms on synthetic and real networks, the proposed online algorithm costs less with little or no degradation of accuracy. Results illustrate that the presented algorithm offers a good trade-off between precision and efficiency.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Physica A: Statistical Mechanics and its Applications - Volume 438, 15 November 2015, Pages 454–468
نویسندگان
, , ,