کد مقاله کد نشریه سال انتشار مقاله انگلیسی نسخه تمام متن
973973 1480110 2016 10 صفحه PDF دانلود رایگان
عنوان انگلیسی مقاله ISI
Flexible sampling large-scale social networks by self-adjustable random walk
ترجمه فارسی عنوان
نمونه برداری انعطاف پذیر شبکه های اجتماعی در مقیاس وسیع با راه رفتن تصادفی قابل تنظیم
کلمات کلیدی
موضوعات مرتبط
مهندسی و علوم پایه ریاضیات فیزیک ریاضی
چکیده انگلیسی


• A sampling method called Self-Adjustable Random Walk (SARW) is proposed.
• The advantage of SARW sampling in comparison with four prevailing methods.
• Mixing both induced-edge and external-edge information for calculating network measures.
• SARW sampling can generate unbiased samples with maximal precision and minimal cost.

Online social networks (OSNs) have become an increasingly attractive gold mine for academic and commercial researchers. However, research on OSNs faces a number of difficult challenges. One bottleneck lies in the massive quantity and often unavailability of OSN population data. Sampling perhaps becomes the only feasible solution to the problems. How to draw samples that can represent the underlying OSNs has remained a formidable task because of a number of conceptual and methodological reasons. Especially, most of the empirically-driven studies on network sampling are confined to simulated data or sub-graph data, which are fundamentally different from real and complete-graph OSNs. In the current study, we propose a flexible sampling method, called Self-Adjustable Random Walk (SARW), and test it against with the population data of a real large-scale OSN. We evaluate the strengths of the sampling method in comparison with four prevailing methods, including uniform, breadth-first search (BFS), random walk (RW), and revised RW (i.e., MHRW) sampling. We try to mix both induced-edge and external-edge information of sampled nodes together in the same sampling process. Our results show that the SARW sampling method has been able to generate unbiased samples of OSNs with maximal precision and minimal cost. The study is helpful for the practice of OSN research by providing a highly needed sampling tools, for the methodological development of large-scale network sampling by comparative evaluations of existing sampling methods, and for the theoretical understanding of human networks by highlighting discrepancies and contradictions between existing knowledge/assumptions of large-scale real OSN data.

ناشر
Database: Elsevier - ScienceDirect (ساینس دایرکت)
Journal: Physica A: Statistical Mechanics and its Applications - Volume 463, 1 December 2016, Pages 356–365
نویسندگان
, ,