Article ID Journal Published Year Pages File Type
4944262 Information Sciences 2017 44 Pages PDF
Abstract
SimRank is a well-known link-based similarity measure successfully applied in many graph-related applications. Despite of the current success of SimRank, it suffers from the problem caused by its pairwise normalization paradigm in similarity computation. In this paper, we propose JacSim (Jaccard-based SimRank) that solves the pairwise normalization problem in an effective way. JacSim computes the similarity score of a node-pair by combining two different computation manners: Jaccard coefficient and pairwise normalization. We point out two problems of existing measures targeted at solving the pairwise normalization problem and provide effective solutions to them: (1) JacSim eliminates the redundancy hidden in their similarity computation; (2) JacSim enables to control the degree of importance of the two scores obtained by employing Jaccard coefficient and pairwise normalization. In order to take advantage of links weights in similarity computation, we propose a weighted version of JacSim applicable to weighted graphs. Furthermore, to accelerate JacSim, we provide a linear recursive matrix form of JacSim, which is composed of only linear operations. We demonstrate the effectiveness and efficiency of our JacSim by conducting extensive experiments with real-world datasets. The results show that JacSim outperforms existing measures significantly in term of accuracy and also provides better performance than the similarity measures targeted to solve the pairwise normalization problem.
Related Topics
Physical Sciences and Engineering Computer Science Artificial Intelligence
Authors
, ,