Sig-SR: SimRank Search over Singular Graphs
File(s)Sig-SR.pdf (161.63 KB)
Accepted version
Author(s)
Yu, W
McCann, J
Type
Conference Paper
Abstract
SimRank is an attractive structural-context measure of similarity
between two objects in a graph. It recursively follows
the intuition that “two objects are similar if they are referenced
by similar objects”. The best known matrix-based
method [1] for calculating SimRank, however, implies an
assumption that the graph is non-singular, i.e., its adjacency
matrix is invertible. In reality, non-singular graphs are very
rare; such an assumption in [1] is too restrictive in practice.
In this paper, we provide a treatment of [1], by supporting
similarity assessment on non-invertible adjacency matrices.
Assume that a singular graph G has n nodes, with r (< n)
being the rank of its adjacency matrix. (1) We show that
SimRank matrix S on G has an elegant structure: S can be
represented as a rank r matrix plus a scaled identity matrix.
(2) By virtue of this, an efficient algorithm over singular
graphs, Sig-SR, is proposed for calculating all-pairs SimRank
in O(r(n
2 + Kr2
)) time for K iterations. In contrast, the
only known matrix-based algorithm that supports singular
graphs [2] needs O(r
4n
2
) time. The experimental results on
real and synthetic datasets demonstrate the superiority of
Sig-SR on singular graphs against its baselines.
between two objects in a graph. It recursively follows
the intuition that “two objects are similar if they are referenced
by similar objects”. The best known matrix-based
method [1] for calculating SimRank, however, implies an
assumption that the graph is non-singular, i.e., its adjacency
matrix is invertible. In reality, non-singular graphs are very
rare; such an assumption in [1] is too restrictive in practice.
In this paper, we provide a treatment of [1], by supporting
similarity assessment on non-invertible adjacency matrices.
Assume that a singular graph G has n nodes, with r (< n)
being the rank of its adjacency matrix. (1) We show that
SimRank matrix S on G has an elegant structure: S can be
represented as a rank r matrix plus a scaled identity matrix.
(2) By virtue of this, an efficient algorithm over singular
graphs, Sig-SR, is proposed for calculating all-pairs SimRank
in O(r(n
2 + Kr2
)) time for K iterations. In contrast, the
only known matrix-based algorithm that supports singular
graphs [2] needs O(r
4n
2
) time. The experimental results on
real and synthetic datasets demonstrate the superiority of
Sig-SR on singular graphs against its baselines.
Date Issued
2014-07-06
Date Acceptance
2014-06-03
Citation
SIGIR '14 Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, 2014, pp.859-862
ISBN
978-1-4503-2257-7
Publisher
Association for Computing Machinery
Start Page
859
End Page
862
Journal / Book Title
SIGIR '14 Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval
Copyright Statement
© ACM, 2014. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in SIGIR '14 Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval, 11 Jul 2014. http://doi.acm.org/10.1145/2600428.2609459
Source
The 37th Annual ACM SIGIR Conference
Publication Status
Published
Start Date
2014-07-06
Finish Date
2014-07-11
Coverage Spatial
Gold Coast, Australia