On the Efficiency of Estimating Penetrating Rank on Large Graphs
File(s)P-Rank.pdf (430.3 KB)
Accepted version
Author(s)
Yu, W
Le, J
Lin, X
Zhang, W
Type
Conference Paper
Abstract
P-Rank (Penetrating Rank) has been suggested as a useful measure of structural similarity that takes account of both incoming and outgoing edges in ubiquitous networks. Existing work often utilizes memoization to compute P-Rank similarity in an iterative fashion, which requires cubic time in the worst case. Besides, previous methods mainly focus on the deterministic computation of P-Rank, but lack the probabilistic framework that scales well for large graphs. In this paper, we propose two efficient algorithms for computing P-Rank on large graphs. The first observation is that a large body of objects in a real graph usually share similar neighborhood structures. By merging such objects with an explicit low-rank factorization, we devise a deterministic algorithm to compute P-Rank in quadratic time. The second observation is that by converting the iterative form of P-Rank into a matrix power series form, we can leverage the random sampling approach to probabilistically compute P-Rank in linear time with provable accuracy guarantees. The empirical results on both real and synthetic datasets show that our approaches achieve high time efficiency with controlled error and outperform the baseline algorithms by at least one order of magnitude.
Date Issued
2012-06-25
Date Acceptance
2012-06-25
Citation
Lecture Notes in Computer Science: Proceedings of the 24th International Conference, SSDBM 2012, Chania, Crete, Greece, June 25-27, 2012., 2012, pp.231-249
ISBN
978-3-642-31234-2
ISSN
0302-9743
Publisher
Springer
Start Page
231
End Page
249
Journal / Book Title
Lecture Notes in Computer Science: Proceedings of the 24th International Conference, SSDBM 2012, Chania, Crete, Greece, June 25-27, 2012.
Copyright Statement
© 2012, Springer-Verlag Berlin Heidelberg. The final publication is available at Springer via https://dx.doi.org/10.1007/978-3-642-31235-9_15
Source
24th International Conference Scientific and Statistical Database Management
Notes
bibsource: DBLP, http://dblp.uni-trier.de ee: http://dx.doi.org/10.1007/978-3-642-31235-9_15 owner: ywr0708 timestamp: 2013.11.24
Publication Status
Published
Start Date
2012-06-25
Finish Date
2012-06-27
Coverage Spatial
Chania, Greece