5
IRUS Total
Downloads
  Altmetric

Efficient PartialPairs SimRank search on large graphs

File Description SizeFormat 
DTR14-11.pdfPublished version369.58 kBAdobe PDFView/Open
Title: Efficient PartialPairs SimRank search on large graphs
Authors: Yu, W
McCann, J
Item Type: Report
Abstract: The assessment of node-to-node similarities based on graph topology arises in a myriad of applications, e.g., web search. SimRank is a notable measure of this type, with the intuition that “two nodes are similar if their in-neighbors are similar”. While most existing work retrieving SimRank only considers all-pairs SimRank s(⋆, ⋆) and single-source SimRank s(⋆, j) (scores between every node and query j), there are appealing applications for partial-pairs SimRank, e.g., similarity join. Given two node subsets A and B in a graph, partial-pairs SimRank assessment aims to retrieve only {s(a, b)}∀a∈A,∀b∈B. However, the best-known solution [17] is not self-contained since it hinges on the premise that the SimRank scores with node-pairs in an h-go cover set must be given beforehand. This paper focuses on efficient assessment of partial-pairs SimRank in a self-contained manner. (1) We devise a novel “seed germination” model that computes partial-pairs Sim- Rank in O(k|E|min{|A|, |B|}) time and O(|E|+k|V |) memory for k iterations on a graph of |V | nodes and |E| edges. (2) We further eliminate unnecessary edge access to improve the time of partial-pairs SimRank to O(mmin{|A|, |B|}), where m ≤ min{k|E|, 2k}, and is the maximum degree. (3) We show that our partial-pairs SimRank model also can handle the computations of all-pairs and single-source Sim- Ranks, as well as partial-pairs SimRank* (a related notion of SimRank). (4) We empirically verify that our algorithms are (a) 38x faster than the best-known competitors, and (b) memory-efficient, allowing scores to be assessed accurately on graphs with tens of millions of links.
Issue Date: 1-Jan-2014
URI: http://hdl.handle.net/10044/1/95040
DOI: https://doi.org/10.25561/95040
Publisher: Department of Computing, Imperial College London
Start Page: 1
End Page: 14
Journal / Book Title: Departmental Technical Report: 14/11
Copyright Statement: © 2014 The Author(s). This report is available open access under a CC-BY-NC-ND (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Publication Status: Published
Article Number: 14/11
Appears in Collections:Computing
Computing Technical Reports



This item is licensed under a Creative Commons License Creative Commons