TRANSFORMERS: Robust spatial joins on non-uniform data distributions
File(s)transformers_compressed.pdf (299.68 KB)
Accepted version
Author(s)
Pavlovic, Mirjana
Heinis, Thomas
Tauheed, Farhan
Karras, Panagiotis
Ailamaki, Anastasia
Type
Conference Paper
Abstract
Spatial joins are becoming increasingly ubiquitous in many applications, particularly in the scientific domain. While several approaches have been proposed for joining spatial datasets, each of them has a strength for a particular type of density ratio among the joined datasets. More generally, no single proposed method can efficiently join two spatial datasets in a robust manner with respect to their data distributions. Some approaches do well for datasets with contrasting densities while others do better with similar densities. None of them does well when the datasets have locally divergent data distributions. In this paper we develop TRANSFORMERS, an efficient and robust spatial join approach that is indifferent to such variations of distribution among the joined data. TRANSFORMERS achieves this feat by departing from the state-of-the-art through adapting the join strategy and data layout to local density variations among the joined data. It employs a join method based on data-oriented partitioning when joining areas of substantially different local densities, whereas it uses big partitions (as in space-oriented partitioning) when the densities are similar, while seamlessly switching among these two strategies at runtime. We experimentally demonstrate that TRANSFORMERS outperforms state-of-the-art approaches by a factor of between 2 and 8.
Date Issued
2016
Date Acceptance
2015-12-14
Citation
32nd IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May 16-20, 2016, 2016, pp.673-684
Publisher
IEEE
Start Page
673
End Page
684
Journal / Book Title
32nd IEEE International Conference on Data Engineering, ICDE 2016, Helsinki, Finland, May 16-20, 2016
Copyright Statement
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Source
32nd IEEE International Conference on Data Engineering (ICDE)
Publication Status
Published
Start Date
2016-05-16
Finish Date
2016-05-20
Coverage Spatial
Helsinki, Finland