OPAL: High performance platform for large-scale privacy-preserving location data analytics
File(s)OPAL_IEEE_Big_Data_2019_with_copyright.pdf (816.94 KB)
Accepted version
Author(s)
Oehmichen, A
Jain, S
Gadotti, A
Montjoye, YAD
Type
Conference Paper
Abstract
Mobile phones and other ubiquitous technologies are
generating vast amounts of high-resolution location data. This
data has been shown to have a great potential for the public good,
e.g. to monitor human migration during crises or to predict the
spread of epidemic diseases. Location data is, however, considered
one of the most sensitive types of data, and a large body of
research has shown the limits of traditional data anonymization
methods for big data. Privacy concerns have so far strongly
limited the use of location data collected by telcos, especially
in developing countries.
In this paper, we introduce OPAL (for OPen ALgorithms),
an open-source, scalable, and privacy-preserving platform for
location data. At its core, OPAL relies on an open algorithm
to extract key aggregated statistics from location data for a
wide range of potential use cases. We first discuss how we
designed the OPAL platform, building a modular and resilient
framework for efficient location analytics. We then describe the
layered mechanisms we have put in place to protect privacy
and discuss the example of a population density algorithm. We
finally evaluate the scalability and extensibility of the platform
and discuss related work.
generating vast amounts of high-resolution location data. This
data has been shown to have a great potential for the public good,
e.g. to monitor human migration during crises or to predict the
spread of epidemic diseases. Location data is, however, considered
one of the most sensitive types of data, and a large body of
research has shown the limits of traditional data anonymization
methods for big data. Privacy concerns have so far strongly
limited the use of location data collected by telcos, especially
in developing countries.
In this paper, we introduce OPAL (for OPen ALgorithms),
an open-source, scalable, and privacy-preserving platform for
location data. At its core, OPAL relies on an open algorithm
to extract key aggregated statistics from location data for a
wide range of potential use cases. We first discuss how we
designed the OPAL platform, building a modular and resilient
framework for efficient location analytics. We then describe the
layered mechanisms we have put in place to protect privacy
and discuss the example of a population density algorithm. We
finally evaluate the scalability and extensibility of the platform
and discuss related work.
Date Issued
2019-12-09
Date Acceptance
2019-10-17
Citation
2019, IEEE International Conference on Big Data, 2019
Publisher
IEEE
Journal / Book Title
2019, IEEE International Conference on Big Data
Copyright Statement
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor
Overseas Development Institute
Identifier
https://ieeexplore.ieee.org/document/9006389
Grant Number
F0239200
Source
IEEE International Conference on Big Data
Publication Status
Published
Start Date
2019-12-09
Finish Date
2019-12-12
Coverage Spatial
Los Angeles, CA, USA
Date Publish Online
2020-02-24