PAX: Partition-Aware Autoscaling for the Cassandra NoSQL Database
File(s)
177331.pdf (812.61 KB)
Accepted version
Author(s)
Dipietro, S
Buyya, R
Casale, G
Type
Conference Paper
Abstract
Apache Cassandra has emerged as one of the most widely adopted NoSQL databases. However, there is still a limited understanding of how to optimally operate Cassandra in the cloud using autoscaling methods, by which resources can be scaled up or down to reduce operational costs and meet service-level objectives (SLOs). To address this limitation, we present PAX, a partition-aware elastic resource management system for Apache Cassandra. PAX uses low-overhead query sampling and knowledge of how data is partitioned across the nodes to automatically adapt capacity in Cassandra clusters. Unlike existing autoscaling methods for Cassandra, which incur large acquisition times for new nodes, PAX exploits Cassandra's hinted handoff mechanism and a shared hints storage to minimize the time needed to bring a new node into the cluster. We propose a reactive and a proactive implementation of PAX and compare their performance under different workloads with varying intensities and item popularity distributions, finding that the proactive version significantly reduces SLO violations.
Date Issued
2018-07-09
Date Acceptance
2017-12-04
Citation
NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium, 2018
ISBN
9781538634172
ISSN
2374-9709
Publisher
IEEE
Journal / Book Title
NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium
Copyright Statement
© 2018 IEEE.
Sponsor
Commission of the European Communities
Grant Number
644869
Source
2018 IEEE/IFIP Network Operations and Management Symposium
Publication Status
Published
Start Date
2018-04-23
Finish Date
2018-04-27
Coverage Spatial
Taipei, Taiwan
Date Publish Online
2018-07-09