Cutting latency tail: analyzing and validating replication without canceling

File: main.pdf (Accepted version, 790.83 kB, Adobe PDF)
Title: Cutting latency tail: analyzing and validating replication without canceling
Authors: Qiu, Z; Perez, JF; Birke, R; Chen, L; Harrison, PG
Item Type: Journal Article
Abstract: Response time variability in software applications can severely degrade the quality of the user experience. To reduce this variability, request replication emerges as an effective solution by spawning multiple copies of each request and using the result of the first one to complete. Most previous studies have mainly focused on the mean latency for systems implementing replica cancellation, i.e., all replicas of a request are canceled once the first one finishes. Instead, we develop models to obtain the response-time distribution for systems where replica cancellation may be too expensive or infeasible to implement, as in “fast” systems, such as web services, or in legacy systems. Furthermore, we introduce a novel service model to explicitly consider correlation in the processing times of the request replicas, and design an efficient algorithm to parameterize the model from real data. Extensive evaluations on a MATLAB benchmark and a three-tier web application (MediaWiki) show remarkable accuracy, e.g., 7% (4%) average error on the 99th percentile response time for the benchmark (respectively, MediaWiki), the requests of which execute in the order of seconds (respectively, milliseconds). Insights into optimal replication levels are thereby gained from this precise quantitative analysis, under a wide variety of system scenarios.
Issue Date: 19-May-2017
Date of Acceptance: 21-Apr-2017
ISSN: 1558-2183
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Journal / Book Title: IEEE Transactions on Parallel and Distributed Systems
Copyright Statement: © 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See for more information
Sponsor/Funder: Engineering & Physical Science Research Council (EPSRC)
Funder's Grant Number: EP/L00738X/1
Keywords: 0805 Distributed Computing; 0803 Computer Software
Publication Status: Published
Appears in Collections: Faculty of Engineering