1
IRUS Total
Downloads
  Altmetric

Let it recover: multiparty protocol-induced recovery

File Description SizeFormat 
DTRS16-10.pdfPublished version520.6 kBAdobe PDFView/Open
Title: Let it recover: multiparty protocol-induced recovery
Authors: Neykova, R
Yoshida, N
Item Type: Report
Abstract: Fault-tolerant communication systems rely on recovery strategies which are often error-prone (e.g. a programmer manually specifies recovery strategies) or inefficient (e.g. the whole system is restarted from the beginning). This paper proposes a static analysis based on multiparty session types that can efficiently compute a safe global state from which a system of interacting processes should be recovered. We statically analyse the communication flow of a program, given as a multiparty protocol, to extract the causal dependencies between processes and to localise failures. We formalise our recovery algorithm and prove its safety. A recovered communication system is free from deadlocks, orphan messages and reception errors. Our recovery algorithm incurs less communication cost (only affected processes are notified) and overall execution time (only required states are repeated). On top of our analysis, we design and implement a runtime framework in Erlang where failed processes and their dependencies are soundly restarted from a computed safe state. We evaluate our recovery framework on messagepassing benchmarks and a use case for crawling webpages. The experimental results indicate our framework outperforms a built-in static recovery strategy in Erlang when a part of the protocol can be safely recovered.
Issue Date: 1-Jan-2016
URI: http://hdl.handle.net/10044/1/94971
DOI: 10.25561/94971
Publisher: Department of Computing, Imperial College London
Start Page: 1
End Page: 14
Journal / Book Title: Departmental Technical Report: 16/10
Copyright Statement: © 2016 The Author(s). This report is available open access under a CC-BY-NC-ND (https://creativecommons.org/licenses/by-nc-nd/4.0/)
Publication Status: Published
Article Number: 16/10
Appears in Collections:Computing
Computing Technical Reports
Faculty of Engineering



This item is licensed under a Creative Commons License Creative Commons