sPARE: Partial Replication for Multi-tier Applications in the Cloud

Robert Birke; Juan F. Perez; Zhan Qiu; Mathias Borkqvist; Lydia Y. Chen

doi:10.1109/TSC.2017.2780845

sPARE: Partial Replication for Multi-tier Applications in the Cloud

Robert Birke, Juan F. Perez, Zhan Qiu, Mathias Borkqvist, Lydia Y. Chen

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x and 2.9x , respectively.

Original language	English (US)
Journal	IEEE Transactions on Services Computing
DOIs	https://doi.org/10.1109/TSC.2017.2780845 https://doi.org/10.1109/TSC.2017.2780845
State	Published - Dec 8 2017

All Science Journal Classification (ASJC) codes

Hardware and Architecture
Computer Science Applications
Computer Networks and Communications
Information Systems and Management

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

Access to Document

Cite this

@article{9acd399299be41cab9ef0b151ad7bd9e,

title = "sPARE: Partial Replication for Multi-tier Applications in the Cloud",

abstract = "Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x and 2.9x , respectively.",

author = "Robert Birke and Perez, {Juan F.} and Zhan Qiu and Mathias Borkqvist and Chen, {Lydia Y.}",

year = "2017",

month = dec,

day = "8",

doi = "10.1109/TSC.2017.2780845",

language = "English (US)",

journal = "IEEE Transactions on Services Computing",

issn = "1939-1374",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

TY - JOUR

T1 - sPARE

T2 - Partial Replication for Multi-tier Applications in the Cloud

AU - Birke, Robert

AU - Perez, Juan F.

AU - Qiu, Zhan

AU - Borkqvist, Mathias

AU - Chen, Lydia Y.

PY - 2017/12/8

Y1 - 2017/12/8

N2 - Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x and 2.9x , respectively.

AB - Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x and 2.9x , respectively.

UR - http://www.scopus.com/inward/record.url?scp=85038366923&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85038366923&partnerID=8YFLogxK

U2 - 10.1109/TSC.2017.2780845

DO - 10.1109/TSC.2017.2780845

M3 - Article

AN - SCOPUS:85038366923

SN - 1939-1374

JO - IEEE Transactions on Services Computing

JF - IEEE Transactions on Services Computing

ER -

sPARE: Partial Replication for Multi-tier Applications in the Cloud

Abstract

All Science Journal Classification (ASJC) codes

UN SDGs

Access to Document

Other files and links

Fingerprint

Cite this