sPARE: Partial Replication for Multi-tier Applications in the Cloud

Robert Birke, Juan F. Perez, Zhan Qiu, Mathias Borkqvist, Lydia Y. Chen

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x  and  2.9x  , respectively.

Fingerprint

Replication
World Wide Web
Latency
Arbitration
Factors
Interference
Web application
Amazon
Art
Workload
Virtual machine

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Computer Science Applications
  • Computer Networks and Communications
  • Information Systems and Management

Cite this

@article{9acd399299be41cab9ef0b151ad7bd9e,
title = "sPARE: Partial Replication for Multi-tier Applications in the Cloud",
abstract = "Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x  and  2.9x  , respectively.",
author = "Robert Birke and Perez, {Juan F.} and Zhan Qiu and Mathias Borkqvist and Chen, {Lydia Y.}",
year = "2017",
month = "12",
day = "8",
doi = "10.1109/TSC.2017.2780845",
language = "English (US)",
journal = "IEEE Transactions on Services Computing",
issn = "1939-1374",
publisher = "Institute of Electrical and Electronics Engineers Inc.",

}

sPARE : Partial Replication for Multi-tier Applications in the Cloud. / Birke, Robert; Perez, Juan F.; Qiu, Zhan; Borkqvist, Mathias; Chen, Lydia Y.

In: IEEE Transactions on Services Computing, 08.12.2017.

Research output: Contribution to journalArticle

TY - JOUR

T1 - sPARE

T2 - Partial Replication for Multi-tier Applications in the Cloud

AU - Birke, Robert

AU - Perez, Juan F.

AU - Qiu, Zhan

AU - Borkqvist, Mathias

AU - Chen, Lydia Y.

PY - 2017/12/8

Y1 - 2017/12/8

N2 - Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x  and  2.9x  , respectively.

AB - Offering consistent low latency remains a key challenge for distributed applications, especially when deployed on the cloud where virtual machines (VMs) suffer from capacity variability caused by colocated tenants. Replicating redundant requests were shown to be an effective mechanism to defend application performance from high capacity variability. While the prior art centers on single-tier systems, it still remains an open question how to design replication strategies for distributed multi-tier systems. In this paper, we design a first of its kind PArtial REplication system, sPARE, that replicates and dispatches read-only workloads for distributed multi-tier web applications The two key components of sPARE are (i) the variability-aware replicator that coordinates the replication levels on all tiers via an iterative searching algorithm, and (ii) the replication-aware arbiter that uses a novel token-based arbitration algorithm (TAD) to dispatch requests in each tier. We evaluate sPARE on web serving and web searching applications, i.e., MediaWiki and Solr, the former deployed on our private cloud and the latter in the wild on Amazon EC2. Our results based on various interference patterns and traffic loads show that sPARE is able to improve the tail latency of MediaWiki and Solr by a factor of almost 2.7x  and  2.9x  , respectively.

UR - http://www.scopus.com/inward/record.url?scp=85038366923&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85038366923&partnerID=8YFLogxK

U2 - 10.1109/TSC.2017.2780845

DO - 10.1109/TSC.2017.2780845

M3 - Article

AN - SCOPUS:85038366923

JO - IEEE Transactions on Services Computing

JF - IEEE Transactions on Services Computing

SN - 1939-1374

ER -