Scalable Load Balancing in the Presence of Heterogeneous Servers

5 March 2021

journal article
research article
Published by Association for Computing Machinery (ACM) in ACM SIGMETRICS Performance Evaluation Review

Vol. 48 (3), 37-38
https://doi.org/10.1145/3453953.3453961

Abstract

In large-scale computer systems, deciding how to dispatch arriving jobs to servers is a primary factor affecting system performance. Consequently, there is a wealth of literature on designing, analyzing, and evaluating the performance of load balancing policies. For analytical tractability, most existing work on dispatching in large-scale systems makes a key assumption: that the servers are homogeneous, meaning that they all have the same speeds, capabilities, and available resources. But this assumption is not accurate in practice. Modern computer systems are instead heterogeneous: server farms may consist of multiple generations of hardware, servers with varied resources, or even virtual machines running in a cloud environment. Given the ubiquity of heterogeneity in today's systems, it is critically important to develop load balancing policies that perform well in heterogeneous environments. In this paper, we focus on systems in which server speeds are heterogeneous.

Keywords

This publication has 3 references indexed in Scilit:

Asymptotic independence of queues under randomized load balancing
Queueing Systems, 2012
Asymptotic Optimality of Balanced Routing
Operations Research, 2012
The power of two choices in randomized load balancing
IEEE Transactions on Parallel and Distributed Systems, 2001

Cited by 3 articles