Deconstructing Commodity Storage Clusters
- 28 July 2005
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 33 (2), 60-71
- https://doi.org/10.1109/isca.2005.20
Abstract
The traditional approach for characterizing complex systems is to run standard workloads and measure the resulting performance as seen by the end user. However, unique opportunities exist when characterizing a system that is itself constructed from standardized components: one can also look inside the system itself by instrumenting each of the components. In this paper, we show how intra-box instrumentation can help one understand the behavior of a large-scale storage cluster, the EMC Centera. In our analysis, we leverage standard tools for tracing both the disk and network traffic emanating from each node of the cluster. By correlating this traffic with the running workload, we are able to infer the structure of the software system (e.g., its write update protocol) as well as its policies (e.g., how it performs caching, replication, and load-balancing). Further, by imposing variable intra-box delays on network and disk traffic, we can confirm the causal relationships between network and disk events. Thus, we are able to infer the semantics of the messages between nodes without examining a single line of source code.Keywords
This publication has 18 references indexed in Scilit:
- Deconstructing storage arraysPublished by Association for Computing Machinery (ACM) ,2004
- Cache performance for selected SPEC CPU2000 benchmarksACM SIGARCH Computer Architecture News, 2001
- Computing in the RAIN: a reliable array of independent nodesIEEE Transactions on Parallel and Distributed Systems, 2001
- PetalPublished by Association for Computing Machinery (ACM) ,1996
- Measuring cache and TLB performance and their effect on benchmark runtimesIEEE Transactions on Computers, 1995
- Informed prefetching and cachingPublished by Association for Computing Machinery (ACM) ,1995
- A case for NOW (Networks of Workstations)IEEE Micro, 1995
- RAID: high-performance, reliable secondary storageACM Computing Surveys, 1994
- A new approach to I/O performance evaluationPublished by Association for Computing Machinery (ACM) ,1993
- The design and implementation of a log-structured file systemACM Transactions on Computer Systems, 1992