Synthesizing representative I/O workloads using iterative distillation
- 22 June 2004
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Storage systems designers are still searching for bet- ter methods of obtaining representative I/O workloads to drive studies of I/O systems. Traces of production work- loads are very accurate, but inflexible and difficult to ob- tain. The use of synthetic workloads addresses these limi- tations; however, synthetic workloads are accurate only if they share certain key properties with the production work- load on which they are based (e.g., mean request size, read percentage). Unfortunately, we do not know which proper- ties are "key" for a given workload and storage system. We have developed a tool, the Distiller, that automati- cally identifies the key properties ("attribute-values") of the workload. The Distiller then uses these attribute-values to generate a synthetic workload representative of the produc- tion workload. This paper presents the design and eval- uation of the Distiller. We demonstrate how the Distiller finds representative synthetic workloads for simple artificial workloads and three production workload traces.Keywords
This publication has 10 references indexed in Scilit:
- Data mining meets performance evaluation: fast algorithms for modeling bursty trafficPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A user-oriented synthetic workload generatorPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A new approach in the modeling and generation of synthetic disk workloadPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A new approach in the analysis and modeling of disk access patternsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Capturing the spatio-temporal behavior of real traffic dataPerformance Evaluation, 2002
- Techniques for file system simulationSoftware: Practice and Experience, 1994
- Construction and use of multiclass workload modelsPerformance Evaluation, 1994
- An introduction to disk drive modelingComputer, 1994
- A synthetic workload model for a distributed system file serverPublished by Association for Computing Machinery (ACM) ,1991
- On the foundations of artificial workload designPublished by Association for Computing Machinery (ACM) ,1984