Reducing Costs of Spot Instances via Checkpointing in the Amazon Elastic Compute Cloud
- 1 July 2010
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 236-243
- https://doi.org/10.1109/cloud.2010.35
Abstract
Recently introduced spot instances in the Amazon Elastic Compute Cloud (EC2) offer lower resource costs in exchange for reduced reliability; these instances can be revoked abruptly due to price and demand fluctuations. Mechanisms and tools that deal with the cost-reliability trade-offs under this schema are of great value for users seeking to lessen their costs while maintaining high reliability. We study how one such a mechanism, namely check pointing, can be used to minimize the cost and volatility of resource provisioning. Based on the real price history of EC2 spot instances, we compare several adaptive check pointing schemes in terms of monetary costs and improvement of job completion times. Trace-based simulations show that our approach can reduce significantly both price and the task completion times.Keywords
This publication has 12 references indexed in Scilit:
- Exploiting non-dedicated resources for cloud computingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- Cost-benefit analysis of Cloud Computing versus desktop gridsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- The cost of doing science on the cloud: The Montage examplePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- On correlated availability in Internet-distributed systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Amazon S3 for science gridsPublished by Association for Computing Machinery (ACM) ,2008
- Exploring event correlation for failure prediction in coalitions of clustersPublished by Association for Computing Machinery (ACM) ,2007
- Adaptive page-level incremental checkpointing based on expected recovery timePublished by Association for Computing Machinery (ACM) ,2006
- Condor-a hunter of idle workstationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile NodesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The effects of checkpointing on program execution timeInformation Processing Letters, 1983