Characterizing Cloud Applications on a Google Data Center
- 1 October 2013
- conference paper
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 468-473
- https://doi.org/10.1109/icpp.2013.56
Abstract
In this paper, we characterize Google applications, based on a one-month Google trace with over 650k jobs running across over 12000 heterogeneous hosts from a Google data center. On one hand, we carefully compute the valuable statistics about task events and resource utilization for Google applications, based on various types of resources (such as CPU, memory) and execution types (e.g., whether they can run batch tasks or not). Resource utilization per application is observed with an extremely typical Pareto principle. On the other hand, we classify applications via a K-means clustering algorithm with optimized number of sets, based on task events and resource usage. The number of applications in the K-means clustering sets follows a Pareto-similar distribution. We believe our work is very interesting and valuable for the further investigation of Cloud environment.Keywords
This publication has 10 references indexed in Scilit:
- Characterization and Comparison of Cloud versus Grid WorkloadsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Characterizing Machines and Workloads on a Google ClusterPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Dynamic Optimization of Multiattribute Resource Allocation in Self-Organizing CloudsIEEE Transactions on Parallel and Distributed Systems, 2012
- Virtual Machine Resource Allocation for Service Hosting on Heterogeneous Distributed PlatformsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2012
- Modeling and synthesizing task placement constraints in Google compute clustersPublished by Association for Computing Machinery (ACM) ,2011
- CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithmsSoftware: Practice and Experience, 2010
- Towards characterizing cloud backend workloadsACM SIGMETRICS Performance Evaluation Review, 2010
- InterCloud: Utility-Oriented Federation of Cloud Computing Environments for Scaling of Application ServicesLecture Notes in Computer Science, 2010
- Alternatives to the k-means algorithm that find better clusteringsPublished by Association for Computing Machinery (ACM) ,2002
- Spatial TessellationsWiley Series in Probability and Statistics, 2000