Workload Predicting-Based Automatic Scaling in Service Clouds

Abstract
Service platforms have disadvantages such as they have long construction periods, low resource utilizations and isolated constructions. Migrating service platforms into clouds can solve these problems. The scalability is an important characteristic of service clouds. With the scalability, the service cloud can offer on-demand capacities to different services. In order to achieve the scalability, we need to know when and how to scale virtual resources assigned to different services. In this paper, a linear regression model is used to predict the workload. Based on this predicted workload, an auto-scaling mechanism is proposed to scale virtual resources at different resource levels in service clouds. The automatic scaling mechanism combines the real-time scaling and the pre-scaling. Finally experimental results are provided to demonstrate that our approach can satisfy the user SLA while keeping scaling costs low.

This publication has 9 references indexed in Scilit: