Chapter 5 Evaluation
5.2 Experiment Results and Discussion
First we evaluate the performance of energy saving using error-free workload information. In Table 4, we compare our algorithm with other two approaches that only resizing VM. It can be easily understood that the first two approaches consume much more energy since they don’t provide the server level resizing, and the basic energy consumption takes a significant fraction of the overall energy consumption on a working server [7]. Next we compare our algorithm with other two that resizing at both VM and server levels. The result is illustrated in Fig. 4. We can easily observe that, as the degree of workload fluctuation increases, the energy consumption also increases due to more and more switching cost. We notice that the
server
P
25
Table 4. Energy consumption comparison of the proposed TD-D with approaches only resizing VMs
Variance of workload fluctuation
Resizing VMs only, without break-even
time [6]
Resizing VMs only,
with break-even time TD-D (proposed)
2 2253 2234 248
2.38 2314 2288 292
2.72 2328 2297 311
3 2328 2292 350
Figure 4. Energy consumption comparison of the proposed TD-D with approaches that resize both VMs and servers
approach concerning break-even time consume more energy than the one that uses on-demand resizing. This phenomenon can be understood as we described in Chapter 3, that when applying a simple break-even time rule, we may need to allocate more servers to accommodate the VMs that we kept in VM break-even time events. As the workload fluctuation become severe, more VM break-even time events happen and
26
Table 5. Comparison of average computing time and percentage of times an algorithm completed within 3 minutes
more wasted servers are allocated. This is a good example why we need an optimal algorithm rather than a best-effort algorithm that using simple rules or heuristics.
In the second part, we measure the computing time of our algorithm. Since the complexity of our algorithm is mainly dominated by the level of workload fluctuation, we record the computing time under different workload fluctuation level. We also show the computing time of local search approach for comparison, which is the one that closest to TD-D in energy saving. The results are shown in Table 5. As we can see, the average computing time is acceptable for a long term resource allocation algorithm, and there is a high percentage of times that the algorithm can be completed within a time slot (3 minutes) and give us the optimal solution. In contrast, the local search approach never completes its search within three minutes due to its unbounded search space and the lack of stopping criterion. Since we use the relative energy unit, that is, we set the = 1 and the proposed TD-D can be completed within one time slot, we can conclude that the energy consumption of performing our algorithm is no more than 1. Compare the energy our algorithm can save with the energy and time our algorithm costs, we show the effectiveness and efficiency of our algorithm in energy saving.
Finally, we evaluate the reliability and effectiveness of our algorithm to
Algorithm Variance used in workload generator
2 2.38 2.72 3
27
Figure 5. Comparison of energy consumption under different severity of prediction error
prediction error. Besides the reactive controller showed in Chapter 4, we re-perform our TD-D algorithm every 2 time slots, instead of W, to resist prediction error. When a resource over-allocation occurs, it brings extra energy consumption of operating cost.
When a resource under-allocation occurs, it brings extra energy consumption of switching cost since the reactive controller has to switch on new VM/server to fulfill the demand. Again we use local search approach for comparison. The reactive controller and algorithm re-performing are also implemented in the local search approach. We set the variance used in workload generator = 2.38 and the results are shown in Fig. 5. As we can see, as the prediction error become severer, the energy consumption also become larger, due to the extra operating cost caused by over-allocation and the extra switching cost caused by under-allocation. We find our algorithm can still save more energy than the comparative approach under prediction error. Another way to evaluate the reliability to prediction error is under-allocation ratio. This is an important evaluation since some energy efficient algorithm may take
28
the risk of under-allocation to achieve less energy consumption. The results are shown in Table 6. The difference between the second and third column of Table 6 is, the second column show the average VM under-allocation counts of the whole data center, while the third column show the average VM under-allocation counts of each application, which is the actual influencing factor for user experience. We can find that our algorithm can keep in very low resource under-allocation ratio even under severe prediction error. The reason of how our algorithm can achieve low energy consumption while keeping in low resource-allocation ratio is the good side-effect of concerning break-even time. In a break-even time event, we may choose to keep that temporarily unnecessary resource, thus avoiding some resource under-allocation if a prediction error occurs and the resource demand does not really go down.
Table 6. VM under-allocation under different AWGN variances over 7 time slots
Variance of AWGN
Average VM under-allocation (VM / time slot)
Average VM under-allocation per application (VM / (time slot ×
application))
Local search TD-D Local Search TD-D
0.02 0.27 0.30 0.01 0.01
0.04 0.86 0.76 0.03 0.03
0.06 1.20 1.13 0.04 0.04
29
Chapter 6 Conclusion
6.1 Concluding Remarks
In this paper, we introduce our minimum energy consumption resource allocation algorithm for cloud data centers called Time-Directed Dijkstra (TD-D). It can produce optimal solution by utilizing the existing load prediction approaches. We first characterize the difficulties of resizing at both VM and server levels, and then come up with an optimal algorithm that can seek the best trade-off between operating cost and switching cost to achieve minimum energy consumption. We demonstrate the correctness of our algorithm and show that even such high complexity problem can be completed by commodity machine in reasonable computing time. Compared with representative best-effort dynamic resource allocation algorithm, our optimal algorithm can save more energy under different workload fluctuation level. We also demonstrate the robustness and energy efficiency of our algorithm to prediction error.
6.2 Future Work
The next step is to use the workload traces from real world to further evaluate our algorithm. Another future work is the cluster version of our algorithm. Since there are more and more large scale data center, for reliability and scalability, the cluster version must be developed to build a decentralized resource control system.
30
References
[1] John J. Prevost, KranthiManoj Nagothu, Brian Kelley and Mo Jamshidi,
“Prediction of Cloud Data Center Networks Loads Using Stochastic and Neural Models,” Proc. of the 6th International Conference on System of Systems Engineering, 2011, pp. 276-281.
[2] Truong Vinh Truong Duy, Yukinori Sato, Yasushi Inoguchi, “Performance Evaluation of a Green Scheduling Algorithm for Energy Savings in Cloud Computing,” International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), 2010.
[3] Daniel Gmach, Jerry Rolia, Ludmila Cherkasova, Alfons Kemper, "Workload Analysis and Demand Prediction of Enterprise Data Center Applications," IEEE 10th International Symposium on Workload Characterization, 2007.
[4] Arijit Khan, Xifeng Yan, Shu Tao, Nikos Anerousis, "Workload Characterization and Prediction in the Cloud: A Multiple Time Series Approach," IEEE Network Operations and Management Symposium (NOMS), 2012.
[5] Minghong Lin, Adam Wierman, Lachlan L. H. Andrew, and Eno Thereska,
“Dynamic Right-Sizing for Power-Proportional Data Centers,” IEEE INFOCOM, 2011.
[6] Chunqiang Tang, Malgorzata Steinder, Michael Spreitzer, and Giovanni Pacifici,
“A Scalable Application Placement Controller for Enterprise Data Centers,” ACM Proceedings of the 16th international conference on World Wide Web, 2007.
[7] Dara Kusic, Jeffrey O. Kephart, James E. Hanson, Nagarajan Kandasamy, Guofei Jiang, “Power and Performance Management of Virtualized Computing Environments via Lookahead Control,” Cluster Computing, vol. 12, no. 1, pp.
1-15, March 2009.
31
[8] Anton Beloglazov, Jemal Abawajy, Rajkumar Buyya, “Energy-aware Resource Allocation Heuristics for Efficient Management of Data Centers for Cloud Computing,” Future Generation Computer Systems, vol. 28, no. 5, May 2012, pp.
755–768, 2012.
[9] Danilo Ardagna, Barbara Panicucci, Marco Trubian, and Li Zhang,
“Energy-Aware Autonomic Resource Allocation in Multitier Virtualized Environments,” IEEE Transactions on Services Computing, vol. 5, no. 1, 2012.
[10] Vinicius Petrucci, Orlando Loques, Daniel Mossé , “A Dynamic Optimization Model for Power and Performance Management of Virtualized Clusters,”
Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking, 2010, pp. 225-233.
[11] Anton Beloglazov, Rajkumar Buyya1, Young Choon Lee, and Albert Zomaya,
“A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems,” Advances in Computers, vol. 82, 2011.
[12] Andrew Krioukov, Prashanth Mohan, Sara Alspaugh, Laura Keys, David Culler, Randy Katz, "NapSAC: Design and Implementation of a Power-Proportional Web Cluster," ACM SIGCOMM Computer Communication Review, vol. 41, no. 1, pp.
102-108, January 2011.
[13] Norman Bobroff, Andrzej Kochut, Kirk Beaty, “Dynamic Placement of Virtual Machines for Managing SLA Violations,” 10th IFIP/IEEE International Symposium on Integrated Network Management, 2007.