Optimising AI Workload Distribution in Multi-Cloud Environments: A Dynamic Resource Allocation Approach

Authors

  • Bo Yuan VMware
  • Guanghe Cao University of Southern California
  • Jun Sun University of Connecticut
  • Shiji Zhou University of Southern California

DOI:

https://doi.org/10.5281/zenodo.13863194

ARK:

https://n2t.net/ark:/40704/JIEAS.v2n5a10

References:

40

Keywords:

Multi-cloud Computing, AI Workload Optimisation, Dynamic Resource Allocation, Energy-efficient Computing

Abstract

This research presents a new resource allocation method for optimising AI task distribution in multi-cloud environments. The proposed approach addresses the challenges of managing complex AI operations across different environments, focusing on improving resource efficiency, energy efficiency, and financial efficiency. The framework includes advanced machine learning techniques, including performance measurement and performance prediction, multi-dimensional monitoring and profiling, decision-based adaptive support learning, and data transfer in different clouds.
The experimental results show a significant improvement over existing solutions, with a 9.8% increase in average resource utilisation and a 21% reduction in task completion time. Even when measured for 5000 VMs, the framework performs well, showing exceptional scalability and robustness. A cost-benefit analysis shows a 30.6% reduction in Total Cost of Ownership over a simulated 3-year period and a 30.5% reduction in energy and gas consumption—carbon emissions.
The research findings have significant implications for climate control AI in many areas, providing insight into strategies for optimising operations and energy efficiency and improving environmental trust. The proposed framework represents a paradigm shift in the cloud, providing a blueprint for next-generation AI infrastructure that can adapt to the evolving needs of complex AI applications while supporting business stability and effectiveness.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Author Biographies

Bo Yuan, VMware

VMware, Beijing, China.

Guanghe Cao, University of Southern California

Computer Science, University of Southern California, CA, USA.

Jun Sun, University of Connecticut

Business Analytics and Project Management, University of Connecticut, CT, USA.

Shiji Zhou, University of Southern California

Computer Science, University of Southern California, CA, USA.

References

Kumar, P., Tharad, A., Mukhammadjonov, U., & Rawat, S. (2021, October). Analysis on Resource Allocation for parallel processing and Scheduling in Cloud Computing. In 2021 5th International Conference on Information Systems and Computer Networks (ISCON) (pp. 1-6). IEEE.

Yin, Y., & Zhao, M. (2023, May). Application of AI, Big Data and Cloud Computing Technology in Smart Factories. In 2023 6th International Conference on Artificial Intelligence and Big Data (ICAIBD) (pp. 192-196). IEEE.

Chavan, P., & Chavan, P. (2024, June). Automation of AD-OHC Dashbord and Monitoring of Cloud Resources using Genrative AI to Reduce Costing and Enhance Performance. In 2024 International Conference on Innovations and Challenges in Emerging Technologies (ICICET) (pp. 1-9). IEEE.

Paraskevoulakou, E., Tom-Ata, J. D. T., Symvoulidis, C., & Kyriazis, D. (2024, January). Enhancing cloud-based application component placement with ai-driven operations. In 2024 IEEE 14th Annual Computing and Communication Workshop and Conference (CCWC) (pp. 0687-0694). IEEE.

Gore, S., Bhapkar, Y., Ghadge, J., Gore, S., & Singha, S. K. (2023, October). Evolutionary Programming for Dynamic Resource Management and Energy Optimization in Cloud Computing. In 2023 International Conference on Advanced Computing Technologies and Applications (ICACTA) (pp. 1-5). IEEE.

Li, S., Xu, H., Lu, T., Cao, G., & Zhang, X. (2024). Emerging Technologies in Finance: Revolutionizing Investment Strategies and Tax Management in the Digital Era. Management Journal for Advanced Research, 4(4), 35-49.

Shi J, Shang F, Zhou S, et al. Applications of Quantum Machine Learning in Large-Scale E-commerce Recommendation Systems: Enhancing Efficiency and Accuracy[J]. Journal of Industrial Engineering and Applied Science, 2024, 2(4): 90-103.

Wang, S., Zheng, H., Wen, X., & Fu, S. (2024). DISTRIBUTED HIGH-PERFORMANCE COMPUTING METHODS FOR ACCELERATING DEEP LEARNING TRAINING. Journal of Knowledge Learning and Science Technology ISSN: 2959-6386 (online), 3(3), 108-126.

Wang, B., Zheng, H., Qian, K., Zhan, X., & Wang, J. (2024). Edge computing and AI-driven intelligent traffic monitoring and optimization. Applied and Computational Engineering, 77, 225-230.

Li, H., Wang, S. X., Shang, F., Niu, K., & Song, R. (2024). Applications of Large Language Models in Cloud Computing: An Empirical Study Using Real-world Data. International Journal of Innovative Research in Computer Science & Technology, 12(4), 59-69.

Ping, G., Wang, S. X., Zhao, F., Wang, Z., & Zhang, X. (2024). Blockchain Based Reverse Logistics Data Tracking: An Innovative Approach to Enhance E-Waste Recycling Efficiency.

Xu, H., Niu, K., Lu, T., & Li, S. (2024). Leveraging artificial intelligence for enhanced risk management in financial services: Current applications and future prospects. Engineering Science & Technology Journal, 5(8), 2402-2426.

Shi, Y., Shang, F., Xu, Z., & Zhou, S. (2024). Emotion-Driven Deep Learning Recommendation Systems: Mining Preferences from User Reviews and Predicting Scores. Journal of Artificial Intelligence and Development, 3(1), 40-46.

Wang, Shikai, Kangming Xu, and Zhipeng Ling. "Deep Learning-Based Chip Power Prediction and Optimization: An Intelligent EDA Approach." International Journal of Innovative Research in Computer Science & Technology 12.4 (2024): 77-87.

Ping, G., Zhu, M., Ling, Z., & Niu, K. (2024). Research on Optimizing Logistics Transportation Routes Using AI Large Models. Applied Science and Engineering Journal for Advanced Research, 3(4), 14-27.

Shang, F., Shi, J., Shi, Y., & Zhou, S. (2024). Enhancing E-Commerce Recommendation Systems with Deep Learning-based Sentiment Analysis of User Reviews. International Journal of Engineering and Management Research, 14(4), 19-34.

Xu, H., Li, S., Niu, K., & Ping, G. (2024). Utilizing Deep Learning to Detect Fraud in Financial Transactions and Tax Reporting. Journal of Economic Theory and Business Management, 1(4), 61-71.

Xu, K., Zhou, H., Zheng, H., Zhu, M., & Xin, Q. (2024). Intelligent Classification and Personalized Recommendation of E-commerce Products Based on Machine Learning. arXiv preprint arXiv:2403.19345.

Xu, K., Zheng, H., Zhan, X., Zhou, S., & Niu, K. (2024). Evaluation and Optimization of Intelligent Recommendation System Performance with Cloud Resource Automation Compatibility.

Zheng, H., Xu, K., Zhou, H., Wang, Y., & Su, G. (2024). Medication Recommendation System Based on Natural Language Processing for Patient Emotion Analysis. Academic Journal of Science and Technology, 10(1), 62-68.

Zheng, H.; Wu, J.; Song, R.; Guo, L.; Xu, Z. Predicting Financial Enterprise Stocks and Economic Data Trends Using Machine Learning Time Series Analysis. Applied and Computational Engineering 2024, 87, 26–32.

Zhan, X., Shi, C., Li, L., Xu, K., & Zheng, H. (2024). Aspect category sentiment analysis based on multiple attention mechanisms and pre-trained models. Applied and Computational Engineering, 71, 21-26.

Liu, B., Zhao, X., Hu, H., Lin, Q., & Huang, J. (2023). Detection of Esophageal Cancer Lesions Based on CBAM Faster R-CNN. Journal of Theory and Practice of Engineering Science, 3(12), 36-42.

Liu, B., Yu, L., Che, C., Lin, Q., Hu, H., & Zhao, X. (2024). Integration and performance analysis of artificial intelligence and computer vision based on deep learning algorithms. Applied and Computational Engineering, 64, 36-41.

Liu, B. (2023). Based on intelligent advertising recommendation and abnormal advertising monitoring system in the field of machine learning. International Journal of Computer Science and Information Technology, 1(1), 17-23.

Wu, B., Xu, J., Zhang, Y., Liu, B., Gong, Y., & Huang, J. (2024). Integration of computer networks and artificial neural networks for an AI-based network operator. arXiv preprint arXiv:2407.01541.

Liang, P., Song, B., Zhan, X., Chen, Z., & Yuan, J. (2024). Automating the training and deployment of models in MLOps by integrating systems with machine learning. Applied and Computational Engineering, 67, 1-7.

Wu, B., Gong, Y., Zheng, H., Zhang, Y., Huang, J., & Xu, J. (2024). Enterprise cloud resource optimization and management based on cloud operations. Applied and Computational Engineering, 67, 8-14.

Liu, B., & Zhang, Y. (2023). Implementation of seamless assistance with Google Assistant leveraging cloud computing. Journal of Cloud Computing, 12(4), 1-15.

Guo, L., Li, Z., Qian, K., Ding, W., & Chen, Z. (2024). Bank Credit Risk Early Warning Model Based on Machine Learning Decision Trees. Journal of Economic Theory and Business Management, 1(3), 24-30.

Xu, Z., Guo, L., Zhou, S., Song, R., & Niu, K. (2024). Enterprise Supply Chain Risk Management and Decision Support Driven by Large Language Models. Applied Science and Engineering Journal for Advanced Research, 3(4), 1-7.

Song, R., Wang, Z., Guo, L., Zhao, F., & Xu, Z. (2024). Deep Belief Networks (DBN) for Financial Time Series Analysis and Market Trends Prediction.World Journal of Innovative Medical Technologies, 5(3), 27-34.

Guo, L.; Song, R.; Wu, J.; Xu, Z.; Zhao, F. Integrating a Machine Learning-Driven Fraud Detection System Based on a Risk Management Framework. Preprints 2024, 2024061756.

Feng, Y., Qi, Y., Li, H., Wang, X., & Tian, J. (2024, July 11). Leveraging federated learning and edge computing for recommendation systems within cloud computing networks. In Proceedings of the Third International Symposium on Computer Applications and Information Systems (ISCAIS 2024) (Vol. 13210, pp. 279-287). SPIE.

Zhao, F.; Li, H.; Niu, K.; Shi, J.; Song, R. Application of Deep Learning-Based Intrusion Detection System (IDS) in Network Anomaly Traffic Detection. Preprints 2024, 2024070595.

Gong, Y., Liu, H., Li, L., Tian, J., & Li, H. (2024, February 28). Deep learning-based medical image registration algorithm: Enhancing accuracy with dense connections and channel attention mechanisms. Journal of Theory and Practice of Engineering Science, 4(02), 1-7.

Yu, K., Bao, Q., Xu, H., Cao, G., & Xia, S. (2024). An Extreme Learning Machine Stock Price Prediction Algorithm Based on the Optimisation of the Crown Porcupine Optimisation Algorithm with an Adaptive Bandwidth Kernel Function Density Estimation Algorithm.

Li A, Zhuang S, Yang T, Lu W, Xu J. Optimization of logistics cargo tracking and transportation efficiency based on data science deep learning models. Applied and Computational Engineering. 2024 Jul 8;69:71-7.

Zhang, M., Yuan, B., Li, H., & Xu, K. (2024). LLM-Cloud Complete: Leveraging Cloud Computing for Efficient Large Language Model-based Code Completion. Journal of Artificial Intelligence General science (JAIGS) ISSN: 3006-4023, 5(1), 295-326.

Wang, Y., Zhu, M., Yuan, J., Wang, G., & Zhou, H. The intelligent prediction and assessment of financial information risk in the cloud computing model. Appl. Comput. Eng. 2024, 64, 136–142.

Downloads

Published

2024-10-01

How to Cite

[1]
B. Yuan, G. Cao, J. Sun, and S. Zhou, “Optimising AI Workload Distribution in Multi-Cloud Environments: A Dynamic Resource Allocation Approach”, Journal of Industrial Engineering & Applied Science, vol. 2, no. 5, pp. 68–79, Oct. 2024.

Issue

Section

Articles

ARK