
浏览全部资源
扫码关注微信
Received:10 February 2026,
Revised:2026-04-08,
Accepted:11 May 2026,
移动端阅览
Research on Cloud Resource Scheduling Based on Hierarchical Game Theory and Hybrid Parameterized Control[J/OL]. Telecommunications Science, 2026.
Research on Cloud Resource Scheduling Based on Hierarchical Game Theory and Hybrid Parameterized Control[J/OL]. Telecommunications Science, 2026. DOI: 10.11959/j.issn.1000-0801.DXKX260102.
面对云计算环境的大规模并发性、高度动态性以及多目标冲突(如完工时间、能耗与资源利用率之间的权衡)的严峻挑战,传统的启发式算法难以适应负载潮汐,而单智能体深度强化学习(DRL)在处理大规模状态空间时往往陷入“维度灾难”。本文提出了一种创新HGT-MARL-CS-PSO框架,旨在通过分层博弈机制解决云资源调度的复杂性。首先,本文将调度问题形式化为一个两级斯塔克伯格博弈模型:高层管理智能体作为领导者,根据全局负载特征制定宏观战略目标;低层执行智能体作为跟随者,在战略约束下进行非合作纳什博弈,以平衡局部资源竞争与全局协作。其次,为了克服离散动作空间爆炸的问题,本文设计了一种混合参数化控制机制,智能体并不直接输出任务映射,而是输出连续的权重参数向量,动态驱动底层的布谷鸟搜索-粒子群调度器进行微观寻优。理论分析证明了该框架在斯塔克伯格均衡点的收敛性。Google Borg Trace真实数据集的大规模实验表明,HGT-MARL-CS-PSO在完工时间、能耗效率及资源利用率等关键指标上显著优于现有的DRL基线及主流元启发式算法,展现了卓越的泛化能力与鲁棒性。
Cloud resource scheduling in large-scale data centers faces severe challenges characterized by high concurrency
dynamic workload volatility
and conflicting optimization objectives (e.g.
tradeoffs between makespan
energy consumption
and resource utilization). Traditional heuristics often lack adaptability
while single-agent Deep Reinforcement Learning (DRL) approaches suffer from the "curse of dimensionality" when scaling to large state spaces. To address these limitations
this paper proposes HGT-MARL-CS-PSO
a novel framework that orchestrates scheduling through a hierarchical game-theoretic approach. First
we formulate the scheduling problem as a Two-Level Stackelberg Game: a high-level Manager Agent (M-Agent) acts as the Leader
defining global strategic goals based on macro-workload patterns
while low-level Executor Agents (E-Agents) act as Followers
engaging in a non-cooperative Nash game to balance local resource competition with global cooperation. Second
to mitigate the explosion of discrete action spaces
we introduce a Hybrid Parameterized Control Mechanism. Instead of generating task mappings directly
the RL agents output continuous weight parameter vectors to dynamically configure a subordinate Cuckoo Search-Particle Swarm Optimization (CS-PSO) scheduler for precise micro-execution. Theoretical analysis validates the convergence of the proposed framework towards a Stackelberg Equilibrium. Extensive experiments using real-world Google Borg Traces demonstrate that HGT-MARL-CS-PSO significantly outperforms state-of-the-art DRL baselines and meta-heuristics in terms of makespan reduction
energy efficiency
and resource utilization
proving its superior generalization and robustness in complex environments.
Buyya R , Yeo C S , Venugopal S , et al . Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility [J ] . Future Generation Computer Systems , 2009 , 25 ( 6 ): 599 - 616 .
Kumar M , Sharma S C , Goel A , et al . A comprehensive survey for scheduling techniques in cloud computing [J ] . Journal of Network and Computer Applications , 2019 , 143 : 1 - 33 .
Ullman J D . NP-complete scheduling problems [J ] . Journal of Computer and System Sciences , 1975 , 10 ( 3 ): 384 - 393 .
Etminani K , Naghibzadeh M . A Min-Min Max-Min selective algorithm for grid task scheduling [C ] // 2007 3rd IEEE/IFIP International Conference on Central Asia on Internet . IEEE , 2007 : 1 - 7 .
Jena U K , Das P K , Kabat M R . Hybridization of meta-heuristic algorithm for load balancing in cloud computing environment [J ] . Journal of King Saud University-Computer and Information Sciences , 2022 , 34 ( 6 ): 2332 - 2342 .
Mao H , Alizadeh M , Menache I , et al . Resource management with deep reinforcement learning [C ] // Proceedings of the 15th ACM Workshop on Hot Topics in Networks . 2016 : 50 - 56 .
Arulkumaran K , Deisenroth M P , Brundage M , et al . Deep reinforcement learning: A brief survey [J ] . IEEE Signal Processing Magazine , 2017 , 34 ( 6 ): 26 - 38 .
Cheng M , Li J , Nazarian S , et al . Multi-agent reinforcement learning for task scheduling in large-scale cloud data centers [J ] . Future Generation Computer Systems , 2023 , 145 : 112 - 125 .
Lowe R , Wu Y , Tamar A , et al . Multi-agent actor-critic for mixed cooperative-competitive environments [C ] // Advances in Neural Information Processing Systems . 2017 : 6379 - 6390 .
Krishnamurthy B , Shiva S G . Scalable Resource Provisioning Framework for Fog Computing Using LLM-Guided Q-Learning Approach [J ] . Algorithms , 2025 , 18 ( 4 ): 230 .
Wu Y , et al . Task Scheduling in Geo-Distributed Computing: A Survey [J ] . arXiv preprint arXiv: 2501.15504 , 2025 .
Wang T , Liang Y , Jia Y , et al . Stackelberg game based resource allocation in edge computing with renewable energy [J ] . IEEE Internet of Things Journal , 2020 , 7 ( 8 ): 7306 - 7318 .
Hady M A , et al . Multi-Agent Reinforcement Learning for Resources Allocation Optimization: A Survey [J ] . arXiv preprint arXiv: 2504.21048 , 2025 .
Krishnamurthy B , Shiva S G . Large Language Model-Guided SARSA Algorithm for Dynamic Task Scheduling in Cloud Computing [J ] . Mathematics , 2025 , 13 ( 6 ): 926 .
Moura J , Hutchison D . Game Theory for Multi-Access Edge Computing: Survey, Use Cases, and Future Trends [J ] . IEEE Communications Surveys & Tutorials , 2019 , 21 ( 1 ): 260 - 288 .
Liu N , Li Z , Xu J , et al . A Hierarchical Framework of Cloud Resource Allocation and Power Management Using Deep Reinforcement Learning [C ] // 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS) . IEEE , 2017 : 372 - 382 .
Houssein E H , Gad A G , et al . Task Scheduling in Cloud Computing based on Meta-heuristics: Review, Taxonomy, Open Challenges, and Future Trends [J ] . Swarm and Evolutionary Computation , 2021 , 62 : 100841 .
Beloglazov A , Buyya R . Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers [J ] . Concurrency and Computation: Practice and Experience , 2012 , 24 ( 13 ): 1397 - 1420 .
Yang X S , Deb S . Cuckoo search via Lévy flights [C ] // 2009 World Congress on Nature & Biologically Inspired Computing (NaBIC) . IEEE , 2009 : 210 - 214 .
Tirmazi M Y , Barker A , Deng N , et al . Borg: the next generation [C ] // Proceedings of the Fifteenth European Conference on Computer Systems . 2020 : 1 - 14 .
0
Views
0
下载量
0
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution
京公网安备11010802024621