The continuous expansion of remote-sensing satellite constellations has significantly enhanced their comprehensive application capabilities in resource surveys
environmental monitoring
and emergency disaster response. However
the burgeoning scale of these constellations markedly increases the complexity of mission planning. While multi-agent reinforcement learning is an effective approach for large-scale mission planning
the surge in satellite numbers leads to the challenge of dimensionality explosion in the state space. To address this bottleneck
a multi-head graph attention driven feature aggregation (MGADFA) algorithm for mission planning in giant remote-sensing constellations was proposed. This method utilized multi-head graph-attention to capture dynamic interaction weights between satellites and employed a feature aggregation mechanism to transform complex multi-body interactions into individual-population associations. While preserving key cooperative features
this approach reduced the joint decision space from an exponential scale to a linear dimension. Simulation results demonstrated that the proposed algorithm exhibited superior performance in terms of task throughput and load balancing.
Hu X X , Wang Z L , Xia W , et al . Mission planning technology for remote sensing satellite: status and prospect [J ] . Journal of Command and Control , 2023 , 9 ( 5 ): 495 - 507 .
Peng G S , Song G P , Xing L N , et al . An exact algorithm for agile earth observation satellite scheduling with time-dependent profits [J ] . Computers & Operations Research , 2020 , 120 : 104946 .
Zeng G M , Zhan Y F , Xie H R , et al . Resource allocation for networked telemetry system of mega LEO satellite constellations [J ] . IEEE Transactions on Communications , 2022 , 70 ( 12 ): 8215 - 8228 .
Wei L N , Chen M , Xing L N , et al . Knowledge-transfer based genetic programming algorithm for multi-objective dynamic agile earth observation satellite scheduling problem [J ] . Swarm and Evolutionary Computation , 2024 , 85 : 101460 .
Zhang Z H , Zhang J Y , Zhang J K , et al . Multi-satellite cooperative mission planning method based on genetic tabu algorithm [J ] . Radio Engineering , 2022 , 52 ( 7 ): 1127 - 1135 .
Chen Y , Zhang D Y , Zhou M Q , et al . Multi-satellite observation scheduling algorithm based on hybrid genetic particle swarm optimization [M ] // Advances in Information Technology and Industry Applications . Berlin, HeidelbergSpringer 2012 : 441 - 448 .
Gu Y , Han C , Chen Y H , et al . Large region targets observation scheduling by multiple satellites using resampling particle swarm optimization [J ] . IEEE Transactions on Aerospace and Electronic Systems , 2023 , 59 ( 2 ): 1800 - 1815 .
Han C , Gu Y , Wu G H , et al . Simulated annealing-based heuristic for multiple agile satellites scheduling under cloud coverage uncertainty [J ] . IEEE Transactions on Systems, Man, and Cybernetics: Systems , 2023 , 53 ( 5 ): 2863 - 2874 .
Khojah H A , Mosa M A . Multi-objective optimization for multi-satellite scheduling task: Multi-objective optimization for multi-satellite scheduling task [J ] . Journal of Soft Computing Exploration , 2022 , 3 ( 1 ): 19 - 30 .
Liu Y C , Chen Q F , Li C Y , et al . Mission planning for Earth observation satellite with competitive learning strategy [J ] . Aerospace Science and Technology , 2021 , 118 : 107047 .
Chen X Y , Tian T , Dai G M , et al . Deep reinforcement learning-based resource allocation method for multi-satellite scheduling [J ] . Computers & Operations Research , 2025 , 181 : 107088 .
Ou J W , Xing L N , Yao F , et al . Deep reinforcement learning method for satellite range scheduling problem [J ] . Swarm and Evolutionary Computation , 2023 , 77 : 101233 .
Li Y Y , Shi H Y , Zhao T . On-orbit distributed negotiation intelligent mission planning for instant response [J ] . Chinese Journal of Space Science , 2024 , 44 ( 1 ): 159 - 168 .
Herrmann A , Stephenson M A , Schaub H . Single-agent reinforcement learning for scalable earth-observing satellite constellation operations [J ] . Journal of Spacecraft and Rockets , 2024 , 61 ( 1 ): 114 - 132 .
Zhang G H , Li X H , Hu G X , et al . MARL-based multi-satellite intelligent task planning method [J ] . IEEE Access , 2023 , 11 : 135517 - 135528 .
Lowe R , Wu Y , Tamar A , et al . Multi-agent actor-critic for mixed cooperative-competitive environments [C ] // Proceedings of the 31st International Conference on Neural Information Processing Systems . New York : ACM , 2017 : 6382 - 6393 .
Liu S , Chen Y W , Xing L N , et al . Time-dependent autonomous task planning of agile imaging satellites [J ] . Journal of Intelligent & Fuzzy Systems , 2016 , 31 ( 3 ): 1365 - 1375 .
Liu Z , Gao L , Chai Y . Research on task scheduling mechanism of distributed satellite system [J ] . Journal of Computational Information Systems , 2014 , 10 ( 24 ): 10703 - 10713 .
Nowé A , Vrancx P , De Hauwere Y M . Game theory and multi-agent reinforcement learning [M ] //Wiering M, van Otterlo M. Reinforcement Learning: State-of-the-Art . Berlin, Heidelberg : Springer , 2012 : : 441 - 470 .
Veličković P , Cucurull G , Casanova A , et al . Graph attention networks [C ] // Proceedings of the International Conference on Learning Representations (ICLR) . 2018 .
Zhang S , Tong H H , Xu J J , et al . Graph convolutional networks: a comprehensive review [J ] . Computational Social Networks , 2019 , 6 : 11 .
Mnih V , Kavukcuoglu K , Silver D , et al . Human-level control through deep reinforcement learning [J ] . Nature , 2015 , 518 ( 7540 ): 529 - 533 .
Yu C , Velu A , Vinitsky E , et al . The surprising effectiveness of PPO in cooperative multi-agent games [C ] // Proceedings of the 36th International Conference on Neural Information Processing Systems . New York : ACM , 2022 : 24611 - 24624 .
Abdi H . Coefficient of variation [J ] . Encyclopedia of Research Design , 2010 , 1 ( 5 ): 169 - 171 .