1.中国铁塔股份有限公司 北京 100089
2.中移(苏州)软件技术有限公司 苏州 215000
[ "张阔(1988-),男,香港理工大学机械工程硕士,现任中国铁塔股份有限公司通信技术研究院高级经理、高级工程师,主要从事边缘计算、算力网络方向的技术研究和研发创新。" ]
[ "闫亚旗(1988-),男,北京理工大学电子信息硕士,现任中国铁塔股份有限公司通信技术研究院高级经理、高级工程师,主要研究方向为物联网、边缘计算、算力网络相关技术及产品创新。" ]
[ "安颖(1992-),女,北京邮电大学信息安全硕士,现任中国铁塔股份有限公司通信技术研究院经理,主要研究方向为算力网络、智算网络架构和云原生技术等。" ]
[ "董玉池(1988-),男,电子科技大学计算机硕士,现任中国铁塔股份有限公司通信技术研究院高级经理,主要研究方向为边缘计算、算力网络和在网计算等。" ]
[ "张民贵(1980-),男,清华大学计算机科学与技术博士,现任中国铁塔股份有限公司通信技术研究院技术总监、高级工程师,主要研究方向为算力网络、工业互联网、边缘计算、数据中心网络等。" ]
[ "袁刚(1989-),男,南京邮电大学通信与信息系统专业硕士,现任中移(苏州)软件技术有限公司产品经理,主要研究方向为云原生+智算技术体系构建、产品化落地。" ]
收稿:2026-01-12,
修回:2026-02-12,
录用:2026-03-20,
移动端阅览
张阔, 闫亚旗, 安颖, 等. DS-PPO:面向高并发CNN模型推理的云边端异构硬件资源协同调度[J/OL]. 电信科学, 2026.
ZHANG Kuo, YAN Yaqi, AN Ying, et al. DS-PPO: Cloud-Edge-End Collaborative Scheduling of Heterogeneous Hardware Resources for Highly Concurrent CNN Model Inference[J/OL]. Telecommunications Science, 2026.
随着人工智能物联网(AIoT)技术的发展和硬件性能的提升,云边端协同场景下引入卷积神经网络(CNN)模型进行推理已成为趋势,以满足智能AI应用对实时性的需求。针对异构计算资源(如CPU、GPU、NPU、FPGA)的高效利用问题,提出了一种创新的任务调度系统,以支持高并发CNN模型推理。该系统采用DeepSets模型进行任务和环境特征提取,并提升近端策略优化算法的泛化能力实现任务调度决策,确保在复杂多变的环境中实现精准高效的任务分配。系统已在Kubernetes和Kosmos平台上部署,并通过高性能服务器、NVIDIA Jetson Xavier NX和NVIDIA Jetson Nano等异构处理单元进行了验证。实验结果显示,相较于现有调度算法,该系统的性能提升了275.93%,资源利用率提高38.86%,显著提高了云边端计算的效率和可靠性,为智能AI应用提供了坚实的支持。
With the advancement of artificial intelligence of things (AIoT) technology and the enhancement of hardware performance
convolutional neural network (CNN) models were increasingly introduced for inference in collaborative cloud-edge-end scenarios to satisfy real-time requirements of intelligent AI applications. To address the efficient utilization of heterogeneous computing resources including CPUs
GPUs
NPUs
and FPGAs
an innovative task scheduling system was proposed to support high-concurrency CNN model inference. The DeepSets model was employed to extract features from tasks and environmental constraints
while the generalization capability of the Proximal Policy Optimization (PPO) algorithm was enhanced to achieve task scheduling decisions. Precise and efficient task allocation was ensured in complex dynamic environments through this approach. The proposed system was deployed on Kubernetes and Kosmos platforms
with comprehensive validation conducted across various heterogeneous processing units
including high-performance servers
NVIDIA Jetson Xavier NX
and NVIDIA Jetson Nano devices. Experimental results demonstrate that compared to existing scheduling algorithms
the proposed system improves performance by 275.93% and increases resource utilization by 38.86%
significantly enhancing the efficiency and reliability of cloud-edge computing frameworks and providing robust support for intelligent AI applications.
Hosseinzadeh M , Azhir E , Lansky J , et al . Task scheduling mechanisms for fog computing: a systematic survey [J ] . IEEE Access , 2023 , 11 : 50994 - 51017 .
Jiang M , Wu T , Wang Z , et al . A multi-intersection vehicular cooperative control based on end-edge-cloud computing [J ] . IEEE Transactions on Vehicular Technology , 2022 , 71 ( 3 ): 2459 - 2471 .
Ren J , Jiang H , Shen X , et al . Editorial of ccf transactions on networking: special issue on intelligence-enabled end-edge-cloud orchestrated computing [J ] . CCF Transactions on Networking , 2020 , 3 ( 3 ): 155 - 157 .
Ren J , Zhang D , He S , et al . A survey on end-edge-cloud orchestrated network computing paradigms: Transparent computing, mobile edge computing, fog computing, and cloudlet [J ] . ACM Computing Surveys (CSUR) , 2019 , 52 ( 6 ): 1 - 36 .
Attiya I , Abd Elaziz M , Abualigah L , et al . An improved hybrid swarm intelligence for scheduling IoT application tasks in the cloud [J ] . IEEE Transactions on Industrial Informatics , 2022 , 18 ( 9 ): 6264 - 6272 .
Alsaidy S A , Abbood A D , Sahib M A . Heuristic initialization of PSO task scheduling algorithm in cloud computing [J ] . Journal of King Saud University-Computer and Information Sciences , 2022 , 34 ( 6 ): 2370 - 2382 .
Wang L , Pan Z , Wang J . A review of reinforcement learning based intelligent optimization for manufacturing scheduling [J ] . Complex System Modeling and Simulation , 2021 , 1 ( 4 ): 257 - 270 .
Houssein E H , Gad A G , Wazery Y M , et al . Task scheduling in cloud computing based on meta-heuristics: review, taxonomy, open challenges, and future trends [J ] . Swarm and Evolutionary Computation , 2021 , 62 : 100841 .
Yuan H , Bi J , Zhou M C . Multiqueue scheduling of heterogeneous tasks with bounded response time in hybrid green IaaS clouds [J ] . IEEE Transactions on Industrial Informatics , 2019 , 15 ( 10 ): 5404 - 5412 .
Zhou J , Sun J , Cong P , et al . Security-critical energy-aware task scheduling for heterogeneous real-time MPSoCs in IoT [J ] . IEEE Transactions on Services Computing , 2019 , 13 ( 4 ): 745 - 758 .
Carrión C . Kubernetes scheduling: Taxonomy, ongoing issues and challenges [J ] . ACM Computing Surveys , 2022 , 55 ( 7 ): 1 - 37 .
Narayanan D , Santhanam K , Kazhamiaka F , et al . {Heterogeneity-Aware} cluster scheduling policies for deep learning workloads [C ] // 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20). 2020 : 481 - 498 .
Feng J , Zhang W , Pei Q , et al . Heterogeneous computation and resource allocation for wireless powered federated edge learning systems [J ] . IEEE Transactions on Communications , 2022 , 70 ( 5 ): 3220 - 3233 .
Zhong Z , Buyya R . A cost-efficient container orchestration strategy in kubernetes-based cloud computing infrastructures with heterogeneous resources [J ] . ACM Transactions on Internet Technology (TOIT) , 2020 , 20 ( 2 ): 1 - 24 .
Abdulazeez D H , Askar S K . Offloading mechanisms based on reinforcement learning and deep learning algorithms in the fog computing environment [J ] . Ieee Access , 2023 , 11 : 12555 - 12586 .
Jung Y , Park J . Bayesian Convolutional Deep Sets with Task-Dependent Stationary Prior [C ] // International Conference on Artificial Intelligence and Statistics . PMLR , 2023 : 3795 - 3824 .
He H , Meng X , Wang Y , et al . Deep reinforcement learning based energy management strategies for electrified vehicles: Recent advances and perspectives [J ] . Renewable and Sustainable Energy Reviews , 2024 , 192 : 114248 .
Xie B , Bian Y , Chen Y , et al . Enhancing neural subset selection: Integrating background information into set representations [J ] . arXiv preprint arXiv: 2402.03139 , 2024 .
Kim C H , Kim S J , Choi H L . Heterogeneous satellite task scheduling with revisit time minimization using milp formulation [C ] //AIAA SCITECH 2024 F orum . 2024 : 0747 .
Cui H , Tang Z , Lou J , et al . Latency-aware container scheduling in edge cluster upgrades: A deep reinforcement learning approach [J ] . IEEE Transactions on Services Computing , 2024 , 17 ( 5 ): 2530 - 2543 .
Cheng Y , Guo Q , Wang X . Proximal policy optimization with advantage reuse competition [J ] . IEEE Transactions on Artificial Intelligence , 2024 , 5 ( 8 ): 3915 - 3925 .
Du Y , Li J . A deep reinforcement learning based algorithm for a distributed precast concrete production scheduling [J ] . International Journal of Production Economics , 2024 , 268 : 109102 .
Yassami M , Ashtari P . A novel hybrid optimization algorithm: Dynamic hybrid optimization algorithm [J ] . Multimedia Tools and Applications , 2023 , 82 ( 21 ): 31947 - 31979 .
Premkumar M , Sowmya R , Ramakrishnan C , et al . An efficient and reliable scheduling algorithm for unit commitment scheme in microgrid systems using enhanced mixed integer particle swarm optimizer considering uncertainties [J ] . Energy Reports , 2023 , 9 : 1029 - 1053 .
Rouskas G N . First-Fit: A universal algorithm for spectrum assignment [C ] // GLOBECOM 2023-2023 IEEE Global Communications Conference . IEEE , 2023 : 2123 - 2128 .
0
浏览量
0
下载量
0
CSCD
关联资源
相关文章
相关作者
相关机构
京公网安备11010802024621