基于Transformer-双Q网络的太赫兹NOMA通信网络即时功率分配

周音

doi:null

您当前的位置：

首页 >

文章列表页 >

基于Transformer-双Q网络的太赫兹NOMA通信网络即时功率分配

更新时间：2026-05-18

- 基于Transformer-双Q网络的太赫兹NOMA通信网络即时功率分配
- Instant power allocation for terahertz NOMA communication networks based on transformer-double deep Q network
- 电信科学 2026年
- 作者机构：
  
  中国人民解放军91977部队，北京 100036
- 作者简介：
- 基金信息：
- DOI：
  中图分类号： TN92
- 收稿：2026-03-06，
  
  修回：2026-04-11，
  
  录用：2026-05-13，
- 稿件说明：
移动端阅览
周音. 基于Transformer-双Q网络的太赫兹NOMA通信网络即时功率分配[J/OL]. 电信科学, 2026.

ZHOU Yin. Instant power allocation for terahertz NOMA communication networks based on transformer-double deep Q network[J/OL]. Telecommunications Science, 2026.
周音. 基于Transformer-双Q网络的太赫兹NOMA通信网络即时功率分配[J/OL]. 电信科学, 2026. DOI：

ZHOU Yin. Instant power allocation for terahertz NOMA communication networks based on transformer-double deep Q network[J/OL]. Telecommunications Science, 2026. DOI：

摘要

太赫兹非正交多址接入（non-orthogonal multiple access，NOMA）技术有望成为6G通信系统的关键突破性方案。其核心机制是通过利用超宽带资源与功率域复用，实现海量用户共享多个子频段，显著提升系统连接容量。为充分释放太赫兹NOMA系统的性能潜力，关键在于实现满足服务质量（quality of service，QoS）约束下的快速功率分配优化。提出了一种基于Transformer架构的双Q网络模型，通过Transformer学习不同用户分配策略的关联性，并采用双Q网络实现更稳定的决策优化过程。经训练后本算法可生成适应多种用户分布的即时功率分配策略。实验结果表明，训练完成的模型仅需毫秒级计算时间，即可实现接近穷举法的高吞吐量性能。该算法展现出较强的实时性与鲁棒性，具有较大工程应用潜力。

Abstract

Terahertz (THz) non-orthogonal multiple access (NOMA) was regarded as a candidate in 6G and beyond systems. By exploring the ultrabroad bandwidth and power domain

THz-NOMA could realize massive connectivity through assigning each sub-band to different users. To unleash the potential of the THz-NOMA system

it was significant to allocate power fast under quality of service (QoS) requirements. Focusing on the instant power allocation

a novel transformer-based double deep Q-network (DQN) solution adaptive for general user distributions was proposed in this paper. Transformer was used to learn the relationships among allocation strategies for different users

and a double DQN was adopted to achieve a more stable decision optimization process. The simulation results validated that the proposed algorithm realized the throughput close to the optimum given by exhaustive search method within millisecond level. The proposed method demonstrates high real-time performance and robustness

which suggests its high practicability.

关键词

Keywords

references

唐雄燕 , 李福昌 , 张忠皓 , 等 . 6G网络需求,架构及技术趋势 [J ] . 移动通信 , 2021 , 45 ( 4 ): 37 - 44 .

TANG X Y , LI F C , ZHANG Z H , et al . Requirements, architectures and technology trends of 6G network [J ] . Mobile Communications , 2021 , 45 ( 4 ): 37 – 44 .

WANG C X , YOU X , GAO X , et al . On the road to 6G: visions, requirements, key technologies, and testbeds [J ] . IEEE Communications Surveys & Tutorials , 2023 , 25 ( 2 ): 905 - 974 .

王战将 , 李凯乐 , 张飞翔 , 等 . 面向6G的太赫兹通信技术研究 [J ] . 移动通信 , 2025 , 49 ( 5 ): 121 - 127 .

WANG Z J , LI K L , ZHANG F X , et al . A review of terahertz communication technology for 6G [J ] . Mobile Communications , 2025 , 49 ( 5 ): 121 – 127 .

PETROV V , PYATTAEV A , MOLTCHANOV D , et al . Terahertz band communications: applications, research challenges, and standardization activities [C ] // Proceedings of 2016 8th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT) . Piscataway : IEEE Press , 2016 : 183 - 190 .

AKYILDIZ I F , HAN C , HU Z , et al . Terahertz band communication: an old problem revisited and research directions for the next decade [J ] . IEEE Transactions on Communications , 2022 , 70 ( 6 ): 4250 - 4285 .

XUE Q , JI C , MA S , et al . A survey of beam management for mmWave and THz communications towards 6G [J ] . IEEE Communications Surveys & Tutorials , 2024 , 26 ( 3 ): 1520 - 1559 .

DAI L , WANG B , YUAN Y , et al . Non-orthogonal multiple access for 5G: solutions, challenges, opportunities, and future research trends [J ] . IEEE Communications Magazine , 2015 , 53 ( 9 ): 74 - 81 .

SAITO Y , KISHIYAMA Y , BENJEBBOUR A , et al . Non-orthogonal multiple access (NOMA) for cellular future radio access [C ] // Proceedings of 2013 IEEE 77th Vehicular Technology Conference (VTC Spring) . Piscataway : IEEE Press , 2013 : 1 - 5 .

AHMED A , WANG X , HAWBANI A , et al . Unveiling the potential of NOMA: a journey to next-generation multiple access [J ] . IEEE Communications Surveys & Tutorials , 2024 , 27 ( 5 ): 3099 - 3164 .

DING Z , ADACHI F , POOR H V . The application of MIMO to non-orthogonal multiple access [J ] . IEEE Transactions on Wireless Communications , 2016 , 15 ( 1 ): 537 - 552 .

ZHANG X , HAN C , WANG X . Joint beamforming-power-bandwidth allocation in terahertz NOMA networks [C ] // Proceedings of 2019 16th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON) . Piscataway : IEEE Press , 2019 : 1 - 9 .

JIAO L , SHAO Y , SUN L , et al . Advanced deep learning models for 6G: overview, opportunities, and challenges [J ] . IEEE Access , 2024 , 12 : 133245 - 133314 .

MAHMOOD M R , MATIN M A , SARIGIANNIDIS P , et al . A comprehensive review on artificial intelligence/machine learning algorithms for empowering the future IoT toward 6G era [J ] . IEEE Access , 2022 , 10 : 87535 - 87562 .

SANJALAWE Y , FRAIHAT S , ABUALHAJ M , et al . A review of 6G and AI convergence: enhancing communication networks with artificial intelligence [J ] . IEEE Open Journal of the Communications Society , 2025 , 6 : 2308 - 2355 .

SAETAN W , THIPCHAKSURAT S . Power allocation for sum rate maximization in 5G NOMA system with imperfect SIC: a deep learning approach [C ] // Proceedings of 2019 4th International Conference on Information Technology (InCIT) . Piscataway : IEEE Press , 2019 : 195 - 198 .

XIAO L , LI Y , DAI C , et al . Reinforcement learning-based NOMA power allocation in the presence of smart jamming [J ] . IEEE Transactions on Vehicular Technology , 2018 , 67 ( 4 ): 3377 - 3389 .

ZHANG Y , WANG X , XU Y . Energy-efficient resource allocation in uplink NOMA systems with deep reinforcement learning [C ] // Proceedings of 2019 11th International Conference on Wireless Communications and Signal Processing (WCSP) . Piscataway : IEEE Press , 2019 : 1 - 6 .

VAN H H , GUEZ A , SILVER D . Deep reinforcement learning with double Q-learning [C ] //Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16). Washington D.C: AAAI Press , 2016 : 2094 – 2100 .

ZHAO D , LIU D , LEWIS F L , et al . Special issue on deep reinforcement learning and adaptive dynamic programming [J ] . IEEE Transactions on Neural Networks and Learning Systems , 2018 , 29 ( 6 ): 2038 - 2041 .

李伟琨 , 姚信威 , 王万良 , 等 . 太赫兹通信中MIMO信道建模与容量分析 [J ] . 计算机工程 , 2015 , 41 ( 4 ): 4 .

LI W K , YAO X W , WANG W L , et al . MIMO channel modeling and capacity analysis in terahertz communication [J ] . Computer Engineering , 2015 , 41 ( 4 ): 4 .

ALHAJ N A , JAMLOS M F , MANAP S A , et al . Integration of hybrid networks, AI, ultra massive-MIMO, THz frequency, and FBMC modulation toward 6G requirements: a review [J ] . IEEE Access , 2023 , 12 : 483 - 513 .

YAN L , HAN C , YUAN J . Energy-efficient dynamic-subarray with fixed true-time-delay design for terahertz wideband hybrid beamforming [J ] . IEEE Journal on Selected Areas in Communications , 2022 , 40 ( 10 ): 2840 - 2854 .

HU Z , HAN C , DENG Y , et al . Multi-task deep reinforcement learning for terahertz NOMA resource allocation with hybrid discrete and continuous actions [J ] . IEEE Transactions on Vehicular Technology , 2024 , 73 ( 8 ): 11647 - 11663 .

HAN C , WANG Y , LI Y , et al . Terahertz wireless channels: a holistic survey on measurement, modeling, and analysis [J ] . IEEE Communications Surveys & Tutorials , 2022 , 24 ( 3 ): 1670 - 1707 .

WANG J , WANG C X , HUANG J , et al . 6G THz propagation channel characteristics and modeling: recent developments and future challenges [J ] . IEEE Communications Magazine , 2022 , 62 ( 2 ): 56 - 62 .

ISLAM S M R , AVAZOV N , DOBRE O A , et al . Power-domain non-orthogonal multiple access (NOMA) in 5G systems: potentials and challenges [J ] . IEEE Communications Surveys & Tutorials , 2016 , 19 ( 2 ): 721 - 742 .

HE C , HU Y , CHEN Y , et al . Joint power allocation and channel assignment for NOMA with deep reinforcement learning [J ] . IEEE Journal on Selected Areas in Communications , 2019 , 37 ( 10 ): 2200 - 2210 .

WEN M , LIN R , WANG H , et al . Large sequence models for sequential decision-making: a survey [J ] . Frontiers of Computer Science , 2023 , 17 ( 6 ): 176349 .

YUAN W , CHEN J , CHEN S , et al . Transformer in reinforcement learning for decision-making: a survey [J ] . Frontiers of Information Technology & Electronic Engineering , 2024 , 25 ( 6 ): 763 - 790 .

陈卓 , 冯钢 , 何颖 , 等 . 运营商网络中基于深度强化学习的服务功能链迁移机制 [J ] . 电子与信息学报 , 2020 , 42 ( 9 ): 2173 - 2179 .

CHEN Z , FENG G , HE Y , et al . Deep reinforcement learning based migration mechanism for service function chain in operator networks [J ] . Journal of Electronics & Information Technology , 2020 , 42 ( 9 ): 2173 – 2179 .

TERMEHCHI A , BAO T , SYED A , et al . Goal-oriented reinforcement learning in THz-enabled UAV-aided network using supervised learning [J ] . IEEE Open Journal of the Communications Society , 2024 , 5 : 5027 - 5036 .

KHALILI A , MONFARED E M , ZARGARI S , et al . Resource management for transmit power minimization in UAV-assisted RIS HetNets supported by dual connectivity [J ] . IEEE Transactions on Wireless Communications , 2021 , 21 ( 3 ): 1806 - 1822 .

浏览量

下载量

CSCD

文章被引用时，请邮件提醒。

提交

工具集

关联资源

一种面向资源高效利用的无蜂窝RAN分层协同资源分配算法

一种基于深度强化学习的海上MEC任务卸载和资源分配优化算法

双模介质通信mesh网络吞吐量分析与优化

基于SDN和NFV的空天地一体化网络任务部署与恢复综述

面向智算中心间互联的光网络关键技术研究