计算机科学 ›› 2023, Vol. 50 ›› Issue (8): 233-242.doi: 10.11896/jsjkx.220900181
所属专题: 智能化边缘计算
张乃心1, 陈霄睿1, 李安1, 杨乐瑶1, 吴华明2
ZHANG Naixin1, CHEN Xiaorui1, LI An1, YANG Leyao1, WU Huaming2
摘要: 物联网设备中大量未被充分利用的计算资源,正是移动边缘计算所需要的。一种基于设备对设备通信技术和无线充电技术的边缘卸载框架,可以最大化利用闲置物联网设备的计算资源,提升用户体验。在此基础上,可以建立物联网设备的D2D-MEC网络模型。在该模型中,主设备根据当前环境信息和估计的设备状态信息,选择向多个边缘设备卸载不同数量的任务,并应用无线充电技术提升传输的成功率和计算的稳定性。运用强化学习方法解决任务分配和资源分配的联合优化问题,也就是最小化计算延迟、能量消耗和任务丢弃损失,最大化边缘设备利用率和任务卸载比例的优化问题。除此之外,为了适应状态空间更大的情况,提高学习速度,提出了一种基于深度强化学习的卸载方案。基于以上理论和模型,使用数学推导计算出了D2D-MEC系统的最优解及性能上限。仿真实验证明了D2D-MEC卸载模型及其卸载策略的综合性能更好,更能充分利用物联网设备的计算资源。
中图分类号:
[1]BOYD B,SHILPA T,REZA A,et al.Networks and devices for the 5G era[J].IEEE Communications Magazine,2014,52(2):90-96. [2]KO H,PACK S.Distributed Device-to-Device Offloading Sys-tem:Design and Performance Optimization[J].IEEE Transactions on Mobile Computing,2021,20(10):2949-2960. [3]MOLIN L,CHEN T,ZENG J,et al.D2D-Assisted Computation Offloading for Mobile Edge Computing Systems with Energy Harvesting[C]//2019 20th International Conference on Parallel and Distributed Computing,Applications and Technologies (PDCAT).Gold Coast,QLD,Australia:2019:90-95. [4]MAO Y,ZHANG J,LETAIEF K B.Dynamic computation offloading for mobile-edge computing with energy harvesting devices[J].IEEE Journal on Selected Areas in Communications,2016,34(12):3590-3605. [5]PU L,CHEN X,XU J,et al.D2D Fogging:An Energy-Efficient and Incentive-Aware Task Offloading Framework via Network-assisted D2D Collaboration[J].IEEE Journal on Selected Areas in Communications,2016,34(12):3887-3901. [6]FANG T,YANG Y,CHEN J X.Optimization of offloadingstrategy under assisted mobile edge computing[J].Computer Science,2022,49(S1):601-605. [7]QIAO G H,LENG S P,ZHANG Y.Online Learning and Optimization for Computation Offloading in D2D Edge Computing and Networks[J].Mobile Networks & Applications,2019,27:1111-1122. [8]GAIBIN L,CHEN M,WEI X,et al.Computation OffloadingWith Reinforcement Learning in D2D-MEC Network [C]//2020 International Wireless Communications and Mobile Computing (IWCMC).Limassol,Cyprus,2020:69-74. [9]HUANG L,FENG X,QIAN L,et al.Deep ReinforcementLearning-Based Task Offloading and Resource Allocation for Mobile Edge Computing[C]//Machine Learning and Intelligent Communications(MLICOM).Cham:Springer,2018:33-42. [10]LIANG W,WANG K,PAN C,et al.Deep Reinforcement Lear-ning Based Dynamic Trajectory Control for UAV-assisted Mobile Edge Computing[J].arXiv:1911.03887v2,2019. [11]MINGHUI M,XIAO L,CHEN Y,et al.Learning-Based Computation Offloading for IoT Devices With Energy Harvesting[J].IEEE Transactions on Vehicular Technology,2019,68(2):1930-1941. [12]HUANG L,BI S,ZHANG Y J A.Deep Reinforcement Learning for Online Computation Offloading in Wireless Powered Mobile-Edge Computing Networks[J].IEEE Transactions on Mobile Computing,2020,19(11):2581-2593. [13]KAN L P G,HAUSWALD J,GAO C,et al.Collaborative Intelligence Between the Cloud and Mobile Edge[J].Acm Sigplan Notices A Monthly Publication of the Special Interest Group on Programming Languages,2017,45(1):615-629. [14]WANG Z C,HUA J W,ZHU J Q,et al.Vehicle task offloading based on deep reinforcement learning for parking cooperation[J].Small Microcomputer System,2023,44(3):658-664. [15]LIANG H,ZHANG L,YANG S,et al.Meta-Learning BasedDynamic Computation Task Offloading for Mobile Edge Computing Networks[J].IEEE Communications Letters,2021,25(5):1568-1572. [16]DENG S Q,YE X G.Multi-objective task offloading algorithm based on deep Q network[J].Computer Application,2022,42(6):1668-1674. [17]LIU P F,MAO Y C,WANG B L.Task allocation method based on cloud-fog collaboration model[J].Computer Application,2019,39(1):8-14. [18]CHENG P,ZHANG W Z,XIE S H,et al.Research on multi-objective balanced task offloading method for edge computing of Internet of Vehicles[J].Small Microcomputer System,2022,43(9):1992-1997. [19]DINH Q,TANG J,LA Q,et al.Offloading in Mobile Edge Computing:Task Allocation and Computational Frequency Scaling[J].IEEE Transactions on Communications,2017,65(8):3571-3584. [20]ZHAO S,LI Y,WU D,et al.Current-Decomposition-Based Di-gital Phase Synchronization Method for BWPT System[J].IEEE Transactions on Power Electronics,2021,36(11):12183-12188. [21]WATKINS C.Learning from delayed rewards [D].Cambridge:King's College of University of Cambridge,1989. [22]CHRISTOPHER J,PETER D.Q-learning[J].Machine Lear-ning,1992,8(3/4):279-292. [23]XIAO L,LI Y,HUANG X,et al.Cloud-based malware detection game for mobile devices with offloading[J].IEEE Trans Mobile Computing,2017,16(10):2742-2750. [24]LI J,GAO H,LV T.Deep reinforcement learning based computation offloading and resource allocation for MEC[C]//IEEE Wireless Communication and Networking Conf(WCNC).2018:1-6. |
|