Scheduling UAV Swarm with Attention-based Graph Reinforcement Learning for Ground-to-air Heterogeneous Data Communication

Authors:
Jiyuan Ren

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

0009-0005-4698-6682
View Profile

,
Yanggang Xu

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

0000-0003-0009-3553
View Profile

,
Zuxin Li

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

Tsinghua-Berkeley Shenzhen Institute, Tsinghua University, China

0000-0001-9681-2940
View Profile

,
Chaopeng Hong

Tsinghua Shenzhen International Graduate School, Tsinghua University, China and Key Laboratory of Ecological Remediation and Carbon Sequestration, Institute of Environment and Ecology, China

Tsinghua Shenzhen International Graduate School, Tsinghua University, China and Key Laboratory of Ecological Remediation and Carbon Sequestration, Institute of Environment and Ecology, China

0000-0002-8825-9062
View Profile

,
Xiao-Ping Zhang

Tsinghua Shenzhen International Graduate School, Tsinghua University, China and RISC-V International Open Source Laboratory, China

Tsinghua Shenzhen International Graduate School, Tsinghua University, China and RISC-V International Open Source Laboratory, China

0000-0001-5241-0069
View Profile

,
Xinlei Chen

Shenzhen International Graduate School, Tsinghua University, China and Pengcheng Laboratory, China

Shenzhen International Graduate School, Tsinghua University, China and Pengcheng Laboratory, China

0000-0001-8271-5023
View Profile

UbiComp/ISWC '23 Adjunct: Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable ComputingOctober 2023Pages 670–675https://doi.org/10.1145/3594739.3612905

Published:08 October 2023Publication History

UbiComp/ISWC '23 Adjunct: Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing

Pages 670–675

ABSTRACT

In disaster scenarios, unmanned aerial vehicles (UAVs) can serve as mobile base stations because of their maneuverability and synergy. However, due to constrained UAV communication capabilities and limited battery life, UAV base stations resource allocation for mobile sensors in a data-heterogeneous environment is a significant challenge when optimizing communication quality. To address this, we propose AGUZero, an attention-based graph reinforcement learning (RL) framework. Inspired by MuZero [27], AGUZero is designed to handle dynamic and uncontrollable environments based on Monte Carlo Tree Search (MCTS). Additionally, to tackle data heterogeneity, AGUZero represents the states using heterogeneous sub-graphs and employs an attention-based model to capture relationships among UAVs and sensors. The experimental results show that AGUZero outperforms other baseline models consistently when either the number of UAVs or the number of sensors is varying. AGUZero improves the data transmission ratio by 11.03% and 10.35% in the two cases respectively.

References

Shakil Ahmed and Boulat A. Bash. 2019. Average Worst-Case Secrecy Rate Maximization via UAV and Base Station Resource Allocation. 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton) (2019), 1176–1181.Google ScholarDigital Library
Akram Al-Hourani, Sithamparanathan Kandeepan, and Simon Lardner. 2014. Optimal LAP Altitude for Maximum Coverage. IEEE Wireless Communications Letters 3, 6 (2014), 569–572. https://doi.org/10.1109/LWC.2014.2342736Google ScholarCross Ref
Mohamed Alzenad, Amr El-Keyi, Faraj Lagum, and Halim Yanikomeroglu. 2017. 3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage. IEEE Wireless Communications Letters 6 (2017), 434–437.Google ScholarCross Ref
Xinlei Chen, Aveek Purohit, Carlos Ruiz Dominguez, Stefano Carpin, and Pei Zhang. 2015. Drunkwalk: Collaborative and adaptive planning for navigation of micro-aerial sensor swarms. In Proceedings of the 13th ACM Conference on Embedded Networked Sensor Systems. 295–308.Google ScholarDigital Library
Xinlei Chen, Aveek Purohit, Shijia Pan, Carlos Ruiz, Jun Han, Zheng Sun, Frank Mokaya, Patric Tague, and Pei Zhang. 2017. Design experiences in minimalistic flying sensor node platform through sensorfly. ACM Transactions on Sensor Networks (TOSN) 13, 4 (2017), 1–37.Google ScholarDigital Library
Xinlei Chen, Aveek Purohit, Shijia Pan, Carlos Ruiz, Jun Han, Zheng Sun, Frank Mokaya, Patric Tague, and Pei Zhang. 2017. Design Experiences in Minimalistic Flying Sensor Node Platform through SensorFly. ACM Trans. Sen. Netw. 13, 4, Article 33 (nov 2017), 37 pages. https://doi.org/10.1145/3131779Google ScholarDigital Library
Xinlei Chen, Carlos Ruiz, Sihan Zeng, Liyao Gao, Aveek Purohit, Stefano Carpin, and Pei Zhang. 2020. H-DrunkWalk: Collaborative and adaptive navigation for heterogeneous MAV swarm. ACM Transactions on Sensor Networks (TOSN) 16, 2 (2020), 1–27.Google ScholarDigital Library
Xuecheng Chen, Haoyang Wang, Zuxin Li, Wenbo Ding, Fan Dang, Chengye Wu, and Xinlei Chen. 2023. DeliverSense: Efficient Delivery Drone Scheduling for Crowdsensing with Deep Reinforcement Learning. In Adjunct Proceedings of the 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers (Cambridge, United Kingdom) (UbiComp/ISWC ’22 Adjunct). Association for Computing Machinery, New York, NY, USA, 403–408. https://doi.org/10.1145/3544793.3560412Google ScholarDigital Library
Xinlei Chen, Susu Xu, Haohao Fu, Carlee Joe-Wong, Lin Zhang, Hae Young Noh, and Pei Zhang. 2019. ASC: Actuation System for City-Wide Crowdsensing with Ride-Sharing Vehicular Platform. In Proceedings of the Fourth Workshop on International Science of Smart City Operations and Platforms Engineering (Montreal, Quebec, Canada) (SCOPE ’19). Association for Computing Machinery, New York, NY, USA, 19–24. https://doi.org/10.1145/3313237.3313299Google ScholarDigital Library
Xinlei Chen, Susu Xu, Jun Han, Haohao Fu, Xidong Pi, Carlee Joe-Wong, Yong Li, Lin Zhang, Hae Young Noh, and Pei Zhang. 2020. PAS: Prediction-Based Actuation System for City-Scale Ridesharing Vehicular Mobile Crowdsensing. IEEE Internet of Things Journal 7, 5 (2020), 3719–3734. https://doi.org/10.1109/JIOT.2020.2968375Google ScholarCross Ref
Xinlei Chen, Susu Xu, Xinyu Liu, Xiangxiang Xu, Hae Young Noh, Lin Zhang, and Pei Zhang. 2020. Adaptive hybrid model-enabled sensing system (HMSS) for mobile fine-grained air pollution estimation. IEEE Transactions on Mobile Computing 21, 6 (2020), 1927–1944.Google ScholarCross Ref
Xinlei Chen, Xiangxiang Xu, Xinyu Liu, Hae Young Noh, Lin Zhang, and Pei Zhang. 2016. Hap: Fine-grained dynamic air pollution map reconstruction by hybrid adaptive particle filter. In Proceedings of the 14th ACM Conference on Embedded Network Sensor Systems CD-ROM. 336–337.Google ScholarDigital Library
Xinlei Chen, Yulei Zhao, Yong Li, Xu Chen, Ning Ge, and Sheng Chen. 2018. Social trust aided D2D communications: Performance bound and implementation mechanism. IEEE Journal on Selected Areas in Communications 36, 7 (2018), 1593–1608.Google ScholarDigital Library
LightZero Contributors. 2023. LightZero: OpenDILab A lightweight and efficient toolkit designed for the MCTS, AlphaZero, and MuZero family of algorithms.https://github.com/opendilab/LightZero.Google Scholar
Zipeng Dai, Chi Harold Liu, Yuxiao Ye, Rui Han, Ye Yuan, Guoren Wang, and Jian Tang. 2022. AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning. In IEEE INFOCOM 2022 - IEEE Conference on Computer Communications. 1029–1038. https://doi.org/10.1109/INFOCOM48880.2022.9796732Google ScholarDigital Library
C. DeepakG., Alexandros Ladas, Yusuf A. Sambo, Haris Bin Pervaiz, Christos Politis, and Muhammad Ali Imran. 2019. An Overview of Post-Disaster Emergency Communication Systems in the Future Networks. IEEE Wireless Communications 26 (2019), 132–139.Google ScholarDigital Library
Zuxin Li, Fanhang Man, Xuecheng Chen, Baining Zhao, Chenye Wu, and Xinlei Chen. 2023. TRACT: Towards Large-Scale Crowdsensing With High-Efficiency Swarm Path Planning. In Adjunct Proceedings of the 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers (Cambridge, United Kingdom) (UbiComp/ISWC ’22 Adjunct). Association for Computing Machinery, New York, NY, USA, 409–414. https://doi.org/10.1145/3544793.3560401Google ScholarDigital Library
Wei Liu, Yi Ding, Shuai Wang, Yu Yang, and Desheng Zhang. 2022. Para-Pred: Addressing Heterogeneity for City-Wide Indoor Status Estimation in On-Demand Delivery. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3407–3417.Google ScholarDigital Library
Fangxu Lu, Zhichao Mi, Ning Zhao, Hai Wang, and Yulu Tian. 2022. 3D Deployment of Dynamic UAV Base Station based on Mobile Users. In 2021 International Conference on Advanced Computing and Endogenous Security. 1–5. https://doi.org/10.1109/IEEECONF52377.2022.10013331Google ScholarCross Ref
Jianqiang Lu, Xiangpeng Xie, Xia Zhou, and Chengjie Bu. [n. d.]. Research on power-communication coordination recovery strategy based on grid dividing after extreme disasters. 675, 1 ([n. d.]), 012038. https://doi.org/10.1088/1755-1315/675/1/012038Google ScholarCross Ref
Junhai Luo, Zhiyan Wang, Ming Xia, Linyong Wu, Yuxin Tian, and Yu Chen. [n. d.]. Path Planning for UAV Communication Networks: Related Technologies, Solutions, and Opportunities. ([n. d.]), 3560261. https://doi.org/10.1145/3560261Google ScholarDigital Library
Maurilio Matracia, Nasir Saeed, Mustafa A. Kishk, and Mohamed-Slim Alouini. 2022. Post-Disaster Communications: Enabling Technologies, Architectures, and Open Challenges. IEEE Open Journal of the Communications Society 3 (2022), 1177–1205.Google ScholarCross Ref
Luoyu Mei, Zhimeng Yin, Xiaolei Zhou, Shuai Wang, and Kai Sun. 2021. ECCR: Edge-Cloud Collaborative Recovery for Low-Power Wide-Area Networks Interference Mitigation. In Wireless Algorithms, Systems, and Applications: 16th International Conference, WASA 2021, Nanjing, China, June 25–27, 2021, Proceedings, Part I 16. Springer, 494–507.Google Scholar
Mohammad Mozaffari, Walid Saad, Mehdi Bennis, and Merouane Debbah. 2015. Drone Small Cells in the Clouds: Design, Deployment and Performance Analysis. In 2015 IEEE Global Communications Conference (GLOBECOM). 1–6. https://doi.org/10.1109/GLOCOM.2015.7417609Google ScholarCross Ref
Vidit Saxena, Joakim Jaldén, and Henrik Klessig. 2019. Optimal UAV base station trajectories using flow-level models for reinforcement learning. IEEE Transactions on Cognitive Communications and Networking 5, 4 (2019), 1101–1112.Google ScholarCross Ref
Jurgen Scherer and Bernhard Rinner. 2017. Short and full horizon motion planning for persistent multi-UAV surveillance with energy and communication constraints. 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2017), 230–235.Google ScholarDigital Library
Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, and David Silver. [n. d.]. Mastering Atari, Go, chess and shogi by planning with a learned model. 588, 7839 ([n. d.]), 604–609. https://doi.org/10.1038/s41586-020-03051-4Google ScholarCross Ref
Angelo Trotta, Leonardo Montecchiari, Marco Di Felice, and Luciano Bononi. [n. d.]. A GPS-Free Flocking Model for Aerial Mesh Deployments in Disaster-Recovery Scenarios. 8 ([n. d.]), 91558–91573. https://doi.org/10.1109/ACCESS.2020.2994466Google ScholarCross Ref
Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2018. Graph Attention Networks. arxiv:1710.10903 [stat.ML]Google Scholar
Dong Wang and Yanping Yang. 2020. Joint Obstacle Avoidance and 3D Deployment for Securing UAV-Enabled Cellular Communications. IEEE Access 8 (2020), 67813–67821.Google ScholarCross Ref
Shuai Wang, Shijie Hu, Baoshen Guo, and Guang Wang. 2023. Cross-Region Courier Displacement for On-Demand Delivery With Multi-Agent Reinforcement Learning. IEEE Transactions on Big Data (2023), 1–14. https://doi.org/10.1109/TBDATA.2023.3262408Google ScholarCross Ref
Xiao Wang, Houye Ji, Chuan Shi, Bai Wang, Yanfang Ye, Peng Cui, and Philip S Yu. 2019. Heterogeneous Graph Attention Network. In The World Wide Web Conference (San Francisco, CA, USA) (WWW ’19). Association for Computing Machinery, New York, NY, USA, 2022–2032. https://doi.org/10.1145/3308558.3313562Google ScholarDigital Library
Yuhuai Wu, Elman Mansimov, Shun Liao, Roger B. Grosse, and Jimmy Ba. 2017. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation. CoRR abs/1708.05144 (2017). arXiv:1708.05144Google Scholar
Susu Xu, Xinlei Chen, Xidong Pi, Carlee Joe-Wong, Pei Zhang, and Hae Young Noh. 2020. iLOCuS: Incentivizing Vehicle Mobility to Optimize Sensing Distribution in Crowd Sensing. IEEE Transactions on Mobile Computing 19, 8 (2020), 1831–1847. https://doi.org/10.1109/TMC.2019.2915838Google ScholarDigital Library
Yong Zeng and Rui Zhang. 2017. Energy-Efficient UAV Communication With Trajectory Optimization. IEEE Transactions on Wireless Communications 16, 6 (2017), 3747–3760. https://doi.org/10.1109/TWC.2017.2688328Google ScholarDigital Library
Heng Zhang, Michael A Roth, Rajesh K. Panta, He Wang, and Saurabh Bagchi. 2020. CrowdBind: Fairness Enhanced Late Binding Task Scheduling in Mobile Crowdsensing. In Proceedings of the 2020 International Conference on Embedded Wireless Systems and Networks (Lyon, France) (EWSN ’20). Junction Publishing, USA, 61–72.Google ScholarDigital Library
Zeyu Zhang, Weiwei Chen, Junwen Wang, Shuai Wang, and Tian He. 2022. CONST: Exploiting Spatial-Temporal Correlation for Multi-Gateway based Reliable LoRa Reception. In 2022 IEEE 30th International Conference on Network Protocols (ICNP). IEEE, 1–11.Google ScholarCross Ref

Index Terms

Scheduling UAV Swarm with Attention-based Graph Reinforcement Learning for Ground-to-air Heterogeneous Data Communication
1. Computer systems organization
  1. Embedded and cyber-physical systems
    1. Sensor networks

Recommendations

Real-time path planning of controllable UAV by subgoals using goal-conditioned reinforcement learning
Abstract
The conventional path planning problem for an unmanned aerial vehicle (UAV) typically involves a pre-defined environment and mission, with the objective of reaching a single target point. However, in order to perform different missions,...
Highlights
- We propose a real-time autonomous path planning of UAVs in unknown environments.
Read More
UAV swarm path planning with reinforcement learning for field prospecting
Abstract
There has been steady growth in the adoption of Unmanned Aerial Vehicle (UAV) swarms by operators due to their time and cost benefits. However, this kind of system faces an important problem, which is the calculation of many optimal paths for each ...
Read More
A Brief Overview of Waveforms for UAV Air-to-Ground Communication Systems
ICVISP 2019: Proceedings of the 3rd International Conference on Vision, Image and Signal Processing

Unmanned aerial vehicles (UAVs) have tremendous potential in both military and civilian applications. Due to the increasing demand for cost-efficient yet reliable small UAVs in a plethora of civilian applications, the technical advances in cooperative ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
UbiComp/ISWC '23 Adjunct: Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing
October 2023
822 pages
ISBN:9798400702006
DOI:10.1145/3594739
Editors:
Monica Tentori
(CICESE, Mexico)
,
Nadir Weibel
(UC San Diego, USA)
,
Kristof Van Laerhoven
(University of Siegen, Germany)
,
Zhongyi Zhou
(University of Tokyo, Japan)
Copyright © 2023 Owner/Author
This work is licensed under a Creative Commons Attribution International 4.0 License.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 8 October 2023
Check for updates
Author Tags
Disaster Response
Graph Neural Network
Model-based Reinforcement Learning
Path Planning
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate764of2,912submissions,26%
Upcoming Conference
UBICOMP '24

Sponsor:

sigchi

sigchi

UBICOMP '24: The 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing

October 5 - 9, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 207
  Total Downloads
- Downloads (Last 12 months)207
- Downloads (Last 6 weeks)53
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Scheduling UAV Swarm with Attention-based Graph Reinforcement Learning for Ground-to-air Heterogeneous Data Communication

UbiComp/ISWC '23 Adjunct: Adjunct Proceedings of the 2023 ACM International Joint Conference on Pervasive and Ubiquitous Computing & the 2023 ACM International Symposium on Wearable Computing

ABSTRACT

References

Cited By

Index Terms

Recommendations

Real-time path planning of controllable UAV by subgoals using goal-conditioned reinforcement learning

UAV swarm path planning with reinforcement learning for field prospecting

A Brief Overview of Waveforms for UAV Air-to-Ground Communication Systems