skip to main content
10.1145/3594739.3612905acmconferencesArticle/Chapter ViewAbstractPublication PagesubicompConference Proceedingsconference-collections
research-article
Open Access

Scheduling UAV Swarm with Attention-based Graph Reinforcement Learning for Ground-to-air Heterogeneous Data Communication

Authors Info & Claims
Published:08 October 2023Publication History

ABSTRACT

In disaster scenarios, unmanned aerial vehicles (UAVs) can serve as mobile base stations because of their maneuverability and synergy. However, due to constrained UAV communication capabilities and limited battery life, UAV base stations resource allocation for mobile sensors in a data-heterogeneous environment is a significant challenge when optimizing communication quality. To address this, we propose AGUZero, an attention-based graph reinforcement learning (RL) framework. Inspired by MuZero [27], AGUZero is designed to handle dynamic and uncontrollable environments based on Monte Carlo Tree Search (MCTS). Additionally, to tackle data heterogeneity, AGUZero represents the states using heterogeneous sub-graphs and employs an attention-based model to capture relationships among UAVs and sensors. The experimental results show that AGUZero outperforms other baseline models consistently when either the number of UAVs or the number of sensors is varying. AGUZero improves the data transmission ratio by 11.03% and 10.35% in the two cases respectively.

References

  1. Shakil Ahmed and Boulat A. Bash. 2019. Average Worst-Case Secrecy Rate Maximization via UAV and Base Station Resource Allocation. 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton) (2019), 1176–1181.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Akram Al-Hourani, Sithamparanathan Kandeepan, and Simon Lardner. 2014. Optimal LAP Altitude for Maximum Coverage. IEEE Wireless Communications Letters 3, 6 (2014), 569–572. https://doi.org/10.1109/LWC.2014.2342736Google ScholarGoogle ScholarCross RefCross Ref
  3. Mohamed Alzenad, Amr El-Keyi, Faraj Lagum, and Halim Yanikomeroglu. 2017. 3-D Placement of an Unmanned Aerial Vehicle Base Station (UAV-BS) for Energy-Efficient Maximal Coverage. IEEE Wireless Communications Letters 6 (2017), 434–437.Google ScholarGoogle ScholarCross RefCross Ref
  4. Xinlei Chen, Aveek Purohit, Carlos Ruiz Dominguez, Stefano Carpin, and Pei Zhang. 2015. Drunkwalk: Collaborative and adaptive planning for navigation of micro-aerial sensor swarms. In Proceedings of the 13th ACM Conference on Embedded Networked Sensor Systems. 295–308.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Xinlei Chen, Aveek Purohit, Shijia Pan, Carlos Ruiz, Jun Han, Zheng Sun, Frank Mokaya, Patric Tague, and Pei Zhang. 2017. Design experiences in minimalistic flying sensor node platform through sensorfly. ACM Transactions on Sensor Networks (TOSN) 13, 4 (2017), 1–37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Xinlei Chen, Aveek Purohit, Shijia Pan, Carlos Ruiz, Jun Han, Zheng Sun, Frank Mokaya, Patric Tague, and Pei Zhang. 2017. Design Experiences in Minimalistic Flying Sensor Node Platform through SensorFly. ACM Trans. Sen. Netw. 13, 4, Article 33 (nov 2017), 37 pages. https://doi.org/10.1145/3131779Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Xinlei Chen, Carlos Ruiz, Sihan Zeng, Liyao Gao, Aveek Purohit, Stefano Carpin, and Pei Zhang. 2020. H-DrunkWalk: Collaborative and adaptive navigation for heterogeneous MAV swarm. ACM Transactions on Sensor Networks (TOSN) 16, 2 (2020), 1–27.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Xuecheng Chen, Haoyang Wang, Zuxin Li, Wenbo Ding, Fan Dang, Chengye Wu, and Xinlei Chen. 2023. DeliverSense: Efficient Delivery Drone Scheduling for Crowdsensing with Deep Reinforcement Learning. In Adjunct Proceedings of the 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers (Cambridge, United Kingdom) (UbiComp/ISWC ’22 Adjunct). Association for Computing Machinery, New York, NY, USA, 403–408. https://doi.org/10.1145/3544793.3560412Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Xinlei Chen, Susu Xu, Haohao Fu, Carlee Joe-Wong, Lin Zhang, Hae Young Noh, and Pei Zhang. 2019. ASC: Actuation System for City-Wide Crowdsensing with Ride-Sharing Vehicular Platform. In Proceedings of the Fourth Workshop on International Science of Smart City Operations and Platforms Engineering (Montreal, Quebec, Canada) (SCOPE ’19). Association for Computing Machinery, New York, NY, USA, 19–24. https://doi.org/10.1145/3313237.3313299Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Xinlei Chen, Susu Xu, Jun Han, Haohao Fu, Xidong Pi, Carlee Joe-Wong, Yong Li, Lin Zhang, Hae Young Noh, and Pei Zhang. 2020. PAS: Prediction-Based Actuation System for City-Scale Ridesharing Vehicular Mobile Crowdsensing. IEEE Internet of Things Journal 7, 5 (2020), 3719–3734. https://doi.org/10.1109/JIOT.2020.2968375Google ScholarGoogle ScholarCross RefCross Ref
  11. Xinlei Chen, Susu Xu, Xinyu Liu, Xiangxiang Xu, Hae Young Noh, Lin Zhang, and Pei Zhang. 2020. Adaptive hybrid model-enabled sensing system (HMSS) for mobile fine-grained air pollution estimation. IEEE Transactions on Mobile Computing 21, 6 (2020), 1927–1944.Google ScholarGoogle ScholarCross RefCross Ref
  12. Xinlei Chen, Xiangxiang Xu, Xinyu Liu, Hae Young Noh, Lin Zhang, and Pei Zhang. 2016. Hap: Fine-grained dynamic air pollution map reconstruction by hybrid adaptive particle filter. In Proceedings of the 14th ACM Conference on Embedded Network Sensor Systems CD-ROM. 336–337.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Xinlei Chen, Yulei Zhao, Yong Li, Xu Chen, Ning Ge, and Sheng Chen. 2018. Social trust aided D2D communications: Performance bound and implementation mechanism. IEEE Journal on Selected Areas in Communications 36, 7 (2018), 1593–1608.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. LightZero Contributors. 2023. LightZero: OpenDILab A lightweight and efficient toolkit designed for the MCTS, AlphaZero, and MuZero family of algorithms.https://github.com/opendilab/LightZero.Google ScholarGoogle Scholar
  15. Zipeng Dai, Chi Harold Liu, Yuxiao Ye, Rui Han, Ye Yuan, Guoren Wang, and Jian Tang. 2022. AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning. In IEEE INFOCOM 2022 - IEEE Conference on Computer Communications. 1029–1038. https://doi.org/10.1109/INFOCOM48880.2022.9796732Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. DeepakG., Alexandros Ladas, Yusuf A. Sambo, Haris Bin Pervaiz, Christos Politis, and Muhammad Ali Imran. 2019. An Overview of Post-Disaster Emergency Communication Systems in the Future Networks. IEEE Wireless Communications 26 (2019), 132–139.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Zuxin Li, Fanhang Man, Xuecheng Chen, Baining Zhao, Chenye Wu, and Xinlei Chen. 2023. TRACT: Towards Large-Scale Crowdsensing With High-Efficiency Swarm Path Planning. In Adjunct Proceedings of the 2022 ACM International Joint Conference on Pervasive and Ubiquitous Computing and the 2022 ACM International Symposium on Wearable Computers (Cambridge, United Kingdom) (UbiComp/ISWC ’22 Adjunct). Association for Computing Machinery, New York, NY, USA, 409–414. https://doi.org/10.1145/3544793.3560401Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Wei Liu, Yi Ding, Shuai Wang, Yu Yang, and Desheng Zhang. 2022. Para-Pred: Addressing Heterogeneity for City-Wide Indoor Status Estimation in On-Demand Delivery. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3407–3417.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Fangxu Lu, Zhichao Mi, Ning Zhao, Hai Wang, and Yulu Tian. 2022. 3D Deployment of Dynamic UAV Base Station based on Mobile Users. In 2021 International Conference on Advanced Computing and Endogenous Security. 1–5. https://doi.org/10.1109/IEEECONF52377.2022.10013331Google ScholarGoogle ScholarCross RefCross Ref
  20. Jianqiang Lu, Xiangpeng Xie, Xia Zhou, and Chengjie Bu. [n. d.]. Research on power-communication coordination recovery strategy based on grid dividing after extreme disasters. 675, 1 ([n. d.]), 012038. https://doi.org/10.1088/1755-1315/675/1/012038Google ScholarGoogle ScholarCross RefCross Ref
  21. Junhai Luo, Zhiyan Wang, Ming Xia, Linyong Wu, Yuxin Tian, and Yu Chen. [n. d.]. Path Planning for UAV Communication Networks: Related Technologies, Solutions, and Opportunities. ([n. d.]), 3560261. https://doi.org/10.1145/3560261Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Maurilio Matracia, Nasir Saeed, Mustafa A. Kishk, and Mohamed-Slim Alouini. 2022. Post-Disaster Communications: Enabling Technologies, Architectures, and Open Challenges. IEEE Open Journal of the Communications Society 3 (2022), 1177–1205.Google ScholarGoogle ScholarCross RefCross Ref
  23. Luoyu Mei, Zhimeng Yin, Xiaolei Zhou, Shuai Wang, and Kai Sun. 2021. ECCR: Edge-Cloud Collaborative Recovery for Low-Power Wide-Area Networks Interference Mitigation. In Wireless Algorithms, Systems, and Applications: 16th International Conference, WASA 2021, Nanjing, China, June 25–27, 2021, Proceedings, Part I 16. Springer, 494–507.Google ScholarGoogle Scholar
  24. Mohammad Mozaffari, Walid Saad, Mehdi Bennis, and Merouane Debbah. 2015. Drone Small Cells in the Clouds: Design, Deployment and Performance Analysis. In 2015 IEEE Global Communications Conference (GLOBECOM). 1–6. https://doi.org/10.1109/GLOCOM.2015.7417609Google ScholarGoogle ScholarCross RefCross Ref
  25. Vidit Saxena, Joakim Jaldén, and Henrik Klessig. 2019. Optimal UAV base station trajectories using flow-level models for reinforcement learning. IEEE Transactions on Cognitive Communications and Networking 5, 4 (2019), 1101–1112.Google ScholarGoogle ScholarCross RefCross Ref
  26. Jurgen Scherer and Bernhard Rinner. 2017. Short and full horizon motion planning for persistent multi-UAV surveillance with energy and communication constraints. 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2017), 230–235.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Julian Schrittwieser, Ioannis Antonoglou, Thomas Hubert, Karen Simonyan, Laurent Sifre, Simon Schmitt, Arthur Guez, Edward Lockhart, Demis Hassabis, Thore Graepel, Timothy Lillicrap, and David Silver. [n. d.]. Mastering Atari, Go, chess and shogi by planning with a learned model. 588, 7839 ([n. d.]), 604–609. https://doi.org/10.1038/s41586-020-03051-4Google ScholarGoogle ScholarCross RefCross Ref
  28. Angelo Trotta, Leonardo Montecchiari, Marco Di Felice, and Luciano Bononi. [n. d.]. A GPS-Free Flocking Model for Aerial Mesh Deployments in Disaster-Recovery Scenarios. 8 ([n. d.]), 91558–91573. https://doi.org/10.1109/ACCESS.2020.2994466Google ScholarGoogle ScholarCross RefCross Ref
  29. Petar Veličković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2018. Graph Attention Networks. arxiv:1710.10903 [stat.ML]Google ScholarGoogle Scholar
  30. Dong Wang and Yanping Yang. 2020. Joint Obstacle Avoidance and 3D Deployment for Securing UAV-Enabled Cellular Communications. IEEE Access 8 (2020), 67813–67821.Google ScholarGoogle ScholarCross RefCross Ref
  31. Shuai Wang, Shijie Hu, Baoshen Guo, and Guang Wang. 2023. Cross-Region Courier Displacement for On-Demand Delivery With Multi-Agent Reinforcement Learning. IEEE Transactions on Big Data (2023), 1–14. https://doi.org/10.1109/TBDATA.2023.3262408Google ScholarGoogle ScholarCross RefCross Ref
  32. Xiao Wang, Houye Ji, Chuan Shi, Bai Wang, Yanfang Ye, Peng Cui, and Philip S Yu. 2019. Heterogeneous Graph Attention Network. In The World Wide Web Conference (San Francisco, CA, USA) (WWW ’19). Association for Computing Machinery, New York, NY, USA, 2022–2032. https://doi.org/10.1145/3308558.3313562Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Yuhuai Wu, Elman Mansimov, Shun Liao, Roger B. Grosse, and Jimmy Ba. 2017. Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation. CoRR abs/1708.05144 (2017). arXiv:1708.05144Google ScholarGoogle Scholar
  34. Susu Xu, Xinlei Chen, Xidong Pi, Carlee Joe-Wong, Pei Zhang, and Hae Young Noh. 2020. iLOCuS: Incentivizing Vehicle Mobility to Optimize Sensing Distribution in Crowd Sensing. IEEE Transactions on Mobile Computing 19, 8 (2020), 1831–1847. https://doi.org/10.1109/TMC.2019.2915838Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Yong Zeng and Rui Zhang. 2017. Energy-Efficient UAV Communication With Trajectory Optimization. IEEE Transactions on Wireless Communications 16, 6 (2017), 3747–3760. https://doi.org/10.1109/TWC.2017.2688328Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Heng Zhang, Michael A Roth, Rajesh K. Panta, He Wang, and Saurabh Bagchi. 2020. CrowdBind: Fairness Enhanced Late Binding Task Scheduling in Mobile Crowdsensing. In Proceedings of the 2020 International Conference on Embedded Wireless Systems and Networks (Lyon, France) (EWSN ’20). Junction Publishing, USA, 61–72.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Zeyu Zhang, Weiwei Chen, Junwen Wang, Shuai Wang, and Tian He. 2022. CONST: Exploiting Spatial-Temporal Correlation for Multi-Gateway based Reliable LoRa Reception. In 2022 IEEE 30th International Conference on Network Protocols (ICNP). IEEE, 1–11.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Scheduling UAV Swarm with Attention-based Graph Reinforcement Learning for Ground-to-air Heterogeneous Data Communication

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format