skip to main content
research-article

RLC: A Reinforcement Learning-Based Charging Algorithm for Mobile Devices

Published:16 July 2021Publication History
Skip Abstract Section

Abstract

Wireless charging has been demonstrated as a promising technology for prolonging device operational lifetimes in Wireless Rechargeable Networks (WRNs). To schedule a mobile charger to move along a predesigned trajectory to charge devices, most existing studies assume that the precise location information of devices is already known. Unfortunately, this assumption does not always hold in real mobile application, because the activities of the vast majority of mobile devices carried by mobile agents appear dynamic and random. To the best of our knowledge, this is the first work to study how to wirelessly charge mobile devices with non-deterministic mobility. We aim to provide effective charging service to them, subject to the energy capacity of the mobile charger. We formalize the effective charging problem as a charging reward maximization problem (CRMP), where the amount of reward obtained by charging a device is inversely proportional to the residual lifetime of the device. Then, we prove that CRMP is NP-hard. To derive an effective charging heuristic, an algorithm based on Reinforcement Learning (RL) is proposed. The evaluation results show that the RL-based charging algorithm achieves excellent charging effectiveness. We further interpret the learned heuristic to gain deep and valuable insights into the design options.

References

  1. Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al. 2016. Tensorflow: A system for large-scale machine learning. In OSDI, Vol. 16. 265–283. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aloÿs Augustin, Jiazi Yi, Thomas Clausen, and William Mark Townsley. 2016. A study of LoRa: Long range and low power networks for the internet of things. Sensors 16, 9 (2016), 1466.Google ScholarGoogle ScholarCross RefCross Ref
  3. Luca Benini, Alessandro Bogliolo, and Giovanni De Micheli. 2000. A survey of design techniques for system-level dynamic power management. IEEE Transactions on Very Large Scale Integration (VLSI) Systems 8, 3 (2000), 299–316. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dimitri P. Bertsekas. 1998. Network Optimization: Continuous and Discrete Models. Citeseer.Google ScholarGoogle Scholar
  5. Xianbo Cao, Wenzheng Xu, Xuxun Liu, Jian Peng, and Tang Liu. 2020. A deep reinforcement learning-based on-demand charging algorithm for wireless rechargeable sensor networks. Ad Hoc Networks 110 (2020), 102278.Google ScholarGoogle ScholarCross RefCross Ref
  6. Lin Chen, Shan Lin, and Hua Huang. 2016. Charge me if you can: Charging path optimization and scheduling in mobile networks. In ACM MOBIHOC. 101–110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Cisco. 2020. Cisco Aironet 1840 Series Access Points Data Sheet. Retrieved from https://www.cisco.com/c/en/us/products/collateral/wireless/aironet-1840-series-access-points/datasheet-c78-742125.html.Google ScholarGoogle Scholar
  8. Haipeng Dai, Yunhuai Liu, Guihai Chen, Xiaobing Wu, Tian He, Alex X. Liu, and Yang Zhao. 2018. SCAPE: Safe charging with adjustable power. IEEE/ACM Transactions on Networking 26, 1 (2018), 520–533. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Haipeng Dai, Qiufang Ma, Xiaobing Wu, Guihai Chen, David K. Y. Yau, Shaojie Tang, Xiang-Yang Li, and Chen Tian. 2020. Chase: Charging and scheduling scheme for stochastic event capture in wireless rechargeable sensor networks. IEEE Transactions on Mobile Computing 19, 1 (2020), 44–59.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Liang He, Peng Cheng, Yu Gu, Jianping Pan, Ting Zhu, and Cong Liu. 2014. Mobile-to-mobile energy replenishment in mission-critical robotic sensor networks. In IEEE INFOCOM. 1195–1203.Google ScholarGoogle Scholar
  11. Liang He, Linghe Kong, Yu Gu, Jianping Pan, and Ting Zhu. 2014. Evaluating the on-demand mobile charging in wireless sensor networks. IEEE Transactions on Mobile Computing 14, 9 (2014), 1861–1875.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Michael Held and Richard M. Karp. 1962. A dynamic programming approach to sequencing problems. Journal of the Society for Industrial and Applied Mathematics 10, 1 (1962), 196–210.Google ScholarGoogle ScholarCross RefCross Ref
  13. Daniel P. Heyman and Matthew J. Sobel. 2004. Stochastic Models in Operations Research: Stochastic Optimization. Vol. 2. Courier Corporation.Google ScholarGoogle Scholar
  14. Sahar Hoteit, Stefano Secci, Stanislav Sobolevsky, Carlo Ratti, and Guy Pujolle. 2014. Estimating human trajectories and hotspots through mobile phone data. Computer Networks 64 (2014), 296–307. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Nitthilan Kannappan Jayakodi, Anwesha Chatterjee, Wonje Choi, Janardhan Rao Doppa, and Partha Pratim Pande. 2018. Trading-off accuracy and energy of deep inference on embedded systems: A co-design approach. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 37, 11 (2018), 2881–2893.Google ScholarGoogle ScholarCross RefCross Ref
  16. Byoungwoo Kang and Gerbrand Ceder. 2009. Battery materials for ultrafast charging and discharging. Nature 458, 7235 (2009), 190.Google ScholarGoogle Scholar
  17. Elias Khalil, Hanjun Dai, Yuyu Zhang, Bistra Dilkina, and Le Song. 2017. Learning combinatorial optimization algorithms over graphs. In NIPS. 6348–6358. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Andre Kurs, Aristeidis Karalis, Robert Moffatt, John D. Joannopoulos, Peter Fisher, and Marin Soljačić. 2007. Wireless power transfer via strongly coupled magnetic resonances. Science 317, 5834 (2007), 83–86.Google ScholarGoogle Scholar
  19. Weifa Liang, Wenzheng Xu, Xiaojiang Ren, Xiaohua Jia, and Xiaola Lin. 2016. Maintaining large-scale rechargeable sensor networks perpetually via multiple mobile charging vehicles. ACM Transactions on Sensor Networks (TOSN) 12, 2 (2016), 14. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Chi Lin, Chunyang Guo, Jing Deng, and Guowei Wu. 2018. 3DCS: A 3-D dynamic collaborative scheduling scheme for wireless rechargeable sensor networks with heterogeneous chargers. In IEEE ICDCS. 311–320.Google ScholarGoogle Scholar
  21. Chi Lin, Zhiyuan Wang, Jing Deng, Lei Wang, Jiankang Ren, and Guowei Wu. 2018. mTS: Temporal- and spatial-collaborative charging for wireless rechargeable sensor networks with multiple vehicles. In IEEE INFOCOM. 99–107.Google ScholarGoogle Scholar
  22. Chi Lin, Jingzhe Zhou, Chunyang Guo, Houbing Song, Guowei Wu, and Mohammad S. Obaidat. 2018. TSCA: A temporal-spatial real-time charging scheduling algorithm for on-demand architecture in wireless rechargeable sensor networks. IEEE Transactions on Mobile Computing 17, 1 (2018), 211–224.Google ScholarGoogle ScholarCross RefCross Ref
  23. Tang Liu, Baijun Wu, Hongyi Wu, and Jian Peng. 2017. Low-cost collaborative mobile charging for large-scale wireless sensor networks. IEEE Transactions on Mobile Computing 16, 8 (2017), 2213–2227.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Tang Liu, Baijun Wu, Wenzheng Xu, Xianbo Cao, Jian Peng, and Hongyi Wu. 2020. Learning an effective charging scheme for mobile devices. In IEEE IPDPS. IEEE.Google ScholarGoogle Scholar
  25. Tang Liu, Baijun Wu, Shihao Zhang, Jian Peng, and Wenzheng Xu. 2020. An effective multi-node charging scheme for wireless rechargeable sensor networks. In IEEE Conference on Computer Communications (IEEE INFOCOM’20). IEEE, 2026–2035.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Huadong Ma, Dong Zhao, and Peiyan Yuan. 2014. Opportunities in mobile crowd sensing. IEEE Communications Magazine 52, 8 (2014), 29–35.Google ScholarGoogle ScholarCross RefCross Ref
  27. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. 2013. Playing atari with deep reinforcement learning. arXiv:1312.5602. Retrieved from https://arxiv.org/abs/1312.5602.Google ScholarGoogle Scholar
  28. Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, et al. 2015. Human-level control through deep reinforcement learning. Nature 518, 7540 (2015), 529.Google ScholarGoogle Scholar
  29. Martin L. Puterman. 2014. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the game of Go with deep neural networks and tree search. Nature 529, 7587 (2016), 484–489.Google ScholarGoogle Scholar
  31. Chaoming Song, Tal Koren, Pu Wang, and Albert-László Barabási. 2010. Modelling the scaling properties of human mobility. Nature Physics 6, 10 (2010), 818–823.Google ScholarGoogle ScholarCross RefCross Ref
  32. Chaoming Song, Zehui Qu, Nicholas Blumm, and Albert-László Barabási. 2010. Limits of predictability in human mobility. Science 327, 5968 (2010), 1018–1021.Google ScholarGoogle Scholar
  33. Richard S. Sutton and Andrew G. Barto. 2012. Reinforcement Learning: An Introduction. Vol. 1. MIT Press, Cambridge. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Ning Wang, Jie Wu, and Haipeng Dai. 2019. Bundle charging: Wireless charging energy minimization in dense wireless sensor networks. In IEEE ICDCS.Google ScholarGoogle Scholar
  35. Zhenchun Wei, Fei Liu, Zengwei Lyu, Xu Ding, Lei Shi, and Chengkai Xia. 2018. Reinforcement learning for a novel mobile charging strategy in wireless rechargeable sensor networks. In International Conference on Wireless Algorithms, Systems, and Applications. Springer, 485–496.Google ScholarGoogle ScholarCross RefCross Ref
  36. Wicharge. 2020. Long-Rrange Wireless WI-CHARGERS. https://wi-charge.com/product_cat/transmitters/.Google ScholarGoogle Scholar
  37. Jie Wu, Mingjun Xiao, and Liusheng Huang. 2013. Homing spread: Community home-based multi-copy routing in mobile social networks. In IEEE INFOCOM. 2319–2327.Google ScholarGoogle Scholar
  38. Tao Wu, Panlong Yang, Haipeng Dai, Wanru Xu, and Mingxue Xu. 2019. Collaborated tasks-driven mobile charging and scheduling: A near optimal result. In IEEE INFOCOM. IEEE, 1810–1818.Google ScholarGoogle Scholar
  39. Liguang Xie, Yi Shi, Y. Thomas Hou, Wenjing Lou, and Hanif D. Sherali. 2013. On traveling path and related problems for a mobile station in a rechargeable sensor network. In ACM MobiHoc. 109–118. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Wenzheng Xu, Weifa Liang, Xiaohua Jia, Haibin Kan, Yinlong Xu, and Xinming Zhang. 2020. Minimizing the maximum charging delay of multiple mobile chargers under the multi-node energy charging scheme. IEEE Transactions on Mobile Computing 20, 5 (2021), 1846–1861.Google ScholarGoogle ScholarCross RefCross Ref
  41. Wenzheng Xu, Zichuan Xu, Jian Peng, Weifa Liang, Tang Liu, Xiaohua Jia, and Sajal K. Das. 2020. Approximation algorithms for the team orienteering problem. In IEEE INFOCOM.Google ScholarGoogle Scholar
  42. Yuanyuan Yang and Cong Wang. 2015. Wireless Rechargeable Sensor Networks. Springer.Google ScholarGoogle Scholar
  43. Sheng Zhang, Jie Wu, and Sanglu Lu. 2015. Collaborative mobile charging. IEEE Transactions on Computers 3, 64 (2015), 654–667.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Yu Zheng, Longhao Wang, Ruochi Zhang, Xing Xie, and Wei-Ying Ma. 2008. GeoLife: Managing and understanding your past life over maps. In IEEE MDM. 211–212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. Yu Zheng, Lizhu Zhang, Xing Xie, and Wei-Ying Ma. 2009. Mining interesting locations and travel sequences from GPS trajectories. In ACM WWW. 791–800. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. RLC: A Reinforcement Learning-Based Charging Algorithm for Mobile Devices

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Sensor Networks
      ACM Transactions on Sensor Networks  Volume 17, Issue 4
      November 2021
      403 pages
      ISSN:1550-4859
      EISSN:1550-4867
      DOI:10.1145/3472298
      Issue’s Table of Contents

      Copyright © 2021 Association for Computing Machinery.

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 16 July 2021
      • Accepted: 1 March 2021
      • Revised: 1 February 2021
      • Received: 1 June 2020
      Published in tosn Volume 17, Issue 4

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format .

    View HTML Format