Abstract
Cognitive Vehicular Networks (CVNs) can improve spectrum utilization by intelligently using idle spectrum, so as to fulfill the needs of communication. The previous researches only considered vehicle-to-vehicle(V2V) links or vehicle-to-infrastructure (V2I) links and ignored the influence of spectrum sensing errors. Therefore, in this paper, V2V links and V2I links are simultaneously discussed in the presence of spectrum sensing errors in the CVNs communication environment that we establish, and a dynamic spectrum access problem aiming at spectrum utilization is framed. In order to solve the above problems, the reinforcement learning method is introduced in this paper. But the impact of two kinds of collisions on the spectrum access rate of cognitive vehicles is neglected in the reinforcement learning method, and the above collisions which exist between cognitive vehicles, between cognitive vehicles and primary vehicles. Hence, different reward functions are designed according to different collision situations, and an improved reinforcement learning method is utilized to improve the success probability of spectrum access. To verify the effectiveness of the improved method, the performance and convergence of the proposed method are significantly better than other methods by comparing with the Myopic method, DQN and traditional DDQN in Python.
Similar content being viewed by others
References
Najada, A. L., Mahgoub, H. (2016). I.: Anticipation and alert system of congestion and accidents in vanet using big data analysis for intelligent transportation systems. In: 2016 IEEE symposium series on computational intelligence (SSCI), pp. 1–8. IEEE.
Ullah, A., Yao, X., Shaheen, S., & Ning, H. (2019). Advances in position based routing towards its enabled fog-oriented vanet-a survey. IEEE Transactions on Intelligent Transportation Systems, 21(2), 828–840.
Mutalik, P., Nagaraj, S., Vedavyas, J., Biradar, R. V., Patil, V. G. C. (2016). A comparative study on aodv, dsr and dsdv routing protocols for intelligent transportation system (its) in metro cities for road traffic safety using vanet route traffic analysis (vrta). In: 2016 IEEE international conference on advances in electronics, communication and computer technology (ICAECCT), pp. 383–386. IEEE.
Gao, H., Liu, C., Li, Y., & Yang, X. (2020). V2vr: Reliable hybrid-network-oriented v2v data transmission and routing considering rsus and connectivity probability. IEEE Transactions on Intelligent Transportation Systems, 22(6), 3533–3546.
Villas, L. A., Boukerche, A., Maia, G., Pazzi, R. W., & Loureiro, A. A. (2014). Drive: An efficient and robust data dissemination protocol for highway and urban vehicular ad hoc networks. Computer Networks, 75, 381–394.
Cunha, F., Villas, L., Boukerche, A., Maia, G., Viana, A., Mini, R. A., & Loureiro, A. A. (2016). Data communication in vanets: Protocols, applications and challenges. Ad Hoc Networks, 44, 90–103.
Shaibani, R., Zahary, A. (2018). Survey of context-aware video transmission over vehicular ad-hoc networks (vanets). EAI Endorsed Transactions on Mobile Communications and Applications 4(15).
Wang, R., Xu, Z., Zhao, X., & Hu, J. (2019). V2v-based method for the detection of road traffic congestion. IET Intelligent Transport Systems, 13(5), 880–885.
Priyan, M., & Devi, G. U. (2019). A survey on internet of vehicles: Applications, technologies, challenges and opportunities. International Journal of Advanced Intelligence Paradigms, 12(1–2), 98–119.
Wang, X., Wang, C., Zhang, J., Zhou, M., & Jiang, C. (2016). Improved rule installation for real-time query service in software-defined internet of vehicles. IEEE Transactions on Intelligent Transportation Systems, 18(2), 225–235.
Vasudev, H., Deshpande, V., Das, D., & Das, S. K. (2020). A lightweight mutual authentication protocol for v2v communication in internet of vehicles. IEEE Transactions on Vehicular Technology, 69(6), 6709–6717.
Mitola, J., & Maguire, G. Q. (1999). Cognitive radio: Making software radios more personal. IEEE Personal Communications, 6(4), 13–18.
Mitola, J. (2000). An integrated agent architecture for software defined radio: Dissertation doctor of technology, royal institute of technology, sweden, may 8.
Di Felice, M., Doost-Mohammady, R., Chowdhury, K. R., & Bononi, L. (2012). Smart radios for smart vehicles: Cognitive vehicular networks. IEEE Vehicular Technology Magazine, 7(2), 26–33.
Ahmed, Z., Jamal, H., Khan, S., Mehboob, R., & Ashraf, A. (2009). Cognitive communication device for vehicular networking. IEEE Transactions on Consumer Electronics, 55(2), 371–375.
Di Felice, M., Chowdhury, K.R., Bononi, L. (2011).Cooperative spectrum management in cognitive vehicular ad hoc networks. In: 2011 IEEE vehicular networking conference (VNC), pp. 47–54 IEEE.
Zhang, H., & Guo, C. (2019). Beam alignment-based mmwave spectrum sensing in cognitive vehicular etworks. In: 2019 IEEE global conference on signal and information processing (GlobalSIP), pp. 1–5. IEEE.
Li, M., Zhao, L., & Liang, H. (2017). An smdp-based prioritized channel allocation scheme in cognitive enabled vehicular ad hoc networks. IEEE Transactions on Vehicular Technology, 66(9), 7925–7933.
Cheng, N., Zhang, N., Lu, N., Shen, X., Mark, J. W., & Liu, F. (2013). Opportunistic spectrum access for cr-vanets: A game-theoretic approach. IEEE Transactions on Vehicular Technology, 63(1), 237–251.
Gill, K.S., Heath, K.N., Chuke, S., Haider, A., Gegear, R.J., Ryder, E.F., & Wyglinski, A.M.(2020). Bumblebee-inspired c-v2x dynamic spectrum access testbed using openairinterface. In: 2020 IEEE 91st vehicular technology conference (VTC2020-Spring), pp. 1–5. IEEE.
Yang, C., Fu, Y., Zhang, Y., Xie, S., & Yu, R. (2013). Energy-efficient hybrid spectrum access scheme in cognitive vehicular ad hoc networks. IEEE Communications Letters, 17(2), 329–332.
Van Hasselt, H., Guez, A., & Silver, D. (2016). Deep reinforcement learning with double q-learning. In: Proceedings of the AAAI conference on artificial intelligence, vol. 30
Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., & Wierstra, D. (20125) Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971
Mnih, V., Badia, A.P., Mirza, M., Graves, A., Lillicrap, T., Harley, T., Silver, D., & Kavukcuoglu, K. (2016). Asynchronous methods for deep reinforcement learning. In: International conference on machine learning, pp. 1928–1937. PMLR.
Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J., Bellemare, M. G., Graves, A., Riedmiller, M., Fidjeland, A. K., & Ostrovski, G. (2015). Human-level control through deep reinforcement learning. Nature, 518(7540), 529–533.
Wang, Y., Li, X., Wan, P., & Shao, R. (2021). Intelligent dynamic spectrum access using deep reinforcement learning for vanets. IEEE Sensors Journal, 21(14), 15554–15563.
Choe, C., Ahn, J., Choi, J., Park, D., Kim, M., & Ahn, S. (2020). A robust channel access using cooperative reinforcement learning for congested vehicular networks. IEEE Access, 8, 135540–135557.
Choe, C., Choi, J., Ahn, J., Park, D., & Ahn, S. (2020). Multiple channel access using deep reinforcement learning for congested vehicular networks. In: 2020 IEEE 91st vehicular technology conference (VTC2020-Spring), pp. 1–6. IEEE.
Li, X., Lu, L., Ni, W., Jamalipour, A., Zhang, D., & Du, H. (2022). Federated multi-agent deep reinforcement learning for resource allocation of vehicle-to-vehicle communications. IEEE Transactions on Vehicular Technology, 71(8), 8810–8824.
Sroka, P., & Kliks, A. (2022). Distributed learning for vehicular dynamic spectrum access in autonomous driving. In: 2022 IEEE international conference on pervasive computing and communications workshops and other affiliated events (PerCom Workshops), pp. 605–610. IEEE.
Liu, X., Sun, C., Yau, K. -L. A., & Wu, C. (2022). Joint collaborative big spectrum data sensing and reinforcement learning based dynamic spectrum access for cognitive internet of vehicles. IEEE Transactions on Intelligent Transportation Systems, 1–11.
Meinilä, J., Kyösti, P., Jämsä, T., Hentilä, L. (2009). Winner ii channel models. In: Radio technologies and concepts for IMT-advanced,
Liu, K., Zhao, Q., & Krishnamachari, B. (2010). Dynamic multichannel access with imperfect channel state detection. IEEE Transactions on Signal Processing, 58(5), 2795–2808.
Wang, S., Liu, H., Gomes, P. H., & Krishnamachari, B. (2018). Deep reinforcement learning for dynamic multichannel access in wireless networks. IEEE Transactions on Cognitive Communications and Networking, 4(2), 257–265.
Liang, L., Ye, H., & Li, G. Y. (2019). Spectrum sharing in vehicular networks based on multi-agent reinforcement learning. IEEE Journal on Selected Areas in Communications, 37(10), 2282–2292.
Acknowledgements
This work is supported in part by the National Natural Science Foundation of China under grant no. 61571209 and 61501059, Science and Technology Department of Jilin Provincial, China (Grant No. YDZJ202201ZYTS653),Science and Technology Department of Jilin Provincial, China (Grant No. 20180101336JC).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have not disclosed any competing interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, L., Zhao, Q., Fu, K. et al. Multi-user reinforcement learning based multi-reward for spectrum access in cognitive vehicular networks. Telecommun Syst 83, 51–65 (2023). https://doi.org/10.1007/s11235-023-01004-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11235-023-01004-6