Skip to main content
Log in

BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks

  • Published:
Wireless Personal Communications Aims and scope Submit manuscript

Abstract

LTE-unlicensed (LTE-U) technology is a promising innovation to extend the capacity of cellular networks. The primary challenge for LTE-U is the fair coexistence between LTE systems and the incumbent WiFi systems. In this paper, we aim to maximize the long-term average per-user LTE throughput with long-term fairness guarantee by jointly considering resource allocation and user association on the unlicensed spectrum within a prediction window. We first formulate the problem as an NP-hard combinatorial optimization problem, then reformulate it as a non-cooperative game by applying the penalty function method. To solve the game, a novel reinforcement learning approach based on Bi-directional LSTM neural network is proposed, which enables small base stations (SBSs) to predict a sequence of future actions over the next prediction window based on the historical network information. It is shown that the proposed approach can converge to a mixed-strategy Nash equilibrium of the studied game and ensure the long-term fair coexistence between different access technologies. Finally, the effectiveness of the proposed algorithm is demonstrated by numerical simulation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  1. Cisco. (2017). Cisco visual networking index: Global mobile data traffic forecast update 2016-2021, Whitepaper.

  2. Nokia. (2014). Nokia LTE for unlicensed spectrum.

  3. Ko, H., Lee, J., & Pack, S. (2016). A fair listen-before-talk algorithm for coexistence of LTE-U and WiFi. IEEE Transactions on Vehicular Technology, 65(12), 10116–10120.

    Article  Google Scholar 

  4. Qualcomm research. (2014). LTE in unlicensed spectrum: Harmonious coexistence with Wi-Fi.

  5. Chen, M., Saad, W., & Yin, C. (2017). Echo state networks for self-organizing resource allocation in LTE-U with uplink-downlink decoupling. IEEE Transactions on Wireless Communications, 16(1), 3–16.

    Article  Google Scholar 

  6. Chen, Q., Yu, G., Shan, H., & Maaref, A. (2016). Cellular meets WiFi: Traffic offloading or resource sharing? IEEE Transactions on Wireless Communications, 15(5), 3354–3367.

    Article  Google Scholar 

  7. Chen, Q., Yu, G., & Ding, Z. (2016). Optimizing unlicensed spectrum sharing for LTE-U and WiFi network coexistence. IEEE Journal on Selected Areas in Communications, 34(10), 2562–2574.

    Article  Google Scholar 

  8. Challita, U., Dong, L., & Saad, W. (2017). Deep learning for proactive resource allocation in LTE-U networks. In 23th European wireless conference, Dresden, Germany (pp. 1–6).

  9. Liu, R., Chen, Q., & Yu, G. (2018). Joint user association and resource optimization for unlicensed LTE systems. In IEEE international conference on communications (pp. 1–6). Kansas, MO, USA.

  10. Lopez-Perez, D., Guvenc, I., de la Roche, G., Kountouris, M., Quek, T. Q. S., & Zhang, J. (2011). Enhanced intercell interference coordination challenges in heterogeneous networks. IEEE Wireless Communications, 18(3), 22–30.

    Article  Google Scholar 

  11. Bianchi, G. (2000). Performance analysis of IEEE 802.11 distributed coordination function. IEEE Journal on Selected Areas in Communications, 18(3), 535–547.

    Article  Google Scholar 

  12. Schuster, M., & Paliwal, K. (1997). Bidirectional recurrent neural networks. IEEE Transactions on Signal Processing, 45(11), 2673–2681.

    Article  Google Scholar 

  13. Rumelhart, D., Hinton, G., & Williams, R. (1986). Learning representations by back-propagating errors. Nature, 323, 533–536.

    Article  Google Scholar 

  14. Sutton, R., McAllester, D., Singh, S., & Mansour, Y. (2000). Policy gradient methods for reinforcement learning with function approximation. Neural Information Processing Systems, 12, 1057–1063.

    Google Scholar 

  15. Balazinska, M., & Castro, P. (2003). IBM Watson Research Center. CRAWCAD.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhikun Luo.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Luo, Z., Yu, G. BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks. Wireless Pers Commun 114, 2629–2641 (2020). https://doi.org/10.1007/s11277-020-07493-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11277-020-07493-x

Keywords

Navigation