Abstract
Topic Modeling is a well-known text-mining strategy that detects potential underlying topics for documents. It plays a pivotal role in recommender systems for processing proliferated user-generated content (UGC) for personalized recommendations. Its application presents unique challenges in tourism sector due to the diversity, dynamicity, and multimodality of tourism data. This study presents a comprehensive analysis of selected promising topic models specifically in context of tourism recommender systems. The study conducts experimental evaluation of models’ performance on five datasets, and highlights their advantages and unique characteristics based on multiple evaluation parameters. Results reveal no best approach in general, rather optimality of models depend on data characteristics, as thoroughly discussed in this paper. It further discusses open issues for the tourism context-related application of topic models, and future research directions.
This research is supported by Amarena Company srl.
References
Alenezi, T., Hirtle, S.: Normalized attraction travel personality representation for improving travel recommender systems. IEEE Access (2022)
Angelov, D.: Top2vec: distributed representations of topics. arXiv preprint arXiv:2008.09470 (2020)
Bao, J., Xu, C., Liu, P., Wang, W.: Exploring bikesharing travel patterns and trip purposes using smart card data and online point of interests. Netw. Spat. Econ. 17, 1231–1253 (2017)
Bianchi, F., Terragni, S., Hovy, D.: Pre-training is a hot topic: Contextualized document embeddings improve topic coherence (2020)
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet allocation. J. Mach. Learn. Res. 3(Jan), 993–1022 (2001)
Dieng, A.B., Ruiz, F.J., Blei, D.M.: Topic modeling in embedding spaces. Trans. Assoc. Comput. Ling. 8, 439–453 (2020)
Grootendorst, M.: Bertopic: Neural topic modeling with a class-based tf-idf procedure. arXiv preprint arXiv:2203.05794 (2022)
Guo, Y., Barnes, S.J., Jia, Q.: Mining meaning from online ratings and reviews: tourist satisfaction analysis using latent Dirichlet allocation. Tour. Manage. 59, 467–483 (2017)
Hu, N., Zhang, T., Gao, B., Bose, I.: What do hotel customers complain about? text analysis using structural topic model. Tour. Manage. 72, 417–426 (2019)
Kamal, M., Chatzigiannakis, I.: Influential factors for tourist profiling for personalized tourism recommendation systems–a compact survey. In: 2021 International Conference on Innovative Computing (ICIC), pp. 1–6. IEEE (2021)
Korenčić, D., Ristov, S., Repar, J., Šnajder, J.: A topic coverage approach to evaluation of topic models. IEEE Access 9, 123280–123312 (2021)
Krumm, J., Davies, N., Narayanaswami, C.: User-generated content. IEEE Pervasive Comput. 7(4), 10–11 (2008)
Lee, D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in neural information processing systems 13 (2000)
Liu, Y., et al.: Roberta: a robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692 (2019)
Lui, M., Lau, J.H., Baldwin, T.: Automatic detection and language identification of multilingual documents. Trans. Assoc. Comput. Ling. 2, 27–40 (2014)
Vu, H.Q., Li, G., Law, R.: Discovering implicit activity preferences in travel itineraries by topic modeling. Tour. Manage. 75, 435–446 (2019)
Yan, Q., Jiang, T., Zhou, S., Zhang, X.: Exploring tourist interaction from user-generated content: topic analysis and content analysis. J. Vacation Mark., 13567667221135196 (2022)
Zhao, N., Fan, G., Qi, Z., Shi, J.: Exploring the current situation of cultural tourism scenic spots based on lda model-take Nanjing, Jiangsu province, China as an example. Procedia Comput. Sci. 221, 826–832 (2023)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Kamal, M., Romani, G., Ricciuti, G., Anagnostopoulos, A., Chatzigiannakis, I. (2024). Analyzing Topic Models: A Tourism Recommender System Perspective. In: Barolli, L. (eds) Advanced Information Networking and Applications. AINA 2024. Lecture Notes on Data Engineering and Communications Technologies, vol 200. Springer, Cham. https://doi.org/10.1007/978-3-031-57853-3_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-57853-3_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-57852-6
Online ISBN: 978-3-031-57853-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)