Abstract
Nowadays, the web has emerged as an enormous multimedia data resource. Social media platforms are becoming the mass producers of user-generated multimedia content. Web search engines usually organize media-specific information, such as text, images, video, etc., in specialized data repositories (verticals), providing easy access to multimedia content. Web search engines have become efficient in retrieving data across various distinct verticals. However, users still need help formulating queries to retrieve the relevant multimedia results from verticals. Novice users often issue short-length ambiguous queries due to a lack of domain knowledge or query formulation experience, resulting in the retrieval of irrelevant results. The query formulation itself is a time-consuming process for the users. We presented an end-to-end deep-learning automatic query recommendation approach to address the associated issues. The proposed method autonomously extracts the domain knowledge using pseudo-relevance feedback, transforms it into a unified text-to-text summary, and assists users in generating non-ambiguous and well-balanced query recommendations. The proposed system employs Google’s real-time dataset and is compared to the Google search engine. The evaluation consists of empirical and usability perspectives. The empirical evaluation of shorter query formulation and reformulation time obtained 89% accuracy scores in automated query recommendation. The usability testing (N=37) reveals 85.4% usefulness & ease-of-use, and “A” category proposed system usability.
Similar content being viewed by others
Notes
Pre-trained model employed in this research uses the default python library configuration.
References
Rashid U, Bhatti MA (2017) A framework to explore results in multiple media information aggregated search. Multimed Tools Appl 76(24):25787–25826
Pouyanfar S, Yang Y, Chen S-C, Shyu M-L, Iyengar S (2018) Multimedia big data analytics: a survey. ACM Comput Surv (CSUR) 51(1):1–34
Vidinli IB, Ozcan R (2016) New query suggestion framework and algorithms: a case study for an educational search engine. Inf Process Manage 52(5):733–752. https://doi.org/10.1016/j.ipm.2016.02.001
Kuzi S, Zhai C, Tian Y, Tang H (2020) Figexplorer: a system for retrieval and exploration of figures from collections of research articles. In: Proceedings of the 43rd international ACM SIGIR conference on research and development in information retrieval, pp 2133–2136
Oussous A, Benjelloun F-Z, Lahcen AA, Belfkih S (2018) Big data technologies: a survey. J King Saud Univ- Comput Inf Sci 30(4):431–448
Pámies-Estrems D, Castellá-Roca J, Viejo A (2016) Working at the web search engine side to generate privacy-preserving user profiles. Expert Syst Appl 64:523–535. https://doi.org/10.1016/j.eswa.2016.08.033
Khan AR, Rashid U, Saleem K, Ahmed A (2021) An architecture for non-linear discovery of aggregated multimedia document web search results. PeerJ Comput Sci 7:449
Tan SS-L, Goonawardene N (2017) Internet health information seeking and the patient-physician relationship: a systematic review. J Med Internet Res 19(1):9
Kathuria M, Nagpal C, Duhan N (2016) Journey of web search engines: milestones, challenges & innovations. Int J Inf Technol Comput Sci 12:47–58
Jiang J, Ni C (2016) What affects word changes in query reformulation during a task-based search session. In: Proceedings of the 2016 ACM on conference on human information interaction and retrieval, pp 111–120
Toms EG, O’Brien H, Mackenzie T, Jordan C, Freund L, Toze S, Dawe E, Macnutt A (2007) Task effects on interactive search: the query factor. In: International workshop of the initiative for the evaluation of XML Retrieval, pp 359–372 . Springer
Bilal D, Gwizdka J (2018) Children’s query types and reformulations in google search. Inf Process Manage 54(6):1022–1041
Lin S-C, Yang J-H, Nogueira R, Tsai M-F, Wang C-J, Lin J (2020) Query reformulation using query history for passage retrieval in conversational search. arXiv:2005.02230
Maxwell D, Bailey P, Hawking D (2017) Large-scale generative query autocompletion. In: Proceedings of the 22nd australasian document computing symposium, pp 1–8
Li C, Sun Y, He B, Wang L, Hui K, Yates A, Sun L, Xu J (2018) Nprf: a neural pseudo relevance feedback framework for ad-hoc information retrieval. arXiv:1810.12936
Rashid U, Javid A, Khan AR, Liu L, Ahmed A, Khalid O, Saleem K, Meraj S, Iqbal U, Nawaz R (2022) A hybrid mask rcnn-based tool to localize dental cavities from real-time mixed photographic images. PeerJ Comput Sc 8:888
Rahman MM, Abdullah NA (2018) A personalized group-based recommendation approach for web search in e-learning. IEEE Access 6:34166–34178
Wang J, Pan M, He T, Huang X, Wang X, Tu X (2020) A pseudo-relevance feedback framework combining relevance matching and semantic matching for information retrieval. Inf Process Manage 57(6):102342
Rashid U, Viviani M, Pasi G (2016) A graph-based approach for visualizing and exploring a multimedia search result space. Inf Sci 370:303–322
Song W, Liang JZ, Cao XL, Park SC (2014) An effective query recommendation approach using semantic strategies for intelligent information retrieval. Expert Syst Appl 41(2):366–372. https://doi.org/10.1016/j.eswa.2013.07.052
Rashid U, Saleem K, Ahmed A (2021) Mirre approach: nonlinear and multimodal exploration of mir aggregated search results. Multimed Tools Appl 80(13):20217–20253
Russell-Rose T, Tate T (2012) Designing the Search Experience: the Information Architecture of Discovery. Newnes
Kofler C, Larson M, Hanjalic A (2016) User intent in multimedia search: a survey of the state of the art and future challenges. ACM Comput Surv (CSUR) 49(2):1–37
Liao Z, Song Y, Zhou D (2020) Query suggestion. In: Query understanding for search engines, pp 171–203. Springer
Kumar M, Bindal A, Gautam R, Bhatia R (2018) Keyword query based focused web crawler. Procedia Comput Sci 125:584–590
Ooi J, Ma X, Qin H, Liew SC (2015) A survey of query expansion, query suggestion and query refinement techniques. In: 2015 4th International conference on software engineering and computer systems (ICSECS), pp 112–117. IEEE
Azad HK, Deepak A (2019) Query expansion techniques for information retrieval: a survey. Inf Process Manage 56(5):1698–1735
Chen W, Cai F, Chen H, de Rijke M (2017) Personalized query suggestion diversification. In: Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval, pp 817–820
Ahmad WU, Chang K-W, Wang H (2019) Context attentive document ranking and query suggestion. In: Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval, pp 385–394
Chen W, Cai F, Chen H, de Rijke M (2018) Attention-based hierarchical neural query suggestion. In: The 41st international ACM SIGIR conference on research & development in information retrieval, pp 1093–1096
Ahmad W.U, Chang K.-W, Wang H (2018) Multi-task learning for document ranking and query suggestion. In: International conference on learning representations
Jeffery S.R, Franklin M.J, Halevy AY (2008) Pay-as-you-go user feedback for dataspace systems. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data, pp 847–860
Balakrishnan V, Ahmadi K, Ravana SD (2015) Improving retrieval relevance using users’ explicit feedback. Aslib Journal of Information Management
Jayarathna S, Patra A, Shipman F (2015) Unified relevance feedback for multi-application user interest modeling. In: Proceedings of the 15th ACM/IEEE-CS joint conference on digital libraries, pp 129–138
Xu S, Jiang H, Lau FC (2008) Personalized online document, image and video recommendation via commodity eye-tracking. In: Proceedings of the 2008 ACM conference on recommender systems, pp 83–90
Su Y, Yang S, Sun H, Srivatsa M, Kase S, Vanni M, Yan X (2015) Exploiting relevance feedback in knowledge graph search. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144
Stai E, Kafetzoglou S, Tsiropoulou EE, Papavassiliou S (2018) A holistic approach for personalization, relevance feedback & recommendation in enriched multimedia content. Multimed Tools Appl 77(1):283–326
Zamani H, Dadashkarimi J, Shakery A, Croft WB (2016) Pseudo-relevance feedback based on matrix factorization. In: Proceedings of the 25th ACM international on conference on information and knowledge management, pp 1483–1492
ALMasri M, Berrut C, Chevallet J-P (2016) A comparison of deep learning based query expansion with pseudo-relevance feedback and mutual information. In: European conference on information retrieval, pp 709–715. Springer
Keikha A, Ensan F, Bagheri E (2018) Query expansion using pseudo relevance feedback on wikipedia. J Intell Inf Syst 50(3):455–478
Jiang J-Y, Wang W (2018) Rin: reformulation inference network for context-aware query suggestion. In: Proceedings of the 27th ACM international conference on information and knowledge management, pp 197–206
Chen W, Cai F, Chen H, De Rijke M (2020) Personalized query suggestion diversification in information retrieval. Front Comput Sci 14(3):1–14
Ding H, Zhang S, Garigliotti D, Balog K (2018) Generating high-quality query suggestion candidates for task-based search. In: European conference on information retrieval, pp 625–631. Springer
Dehghani M, Rothe S, Alfonseca E, Fleury P (2017) Learning to attend, copy, and generate for session-based query suggestion. In: Proceedings of the 2017 ACM on conference on information and knowledge management, pp1747–1756
Sordoni A, Bengio Y, Vahabi H, Lioma C, Grue Simonsen J, Nie J-Y (2015) A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In: Proceedings of the 24th ACM international on conference on information and knowledge management, pp 553–562
Shokouhi M (2013) Learning to personalize query auto-completion. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval, pp 103–112
Li X, Chen Y, Pettit B, Rijke MD (2019) Personalised reranking of paper recommendations using paper content and user behavior. ACM Trans Inf Syst (TOIS) 37(3):1–23
Zhang X, Jiang X, Qin J (2020) Time-aware query suggestion diversification for temporally ambiguous queries. The Electronic Library
Cai F, Reinanda R, Rijke MD (2016) Diversifying query auto-completion. ACM Trans Inf Syst (TOIS) 34(4):1–33
Mustar A, Lamprier S, Piwowarski B (2021) On the study of transformers for query suggestion. ACM Trans Inf Syst (TOIS) 40(1):1–27
Bodigutla PK (2021) High quality related search query suggestions using deep reinforcement learning. arXiv:2108.04452
Yu H, Xiong C, Callan J (2021) Improving query representations for dense retrieval with pseudo relevance feedback. arXiv:2108.13454
Yu H, Dai Z, Callan J (2021) Pgt: pseudo relevance feedback using a graph-based transformer. arXiv:2101.07918
Valcarce D, Parapar J, Barreiro Á (2018) Lime: linear methods for pseudo-relevance feedback. In: Proceedings of the 33rd annual ACM symposium on applied computing, pp 678–687
Lv Y, Zhai C, Chen W (2011) A boosting approach to improving pseudo-relevance feedback. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 165–174
White RW, Roth RA (2009) Exploratory search: beyond the query-response paradigm. Synth Lect Inf Concepts Retr Serv 1(1):1–98
Atwood R, Dervin B (1981) Challenges to sociocultural predictors of information seeking: a text of race versus situation movement state. Ann Int Commun Assoc 5(1):549–569. https://doi.org/10.1080/23808985.1981.11923862
Wenxiu P (2015) Analysis of new media communication based on lasswell’s “5w” model. J Educ Soc Res 5(3):245–245
McCarley J, Chakravarti R, Sil A (2019) Structured pruning of a bert-based question answering model. arXiv:1910.06360
Chandrasekaran D, Mago V (2021) Evolution of semantic similarity-a survey. ACM Comput Surv (CSUR) 54(2):1–37
Cutrell E, Guan Z (2007) What are you looking for? an eye-tracking study of information usage in web search. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 407–416
Thorleuchter D, den Poel DV, Prinzie A (2010) Mining ideas from textual information. Expert Syst Appl 37(10):7182–7188. https://doi.org/10.1016/j.eswa.2010.04.013
Chang Y, Ounis I, Kim M (2006) Query reformulation using automatically generated query concepts from a document space. Inf Process Manage 42(2):453–468
Khan A.R, Rashid U (2021) A relational aggregated disjoint multimedia search results approach using semantics. In: 2021 International conference on artificial intelligence (ICAI), pp 62–67. https://doi.org/10.1109/ICAI52203.2021.9445229
Khan AR, Rashid U, Ahmed N (2022) An explanatory study on user behavior in discovering aggregated multimedia web content. IEEE Access 10:56316–56330. https://doi.org/10.1109/ACCESS.2022.3177597
Shekhar A, Marsden N (2018) Cognitive walkthrough of a learning management system with gendered personas. In: Proceedings of the 4th conference on gender & IT, pp 191–198
Alroobaea R, Mayhew PJ (2014) How many participants are really enough for usability studies? In: 2014 Science and information conference, pp 48–56. IEEE
Marcum JW (2002) Rethinking Inf Lit Libr Q 72(1):1–26
Taramigkou M, Apostolou D, Mentzas G (2017) Supporting creativity through the interactive exploratory search paradigm. Int J Hum Comput Interact 33(2):94–114
Li Y, Belkin NJ (2008) A faceted approach to conceptualizing tasks in information seeking. Inf Process Manage 44(6):1822–1837
Lewis JR, Sauro J (2018) Item benchmarks for the system usability scale. Journal of Usability Studies 13(3)
Lewis JR (1991) Psychometric evaluation of an after-scenario questionnaire for computer usability studies: the asq. ACM Sigchi Bulletin 23(1):78–81
Shi J, Mo X, Sun Z (2012) Content validity index in scale development. Zhong nan da xue xue bao. Yi xue ban= Journal of Central South University. Med Sci 37(2): 152–155
Brown A, Evans M, Jay C, Glancy M, Jones R, Harper S (2014) Hci over multiple screens. In: CHI’14 extended abstracts on human factors in computing systems, pp 665–674
Kim JY, Teevan J, Craswell N (2016) Explicit in situ user feedback for web search results. In: Proceedings of the 39th international acm sigir conference on research and development in information retrieval, pp 829–832
Tablan V, Bontcheva K, Roberts I, Cunningham H (2015) Mimir: an open-source semantic search framework for interactive information seeking and discovery. J Web Semant 30:52–68
Huurdeman H, Kamps J, Wilson ML (2019) The multi-stage experience: the simulated work task approach to studying information seeking stages. CEUR Workshop Proceedings
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Khan, T., Rashid, U. & Khan, A.R. End-to-end pseudo relevance feedback based vertical web search queries recommendation. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-18559-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-024-18559-4