Skip to main content

Advertisement

Log in

End-to-end pseudo relevance feedback based vertical web search queries recommendation

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Nowadays, the web has emerged as an enormous multimedia data resource. Social media platforms are becoming the mass producers of user-generated multimedia content. Web search engines usually organize media-specific information, such as text, images, video, etc., in specialized data repositories (verticals), providing easy access to multimedia content. Web search engines have become efficient in retrieving data across various distinct verticals. However, users still need help formulating queries to retrieve the relevant multimedia results from verticals. Novice users often issue short-length ambiguous queries due to a lack of domain knowledge or query formulation experience, resulting in the retrieval of irrelevant results. The query formulation itself is a time-consuming process for the users. We presented an end-to-end deep-learning automatic query recommendation approach to address the associated issues. The proposed method autonomously extracts the domain knowledge using pseudo-relevance feedback, transforms it into a unified text-to-text summary, and assists users in generating non-ambiguous and well-balanced query recommendations. The proposed system employs Google’s real-time dataset and is compared to the Google search engine. The evaluation consists of empirical and usability perspectives. The empirical evaluation of shorter query formulation and reformulation time obtained 89% accuracy scores in automated query recommendation. The usability testing (N=37) reveals 85.4% usefulness & ease-of-use, and “A” category proposed system usability.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20

Similar content being viewed by others

Notes

  1. Pre-trained model employed in this research uses the default python library configuration.

  2. https://www.djangoproject.com/

  3. https://screenrec.com/

References

  1. Rashid U, Bhatti MA (2017) A framework to explore results in multiple media information aggregated search. Multimed Tools Appl 76(24):25787–25826

  2. Pouyanfar S, Yang Y, Chen S-C, Shyu M-L, Iyengar S (2018) Multimedia big data analytics: a survey. ACM Comput Surv (CSUR) 51(1):1–34

  3. Vidinli IB, Ozcan R (2016) New query suggestion framework and algorithms: a case study for an educational search engine. Inf Process Manage 52(5):733–752. https://doi.org/10.1016/j.ipm.2016.02.001

  4. Kuzi S, Zhai C, Tian Y, Tang H (2020) Figexplorer: a system for retrieval and exploration of figures from collections of research articles. In: Proceedings of the 43rd international ACM SIGIR conference on research and development in information retrieval, pp 2133–2136

  5. Oussous A, Benjelloun F-Z, Lahcen AA, Belfkih S (2018) Big data technologies: a survey. J King Saud Univ- Comput Inf Sci 30(4):431–448

  6. Pámies-Estrems D, Castellá-Roca J, Viejo A (2016) Working at the web search engine side to generate privacy-preserving user profiles. Expert Syst Appl 64:523–535. https://doi.org/10.1016/j.eswa.2016.08.033

  7. Khan AR, Rashid U, Saleem K, Ahmed A (2021) An architecture for non-linear discovery of aggregated multimedia document web search results. PeerJ Comput Sci 7:449

  8. Tan SS-L, Goonawardene N (2017) Internet health information seeking and the patient-physician relationship: a systematic review. J Med Internet Res 19(1):9

  9. Kathuria M, Nagpal C, Duhan N (2016) Journey of web search engines: milestones, challenges & innovations. Int J Inf Technol Comput Sci 12:47–58

  10. Jiang J, Ni C (2016) What affects word changes in query reformulation during a task-based search session. In: Proceedings of the 2016 ACM on conference on human information interaction and retrieval, pp 111–120

  11. Toms EG, O’Brien H, Mackenzie T, Jordan C, Freund L, Toze S, Dawe E, Macnutt A (2007) Task effects on interactive search: the query factor. In: International workshop of the initiative for the evaluation of XML Retrieval, pp 359–372 . Springer

  12. Bilal D, Gwizdka J (2018) Children’s query types and reformulations in google search. Inf Process Manage 54(6):1022–1041

  13. Lin S-C, Yang J-H, Nogueira R, Tsai M-F, Wang C-J, Lin J (2020) Query reformulation using query history for passage retrieval in conversational search. arXiv:2005.02230

  14. Maxwell D, Bailey P, Hawking D (2017) Large-scale generative query autocompletion. In: Proceedings of the 22nd australasian document computing symposium, pp 1–8

  15. Li C, Sun Y, He B, Wang L, Hui K, Yates A, Sun L, Xu J (2018) Nprf: a neural pseudo relevance feedback framework for ad-hoc information retrieval. arXiv:1810.12936

  16. Rashid U, Javid A, Khan AR, Liu L, Ahmed A, Khalid O, Saleem K, Meraj S, Iqbal U, Nawaz R (2022) A hybrid mask rcnn-based tool to localize dental cavities from real-time mixed photographic images. PeerJ Comput Sc 8:888

  17. Rahman MM, Abdullah NA (2018) A personalized group-based recommendation approach for web search in e-learning. IEEE Access 6:34166–34178

  18. Wang J, Pan M, He T, Huang X, Wang X, Tu X (2020) A pseudo-relevance feedback framework combining relevance matching and semantic matching for information retrieval. Inf Process Manage 57(6):102342

  19. Rashid U, Viviani M, Pasi G (2016) A graph-based approach for visualizing and exploring a multimedia search result space. Inf Sci 370:303–322

  20. Song W, Liang JZ, Cao XL, Park SC (2014) An effective query recommendation approach using semantic strategies for intelligent information retrieval. Expert Syst Appl 41(2):366–372. https://doi.org/10.1016/j.eswa.2013.07.052

  21. Rashid U, Saleem K, Ahmed A (2021) Mirre approach: nonlinear and multimodal exploration of mir aggregated search results. Multimed Tools Appl 80(13):20217–20253

  22. Russell-Rose T, Tate T (2012) Designing the Search Experience: the Information Architecture of Discovery. Newnes

  23. Kofler C, Larson M, Hanjalic A (2016) User intent in multimedia search: a survey of the state of the art and future challenges. ACM Comput Surv (CSUR) 49(2):1–37

  24. Liao Z, Song Y, Zhou D (2020) Query suggestion. In: Query understanding for search engines, pp 171–203. Springer

  25. Kumar M, Bindal A, Gautam R, Bhatia R (2018) Keyword query based focused web crawler. Procedia Comput Sci 125:584–590

  26. Ooi J, Ma X, Qin H, Liew SC (2015) A survey of query expansion, query suggestion and query refinement techniques. In: 2015 4th International conference on software engineering and computer systems (ICSECS), pp 112–117. IEEE

  27. Azad HK, Deepak A (2019) Query expansion techniques for information retrieval: a survey. Inf Process Manage 56(5):1698–1735

  28. Chen W, Cai F, Chen H, de Rijke M (2017) Personalized query suggestion diversification. In: Proceedings of the 40th international ACM SIGIR conference on research and development in information retrieval, pp 817–820

  29. Ahmad WU, Chang K-W, Wang H (2019) Context attentive document ranking and query suggestion. In: Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval, pp 385–394

  30. Chen W, Cai F, Chen H, de Rijke M (2018) Attention-based hierarchical neural query suggestion. In: The 41st international ACM SIGIR conference on research & development in information retrieval, pp 1093–1096

  31. Ahmad W.U, Chang K.-W, Wang H (2018) Multi-task learning for document ranking and query suggestion. In: International conference on learning representations

  32. Jeffery S.R, Franklin M.J, Halevy AY (2008) Pay-as-you-go user feedback for dataspace systems. In: Proceedings of the 2008 ACM SIGMOD international conference on management of data, pp 847–860

  33. Balakrishnan V, Ahmadi K, Ravana SD (2015) Improving retrieval relevance using users’ explicit feedback. Aslib Journal of Information Management

  34. Jayarathna S, Patra A, Shipman F (2015) Unified relevance feedback for multi-application user interest modeling. In: Proceedings of the 15th ACM/IEEE-CS joint conference on digital libraries, pp 129–138

  35. Xu S, Jiang H, Lau FC (2008) Personalized online document, image and video recommendation via commodity eye-tracking. In: Proceedings of the 2008 ACM conference on recommender systems, pp 83–90

  36. Su Y, Yang S, Sun H, Srivatsa M, Kase S, Vanni M, Yan X (2015) Exploiting relevance feedback in knowledge graph search. In: Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining, pp 1135–1144

  37. Stai E, Kafetzoglou S, Tsiropoulou EE, Papavassiliou S (2018) A holistic approach for personalization, relevance feedback & recommendation in enriched multimedia content. Multimed Tools Appl 77(1):283–326

  38. Zamani H, Dadashkarimi J, Shakery A, Croft WB (2016) Pseudo-relevance feedback based on matrix factorization. In: Proceedings of the 25th ACM international on conference on information and knowledge management, pp 1483–1492

  39. ALMasri M, Berrut C, Chevallet J-P (2016) A comparison of deep learning based query expansion with pseudo-relevance feedback and mutual information. In: European conference on information retrieval, pp 709–715. Springer

  40. Keikha A, Ensan F, Bagheri E (2018) Query expansion using pseudo relevance feedback on wikipedia. J Intell Inf Syst 50(3):455–478

  41. Jiang J-Y, Wang W (2018) Rin: reformulation inference network for context-aware query suggestion. In: Proceedings of the 27th ACM international conference on information and knowledge management, pp 197–206

  42. Chen W, Cai F, Chen H, De Rijke M (2020) Personalized query suggestion diversification in information retrieval. Front Comput Sci 14(3):1–14

  43. Ding H, Zhang S, Garigliotti D, Balog K (2018) Generating high-quality query suggestion candidates for task-based search. In: European conference on information retrieval, pp 625–631. Springer

  44. Dehghani M, Rothe S, Alfonseca E, Fleury P (2017) Learning to attend, copy, and generate for session-based query suggestion. In: Proceedings of the 2017 ACM on conference on information and knowledge management, pp1747–1756

  45. Sordoni A, Bengio Y, Vahabi H, Lioma C, Grue Simonsen J, Nie J-Y (2015) A hierarchical recurrent encoder-decoder for generative context-aware query suggestion. In: Proceedings of the 24th ACM international on conference on information and knowledge management, pp 553–562

  46. Shokouhi M (2013) Learning to personalize query auto-completion. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval, pp 103–112

  47. Li X, Chen Y, Pettit B, Rijke MD (2019) Personalised reranking of paper recommendations using paper content and user behavior. ACM Trans Inf Syst (TOIS) 37(3):1–23

  48. Zhang X, Jiang X, Qin J (2020) Time-aware query suggestion diversification for temporally ambiguous queries. The Electronic Library

  49. Cai F, Reinanda R, Rijke MD (2016) Diversifying query auto-completion. ACM Trans Inf Syst (TOIS) 34(4):1–33

  50. Mustar A, Lamprier S, Piwowarski B (2021) On the study of transformers for query suggestion. ACM Trans Inf Syst (TOIS) 40(1):1–27

  51. Bodigutla PK (2021) High quality related search query suggestions using deep reinforcement learning. arXiv:2108.04452

  52. Yu H, Xiong C, Callan J (2021) Improving query representations for dense retrieval with pseudo relevance feedback. arXiv:2108.13454

  53. Yu H, Dai Z, Callan J (2021) Pgt: pseudo relevance feedback using a graph-based transformer. arXiv:2101.07918

  54. Valcarce D, Parapar J, Barreiro Á (2018) Lime: linear methods for pseudo-relevance feedback. In: Proceedings of the 33rd annual ACM symposium on applied computing, pp 678–687

  55. Lv Y, Zhai C, Chen W (2011) A boosting approach to improving pseudo-relevance feedback. In: Proceedings of the 34th international ACM SIGIR conference on research and development in information retrieval, pp 165–174

  56. White RW, Roth RA (2009) Exploratory search: beyond the query-response paradigm. Synth Lect Inf Concepts Retr Serv 1(1):1–98

  57. Atwood R, Dervin B (1981) Challenges to sociocultural predictors of information seeking: a text of race versus situation movement state. Ann Int Commun Assoc 5(1):549–569. https://doi.org/10.1080/23808985.1981.11923862

  58. Wenxiu P (2015) Analysis of new media communication based on lasswell’s “5w” model. J Educ Soc Res 5(3):245–245

  59. McCarley J, Chakravarti R, Sil A (2019) Structured pruning of a bert-based question answering model. arXiv:1910.06360

  60. Chandrasekaran D, Mago V (2021) Evolution of semantic similarity-a survey. ACM Comput Surv (CSUR) 54(2):1–37

  61. Cutrell E, Guan Z (2007) What are you looking for? an eye-tracking study of information usage in web search. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 407–416

  62. Thorleuchter D, den Poel DV, Prinzie A (2010) Mining ideas from textual information. Expert Syst Appl 37(10):7182–7188. https://doi.org/10.1016/j.eswa.2010.04.013

  63. Chang Y, Ounis I, Kim M (2006) Query reformulation using automatically generated query concepts from a document space. Inf Process Manage 42(2):453–468

  64. Khan A.R, Rashid U (2021) A relational aggregated disjoint multimedia search results approach using semantics. In: 2021 International conference on artificial intelligence (ICAI), pp 62–67. https://doi.org/10.1109/ICAI52203.2021.9445229

  65. Khan AR, Rashid U, Ahmed N (2022) An explanatory study on user behavior in discovering aggregated multimedia web content. IEEE Access 10:56316–56330. https://doi.org/10.1109/ACCESS.2022.3177597

  66. Shekhar A, Marsden N (2018) Cognitive walkthrough of a learning management system with gendered personas. In: Proceedings of the 4th conference on gender & IT, pp 191–198

  67. Alroobaea R, Mayhew PJ (2014) How many participants are really enough for usability studies? In: 2014 Science and information conference, pp 48–56. IEEE

  68. Marcum JW (2002) Rethinking Inf Lit Libr Q 72(1):1–26

  69. Taramigkou M, Apostolou D, Mentzas G (2017) Supporting creativity through the interactive exploratory search paradigm. Int J Hum Comput Interact 33(2):94–114

  70. Li Y, Belkin NJ (2008) A faceted approach to conceptualizing tasks in information seeking. Inf Process Manage 44(6):1822–1837

  71. Lewis JR, Sauro J (2018) Item benchmarks for the system usability scale. Journal of Usability Studies 13(3)

  72. Lewis JR (1991) Psychometric evaluation of an after-scenario questionnaire for computer usability studies: the asq. ACM Sigchi Bulletin 23(1):78–81

  73. Shi J, Mo X, Sun Z (2012) Content validity index in scale development. Zhong nan da xue xue bao. Yi xue ban= Journal of Central South University. Med Sci 37(2): 152–155

  74. Brown A, Evans M, Jay C, Glancy M, Jones R, Harper S (2014) Hci over multiple screens. In: CHI’14 extended abstracts on human factors in computing systems, pp 665–674

  75. Kim JY, Teevan J, Craswell N (2016) Explicit in situ user feedback for web search results. In: Proceedings of the 39th international acm sigir conference on research and development in information retrieval, pp 829–832

  76. Tablan V, Bontcheva K, Roberts I, Cunningham H (2015) Mimir: an open-source semantic search framework for interactive information seeking and discovery. J Web Semant 30:52–68

  77. Huurdeman H, Kamps J, Wilson ML (2019) The multi-stage experience: the simulated work task approach to studying information seeking stages. CEUR Workshop Proceedings

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Umer Rashid.

Ethics declarations

Competing of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khan, T., Rashid, U. & Khan, A.R. End-to-end pseudo relevance feedback based vertical web search queries recommendation. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-024-18559-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11042-024-18559-4

Keywords

Navigation