ABSTRACT
We introduce an entity-centric search experience, called Active Objects, in which entity-bearing queries are paired with actions that can be performed on the entities. For example, given a query for a specific flashlight, we aim to present actions such as reading reviews, watching demo videos, and finding the best price online. In an annotation study conducted over a random sample of user query sessions, we found that a large proportion of queries in query logs involve actions on entities, calling for an automatic approach to identifying relevant actions for entity-bearing queries. In this paper, we pose the problem of finding actions that can be performed on entities as the problem of probabilistic inference in a graphical model that captures how an entity bearing query is generated. We design models of increasing complexity that capture latent factors such as entity type and intended actions that determine how a user writes a query in a search box, and the URL that they click on. Given a large collection of real-world queries and clicks from a commercial search engine, the models are learned efficiently through maximum likelihood estimation using an EM algorithm. Given a new query, probabilistic inference enables recommendation of a set of pertinent actions and hosts. We propose an evaluation methodology for measuring the relevance of our recommended actions, and show empirical evidence of the quality and the diversity of the discovered actions.
- Balasubramanian, N. and Cucerzan, S. Topic Pages: An Alternative to the Ten Blue Links. In IEEE-ICSC (2010). Google ScholarDigital Library
- Blei, D.M., Ng, A. and Jordan, M. Latent Dirichlet Allocation. In Journal of Machine Learning Research, 3:993--1022, (2003). Google ScholarDigital Library
- Broder, A. A Taxonomy of Web Search. SIGIR Forum, volume 36 number 2 pages 3--10 (2002). Google ScholarDigital Library
- Broder, A., Fontoura, M., Gabrilovich, E., Joshi, A., Josifovski, V., Zhang, T. Robust Classification of Rare Queries Using Web Knowledge. In SIGIR (2007). Google ScholarDigital Library
- Carman, M.J., Crestani, F., Harvey, M., and Baillie, M. Towards Query Log Based Personalization using Topic Models. In Proceedings of CIKM (2010). Google ScholarDigital Library
- Cucerzan, S. Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proceedings of EMNLP (2007).Google Scholar
- Curran, J. R. and Clark, S. Language independent NER using a maximum entropy tagger. In CoNLL, pp. 164--167 (2003). Google ScholarDigital Library
- Dalvi, N., Kumar, R., Pang, B., Ramakrishnan, R., Tomkins, A., Bohannon, P., Keerthi, S., Merugu, S., A Web of Concepts. In Proceedings of PODS (2009). Google ScholarDigital Library
- Dupret, G. and Piwowarski, B. A User Behavior Model for Average Precision and its Generalization to Graded Judgments. In Proceedings of SIGIR, pages 531--538 (2010). Google ScholarDigital Library
- Gao, J., Toutanova, K. and Yih, W. Clickthrough-Based Latent Semantic Models for Web Search. In Proceedings of SIGIR (2011). Google ScholarDigital Library
- Guo, J., Xu, G., Cheng, X. and Li, H. Named Entity Recognition in Query. In Proceedings of SIGIR, pages 267--274 (2009). Google ScholarDigital Library
- Hamerly, G. and Elkan, C. Learning the K in K-Means. In Proceedings of the 7th Annual Conference on Neural Information Processing Systems (NIPS) (2003).Google Scholar
- Hearst, M. Automatic Acquisition of Hyponyms from Large Text Corpora. In COLING, Nantes, France, (1992). Google ScholarDigital Library
- Jain, A. and Pennacchiotti, M. Domain-Independent Entity Extraction from Web Search Query Logs. In WWW (2011). Google ScholarDigital Library
- Jansen, B.J., Booth, D. and Spink, A. Determining the User Intent of Web Search Engine Queries. In WWW (2007). Google ScholarDigital Library
- Kemke, C. and Walker, E. Planning with Action Abstraction and Plan Decomposition Hierarchies. In IAT (2006). Google ScholarDigital Library
- Metzinger, T. and Gallese, V. The Emergence of a Shared Action Ontology: Building Blocks for a Theory. In Consciousness and Cognition, 12, 549--571 (2003).Google ScholarCross Ref
- Pantel, P. and Fuxman, A. Jigs and Lures: Associating Web Queries with Structured Entities. In ACL (2011). Google ScholarDigital Library
- Rose, D. E. and Levinson, D. Understanding User Goals in Web Search. In Proceedings of WWW (2004). Google ScholarDigital Library
- Sauper, C. and Barzilay, R. Automatically Generating Wikipedia Articles: A Structure-Aware Approach. In Proceedings of ACL (2009). Google ScholarDigital Library
- Sekine, S. and Suzuki, H. Acquiring Ontological Knowledge from Query Logs. In Proceedings of WWW (2007). Google ScholarDigital Library
- Yin, X. and Shah, S. Building Taxonomy of Web Search Intents for Name Entity Queries. In WWW (2010) Google ScholarDigital Library
Index Terms
- Active objects: actions for entity-centric search
Recommendations
Structured query suggestion for specialization and parallel movement: effect on search behaviors
WWW '12: Proceedings of the 21st international conference on World Wide WebQuery suggestion, which enables the user to revise a query with a single click, has become one of the most fundamental features of Web search engines. However, it is often difficult for the user to choose from a list of query suggestions, and to ...
RebaCQ: query refinement based on consecutive queries
IRI'09: Proceedings of the 10th IEEE international conference on Information Reuse & IntegrationPrevious studies reveal that half of the queries submitted to search engines have no follow-up click-through data. This may indicate that users are either dissatisfied with the performance of current search engines or have difficulty formulating correct ...
Reranking search results for sparse queries
CIKM '11: Proceedings of the 20th ACM international conference on Information and knowledge managementIt is well known that clickthrough data can be used to improve the effectiveness of search results: broadly speaking, a query's past clicks are a predictor of future clicks on documents. However, when a new or unusual query appears, or when a system is ...
Comments