ABSTRACT
The workload on web search engines is actually multiclass, being derived from the activities of both human users and automated robots. It is important to distinguish between these two classes in order to reliably characterize human web search behavior, and to study the effect of robot activity. We suggest an approach based on a multi-dimensional characterization of search sessions, and take first steps towards implementing it by studying the interaction between the query submittal rate and the minimal interval of time between different queries.
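As a minimal sketch of the two dimensions named above, the following computes a query submittal rate and a minimal interval between different queries for each user in a log. The record layout `(user_id, timestamp_seconds, query_text)`, the function name `session_features`, and the choice to express the rate per hour over a caller-supplied observation window are all assumptions made for illustration; they are not taken from the paper, and no classification thresholds are implied.

```python
from collections import defaultdict


def session_features(records, window_seconds):
    """Per-user query count, rate, and minimal gap between different queries.

    records: iterable of (user_id, timestamp_seconds, query_text) tuples
             (a hypothetical log layout, assumed for this sketch).
    window_seconds: length of the observation window used to normalize the rate.
    """
    by_user = defaultdict(list)
    for user_id, ts, query in records:
        by_user[user_id].append((ts, query))

    features = {}
    for user_id, events in by_user.items():
        events.sort()  # order each user's submissions by timestamp
        # Gaps between consecutive submissions whose query text differs;
        # repeats of the same query text are skipped.
        gaps = [
            t2 - t1
            for (t1, q1), (t2, q2) in zip(events, events[1:])
            if q1 != q2
        ]
        features[user_id] = {
            "queries": len(events),
            "rate_per_hour": len(events) * 3600.0 / window_seconds,
            "min_interval": min(gaps) if gaps else None,
        }
    return features


# Made-up records, for illustration only.
log = [
    ("u1", 0, "cats"), ("u1", 40, "cats"), ("u1", 95, "dogs"),
    ("bot", 0, "a"), ("bot", 1, "b"), ("bot", 2, "c"),
]
feats = session_features(log, window_seconds=3600)
```

Keeping the two features side by side for each user, rather than thresholding either one alone, is what allows their interaction to be examined.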
Index Terms
- Distinguishing humans from robots in web search logs: preliminary results using query rates and intervals