Abstract
This paper presents a model for visual focus of attention recognition in the Ambient Kitchen, a pervasive computing prototyping environment. The kitchen is equipped with several blended displays on one wall and users may use information presented on these displays from multiple locations. Our goal is to recognize which display the user is looking at so that the environment can adjust the display content accordingly. We propose a dynamic Bayesian network model to infer the focus of attention, which models the relation between multiple foci of attention, multiple user locations and faces captured by the multiple cameras in the environment. Head pose is not explicitly computed but measured by a similarity vector which represents the likelihoods of multiple face clusters. Video data are collected in the Ambient Kitchen environment and experimental results demonstrate the effectiveness of our model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Roda, C., Thomas, J.: Attention aware systems: Theories, applications, and research agenda. Computers in Human Behavior 22, 557–587 (2006)
Langton, S.R.H., Watt, R.J., Bruce, V.: Do the Eyes Have it? Cues to the Direction of Social Attention. Trends in Cognitive Sciences 4(2), 50–58 (2000)
Stiefelhagen, R.: Tracking Focus of Attention in Meetings. In: Proc. Fourth IEEE Conf. Multimodal Interfaces (2002)
Voit, M., Stiefelhagen, R.: Deducing the Visual Focus of Attention from Head Pose Estimation in Dynamic Multi-view Meeting Scenarios. In: ACM and IEEE International Conference on Multimodal Interfaces (ICMI 2008), Chania, Crete, Greece, October 20-22 (2008)
Ba, S.O., Odobez, J.M.: A Study on Visual Focus of Attention Recognition from Head Pose in a Meeting Room. In: Renals, S., Bengio, S., Fiscus, J.G. (eds.) MLMI 2006. LNCS, vol. 4299, pp. 75–87. Springer, Heidelberg (2006)
Otsuka, K., Sawada, H., Yamato, J.: Automatic Inference of Cross-modal Nonverbal Interactions in Multiparty Conversations. In: Proc. of ACM 9th Int. Conf. Multimodal Interfaces (ICMI 2007), Nagoya, Japan, November 2007, pp. 255–262 (2007)
Smith, K., Ba, S.O., Perez, D.G., Odobez, J.M.: Tracking the multi person wandering visual focus of attention. In: Proceedings of the 8th international conference on Multimodal interfaces, Banff, Alberta, Canada, November 2-4 (2006)
Zhang, H., Toth, L., Deng, W., Guo, J., Yang, J.: Monitoring Visual Focus of Attention via Local Discriminant Projection. In: Proceedings of ACM International Conference on Multimedia Information Retrieval (2008)
Viola, P., Jones, M.: Rapid Object Detection Using a Boosted Cascade of Simple Features. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Jones, M., Viola, P.: Fast multi-view face detection. Technical Report TR2003-96, MERL (June 2003)
Ross, D., Lim, J., Lin, R.-S., Yang, M.-H.: Incremental Learning for Robust Visual Tracking. International Journal of Computer Vision (2007)
Frey, B., Jojic, N.: A comparison of algorithms for inference and learning in probabilistic graphical models. IEEE Trans. Pattern Analysis and Machine Intelligence 27(9), 1–25 (2005)
Olivier, P., Monk, A., Xu, G., Hoey, J.: Ambient Kitchen: designing situated services using a high fidelity prototyping environment. In: Proceedings of 2nd International Conference on Pervasive Technologies Related to Assistive Environments, Workshop on Affect and Behaviour Related Assistance in Support for the Elderly, Corfu, Greece (June 2009)
Pham, C., Olivier, P.: Slice and Dice: Recognizing food preparation activities using embedded accelerometers. In: Proceedings of the 3rd European Conference on Ambient Intelligence (AmI 2009), Salzburg, Austria (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dong, L., Di, H., Tao, L., Xu, G., Oliver, P. (2010). Visual Focus of Attention Recognition in the Ambient Kitchen. In: Zha, H., Taniguchi, Ri., Maybank, S. (eds) Computer Vision – ACCV 2009. ACCV 2009. Lecture Notes in Computer Science, vol 5996. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12297-2_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-12297-2_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12296-5
Online ISBN: 978-3-642-12297-2
eBook Packages: Computer ScienceComputer Science (R0)