An Integrated Two-Stage Framework for Robust Head Pose Estimation

Wu, Junwen; Trivedi, Mohan M.

doi:10.1007/11564386_25

An Integrated Two-Stage Framework for Robust Head Pose Estimation

Junwen Wu¹⁹ &
Mohan M. Trivedi¹⁹

Conference paper

887 Accesses
8 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3723))

Abstract

Subspace analysis has been widely used for head pose estimation. However, such techniques are usually sensitive to data alignment and background noise. In this paper a two-stage approach is proposed to address this issue by combining the subspace analysis together with the topography method. The first stage is based on the subspace analysis of Gabor wavelets responses. Different subspace techniques were compared for better exploring the underlying data structure. Nearest prototype matching using Euclidean distance was used to get the pose estimate. The single pose estimated was relaxed to a subset of poses around it to incorporate certain tolerance to data alignment and background noise. In the second stage, the uncertainty is eliminated by analyzing finer geometrical structure details captured by bunch graphs. This coarse-to-fine framework was evaluated with a large data set. We examined 86 poses, with the pan angle spanning from –90^o to 90^o and the tilt angle spanning from –60^o to 45^o. The experimental results indicate that the integrated approach has a remarkably better performance than using subspace analysis alone.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Pappu, R., Beardsley, P.A.: Qualitative approach to classifying gaze direction. In: Proceedings of the IEEE Conf. on Automatic Face and Gesture Recognition (1998)
Google Scholar
Stiefelhagen, R.: Tracking focus of attention in meetings. In: Proceedings of the IEEE International Conference on Multimodal Interfaces, ICMI 2002 (2002)
Google Scholar
Huang, K., Trivedi, M.M., Gandhi, T.: Driver’s View and Vehicle Surround Estimation using Omnidirectional Video Stream. In: Proceedings of IEEE Intelligent Vehicles Symposium, Columbus, OH, June 9-11, pp. 444–449 (2003)
Google Scholar
Braathen, B., Bartlett, M.S., Movellan, J.R.: 3-d head pose estimation from video by stochastic particle filtering. In: Proceedings of the 8th Annual Joint Symposium on Neural Computation (2001)
Google Scholar
Li, Y., Gong, S., Liddell, H.: Support vector regression and classification based multi-view face detection and recognition. In: Proceeding of IEEE International Conference on Automatic Face and Gesture Recognition, pp. 300–305 (July 2000)
Google Scholar
Li, S.Z., Fu, Q.D., Gu, L., Scholkopf, B., Cheng, Y.M., Zhang, H.J.: Kernel machine based learning for multi-view face detection and pose estimation. In: Proceedings of 8th IEEE International Conference on Computer Vision (July 2001)
Google Scholar
Cordea, M., Petriu, E., Georganas, N., Petriu, D., Whalen, T.: Real-time 2.5d head pose recovery for model-based video-coding. In: Proceedings of the IEEE Instrumentation and Measurement Technology Conference (2000)
Google Scholar
Horprasert, T., Yacoob, Y., Davis, L.S.: An anthropometric shape model for estimating head orientation. In: Proceedings of the 3rd International Workshop on Visual Form (1997)
Google Scholar
Morency, L., Sundberg, P., Darrell, T.: Pose estimation using 3d view-based eigenspaces. In: Proceedings of the IEEE International Workshop on Analysis and Modeling of Faces and Gestures, in Conjunction with ICCV 2003, pp. 45–52 (2003)
Google Scholar
Seemann, E., Nickel, K., Stiefelhagen, R.: Head pose estimation using stereo vision for human-robot interaction. In: Proceedings of the 6th IEEE International Conference on Automatic Face and Gesture Recognition (2004)
Google Scholar
Chen, L., Zhang, L., Hu, Y., Li, M., Zhang, H.: Head pose estimation using fisher manifold learning. In: Proceedings of the IEEE International Workshop on Analysis and Modeling of Faces and Gestures, in Conjunction with ICCV 2003 (2003)
Google Scholar
Gong, S., Sherrah, J., Ong, E.: Understanding pose discrimination in similarity space. In: Proceedings of the The Eleventh British Machine Vision Conference, BMVC 1999 (1999)
Google Scholar
Wei, Y., Fradet, L., Tan, T.: Head pose estimation using gabor eigenspace modeling. In: Proceedings of the IEEE International Conference on Image Processing (ICIP 2002), vol. 1, pp. 281–284 (2002)
Google Scholar
Srinivasan, S., Boyer, K.L.: Head pose estimation using view based eigenspaces. In: Proceedings of the 16th International Conference on Pattern Recognition, vol. 4, pp. 302–305 (2002)
Google Scholar
Potzsch, M., Kruger, N., von der Malsburg, C.: Determination of face position and pose with a learned representation based on labeled graphs. Technical report, Institute for Neuroinformatik, RuhrUniversitat, Bochum, Internal Report (1996)
Google Scholar
Krüger, V., Sommer, G.: Efficient head pose estimation with gabor wavelet networks. In: Proceedings of the The Eleventh British Machine Vision Conference, BMVC 2000 (2000)
Google Scholar
Wiskott, L., Fellous, J., Krüger, N., von der Malsburg, C.: Face recognition by elastic bunch graph matching. In: Sommer, G., Daniilidis, K., Pauli, J. (eds.) CAIP 1997. LNCS, vol. 1296, Springer, Heidelberg (1997)
Google Scholar
Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Classification, 2nd edn. Wiley-interscience, Hoboken
Google Scholar
Scholkopf, B., Smola, A., Muller, K.-R.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation 10, 1299–1319 (1998)
Article Google Scholar
Li, Y., Gong, S., Liddell, H.: Recognising trajectories of facial identities using kernel discriminant analysis. In: Proceedings of the British Machine Vision Conference (BMVC 2001), pp. 613–622 (2001)
Google Scholar
Mika, S., Ratsch, G., Weston, J., Scholkopf, B., Muller, K.: Fisher discriminant analysis with kernels. In: Proceedings of the IEEE Neural Networks for Signal Processing Workshop, pp. 41–48 (1999)
Google Scholar
Mac Lennan, J.: Gabor representations ofspatiotemporal visual images. Technical report, Computer Science Department, University of Tennessee, Knoxville, CS-91-144 (1991); Accessible via: http://www.cs.utk.edu/~mclennan
Ham, J., Lee, D.D., Mika, S., Scholkopf, B.: A kernel view of dimensionality reduction of manifolds. In: Proceedings of the International Conference on Machine Learning (2004)
Google Scholar
Viola, P., Jones, M.: Robust Real-time Object Detection. In: Proceedings of the Second International Workshop on Statistical and Computational Theories of Vision - Modeling, Learning and Sampling. Jointed with ICCV 2001 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Vision and Robotics Research Laboratory, University of California, San Diego, La Jolla, CA, 92093, USA
Junwen Wu & Mohan M. Trivedi

Authors

Junwen Wu
View author publications
You can also search for this author in PubMed Google Scholar
Mohan M. Trivedi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Intuitive Surgical Inc, 950 Kifer Road, 94086, Sunnyvale, CA, USA
Wenyi Zhao
Shaogang Gong, Department of Computer Science, Queen Mary, University of London, E1 4NS, London, UK
Shaogang Gong
Microsoft Research Asia, P.O. Box, Beijing, P.R. China
Xiaoou Tang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wu, J., Trivedi, M.M. (2005). An Integrated Two-Stage Framework for Robust Head Pose Estimation. In: Zhao, W., Gong, S., Tang, X. (eds) Analysis and Modelling of Faces and Gestures. AMFG 2005. Lecture Notes in Computer Science, vol 3723. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11564386_25

Download citation

DOI: https://doi.org/10.1007/11564386_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29229-6
Online ISBN: 978-3-540-32074-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics