kPose: A New Representation For Action Recognition

Zhou, Zhuoli; Song, Mingli; Zhang, Luming; Tao, Dacheng; Bu, Jiajun; Chen, Chun

doi:10.1007/978-3-642-19318-7_34

Zhuoli Zhou¹⁹,
Mingli Song¹⁹,
Luming Zhang¹⁹,
Dacheng Tao²⁰,
Jiajun Bu¹⁹ &
…
Chun Chen¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6494))

Included in the following conference series:

Asian Conference on Computer Vision

2901 Accesses
1 Citations

Abstract

Human action recognition is an important problem in computer vision. Most existing techniques use all the video frames for action representation, which leads to high computational cost. Different from these techniques, we present a novel action recognition approach by describing the action with a few frames of representative poses, namely kPose. Firstly, a set of pose templates corresponding to different pose classes are learned based on a newly proposed Pose-Weighted Distribution Model (PWDM). Then, a local set of kPoses describing an action are extracted by clustering the poses belonging to the action. Thirdly, a further kPose selection is carried out to remove the redundant poses among the different local sets, which leads to a global set of kPoses with the least redundancy. Finally, a sequence of kPoses is obtained to describe the action by searching the nearest kPose in the global set. And the proposed action classification is carried out by comparing the obtained pose sequence with each local set of kPose. The experimental results validate the proposed method by remarkable recognition accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: Tenth IEEE International Conference on Computer Vision (2005)
Google Scholar
Davis, J., Bobick, A.: The representation and recognition of action using temporal templates. In: IEEE Conference on Computer Vision and Pattern Recognition (1997)
Google Scholar
Efros, A., Berg, A., Mori, G., Malik, J.: Recognizing action at a distance. In: Ninth IEEE International Conference on Computer Vision (2007)
Google Scholar
Yamato, J., Ohya, J., Ishii, K.: Recognizing human action in time-sequential images using hidden Markov model. In: Proc. Comp. Vis. and Pattern Rec., pp. 379–385 (1992)
Google Scholar
Hatun, K., Duygulu, P.: Pose sentences: A new representation for action recognition using sequence of pose words. In: 19th International Conference on Pattern Recognition (2008)
Google Scholar
Mauthner, T., Roth, P., Bischof, H.: Instant action recognition. In: Salberg, A.-B., Hardeberg, J.Y., Jenssen, R. (eds.) SCIA 2009. LNCS, vol. 5575, pp. 1–10. Springer, Heidelberg (2009)
Chapter Google Scholar
Niebles, J., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Roth, P., Mauthner, T., Khan, I., Bischof, H.: Efficient Human Action Recognition by Cascaded Linear Classification (2010)
Google Scholar
Schindler, K., Van Gool, L.: Action Snippets: How many frames does human action recognition require? In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Thurau, C., Hlavác, V.: Pose primitive based human action recognition in videos or still images. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar
Dalai, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)
Google Scholar
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: Speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)
Chapter Google Scholar
Laptev, I., Lindeberg, T.: SpaceCtime interest points. In: Tenth IEEE International Conference on Computer Vision (2003)
Google Scholar
Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey Vision Conference, p. 50 (1988)
Google Scholar
Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (2005)
Google Scholar
Danafar, S., Gheissari, N.: Action recognition for surveillance applications using optic flow and SVM. In: Yagi, Y., Kang, S.B., Kweon, I.S., Zha, H. (eds.) ACCV 2007, Part II. LNCS, vol. 4844, pp. 457–466. Springer, Heidelberg (2007)
Chapter Google Scholar
Lu, W., Little, J.: Simultaneous tracking and action recognition using the pca-hog descriptor. In: Canadian Conference on Computer and Robot Vision (2006)
Google Scholar
Patron-Perez, A., Reid, I.: A probabilistic framework for recognizing similar actions using spatio-temporal features. In: British Machine Vision Conference
Google Scholar
Likas, A., Vlassis, N., et al.: The global k-means clustering algorithm. Pattern Recognition, 451–461 (2003)
Google Scholar
Yu, L., Liu, H.: Feature selection for high-dimensional data: A fast correlation-based filter solution. Machine Learning-International, 856 (2003)
Google Scholar
Fathi, A., Mori, G.: Action recognition by learning mid-level motion features. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

College of Computer Science, Zhejiang University, Hangzhou, China
Zhuoli Zhou, Mingli Song, Luming Zhang, Jiajun Bu & Chun Chen
School of Computer Engineering, Nanyang Technological University, Singapore
Dacheng Tao

Authors

Zhuoli Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Mingli Song
View author publications
You can also search for this author in PubMed Google Scholar
Luming Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Dacheng Tao
View author publications
You can also search for this author in PubMed Google Scholar
Jiajun Bu
View author publications
You can also search for this author in PubMed Google Scholar
Chun Chen
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Technion – Israel Institute of Technology, Department of Computer Science, 32000, Haifa, Israel
Ron Kimmel
The University of Auckland, 37 Kohimarama Road , Mission Bay, 1071, Auckland, New Zealand
Reinhard Klette
National Institute of Informatics, Chiyoda, 1018430, Tokyo, Japan
Akihiro Sugimoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhou, Z., Song, M., Zhang, L., Tao, D., Bu, J., Chen, C. (2011). kPose: A New Representation For Action Recognition. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19318-7_34

Download citation

DOI: https://doi.org/10.1007/978-3-642-19318-7_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19317-0
Online ISBN: 978-3-642-19318-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics