Abstract
Human action recognition is an important problem in computer vision. Most existing techniques use all the video frames for action representation, which leads to high computational cost. Different from these techniques, we present a novel action recognition approach by describing the action with a few frames of representative poses, namely kPose. Firstly, a set of pose templates corresponding to different pose classes are learned based on a newly proposed Pose-Weighted Distribution Model (PWDM). Then, a local set of kPoses describing an action are extracted by clustering the poses belonging to the action. Thirdly, a further kPose selection is carried out to remove the redundant poses among the different local sets, which leads to a global set of kPoses with the least redundancy. Finally, a sequence of kPoses is obtained to describe the action by searching the nearest kPose in the global set. And the proposed action classification is carried out by comparing the obtained pose sequence with each local set of kPose. The experimental results validate the proposed method by remarkable recognition accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Blank, M., Gorelick, L., Shechtman, E., Irani, M., Basri, R.: Actions as space-time shapes. In: Tenth IEEE International Conference on Computer Vision (2005)
Davis, J., Bobick, A.: The representation and recognition of action using temporal templates. In: IEEE Conference on Computer Vision and Pattern Recognition (1997)
Efros, A., Berg, A., Mori, G., Malik, J.: Recognizing action at a distance. In: Ninth IEEE International Conference on Computer Vision (2007)
Yamato, J., Ohya, J., Ishii, K.: Recognizing human action in time-sequential images using hidden Markov model. In: Proc. Comp. Vis. and Pattern Rec., pp. 379–385 (1992)
Hatun, K., Duygulu, P.: Pose sentences: A new representation for action recognition using sequence of pose words. In: 19th International Conference on Pattern Recognition (2008)
Mauthner, T., Roth, P., Bischof, H.: Instant action recognition. In: Salberg, A.-B., Hardeberg, J.Y., Jenssen, R. (eds.) SCIA 2009. LNCS, vol. 5575, pp. 1–10. Springer, Heidelberg (2009)
Niebles, J., Fei-Fei, L.: A hierarchical model of shape and appearance for human action classification. In: IEEE Conference on Computer Vision and Pattern Recognition (2007)
Roth, P., Mauthner, T., Khan, I., Bischof, H.: Efficient Human Action Recognition by Cascaded Linear Classification (2010)
Schindler, K., Van Gool, L.: Action Snippets: How many frames does human action recognition require? In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Thurau, C., Hlavác, V.: Pose primitive based human action recognition in videos or still images. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Dalai, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conference on Computer Vision and Pattern Recognition (2005)
Bay, H., Tuytelaars, T., Van Gool, L.: SURF: Speeded up robust features. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3951, pp. 404–417. Springer, Heidelberg (2006)
Laptev, I., Lindeberg, T.: SpaceCtime interest points. In: Tenth IEEE International Conference on Computer Vision (2003)
Harris, C., Stephens, M.: A combined corner and edge detector. In: Alvey Vision Conference, p. 50 (1988)
Dollar, P., Rabaud, V., Cottrell, G., Belongie, S.: Behavior recognition via sparse spatio-temporal features. In: 2nd Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (2005)
Danafar, S., Gheissari, N.: Action recognition for surveillance applications using optic flow and SVM. In: Yagi, Y., Kang, S.B., Kweon, I.S., Zha, H. (eds.) ACCV 2007, Part II. LNCS, vol. 4844, pp. 457–466. Springer, Heidelberg (2007)
Lu, W., Little, J.: Simultaneous tracking and action recognition using the pca-hog descriptor. In: Canadian Conference on Computer and Robot Vision (2006)
Patron-Perez, A., Reid, I.: A probabilistic framework for recognizing similar actions using spatio-temporal features. In: British Machine Vision Conference
Likas, A., Vlassis, N., et al.: The global k-means clustering algorithm. Pattern Recognition, 451–461 (2003)
Yu, L., Liu, H.: Feature selection for high-dimensional data: A fast correlation-based filter solution. Machine Learning-International, 856 (2003)
Fathi, A., Mori, G.: Action recognition by learning mid-level motion features. In: IEEE Conference on Computer Vision and Pattern Recognition (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhou, Z., Song, M., Zhang, L., Tao, D., Bu, J., Chen, C. (2011). kPose: A New Representation For Action Recognition. In: Kimmel, R., Klette, R., Sugimoto, A. (eds) Computer Vision – ACCV 2010. ACCV 2010. Lecture Notes in Computer Science, vol 6494. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19318-7_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-19318-7_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19317-0
Online ISBN: 978-3-642-19318-7
eBook Packages: Computer ScienceComputer Science (R0)