Abstract
The recognition of manual actions, i.e., hand movements, hand postures and gestures, plays an important role in human-computer interaction, while belonging to a category of particularly difficult tasks. Using a Vicon system to capture 3D spatial data, we investigate the recognition of manual actions in tasks such as pouring a cup of milk and writing into a book. We propose recognizing sequences in multidimensional time-series by first learning a smooth quantization of the data, and then using a variant of dynamic time warping to recognize short sequences of prototypical motions in a long unknown sequence. An experimental analysis validates our approach. Short manual actions are successfully recognized and the approach is shown to be spatially invariant. We also show that the approach speeds up processing while not decreasing recognition performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vicon motion capture system, http://www.vicon.com/
Caridakis, G., Karpouzis, K., Drosopoulos, A., Kollias, S.: SOMM: Self organizing Markov map for gesture recognition. Pattern Recogn. Lett. 31(1), 52–59 (2009)
Chang, L.Y., Pollard, N.S., Mitchell, T.M., Xing, E.P.: Feature selection for grasp recognition from optical markers. In: Proc. of the IEEE/RSJ Int. Conf. on Intelligent Robots and Systems, pp. 2944–2950 (2007)
Ekvall, S., Kragic, D.: Grasp recognition for programming by demonstration. In: Proc. IEEE Int. Conf. Robotics and Automation, pp. 748–753 (2005)
Gavrila, D.M., Davis, L.S.: 3-D model-based tracking and recognition of human movement. In: Proc. Int. Work. on Face and Gesture Recognition, Zurich, Switzerland (1995)
Jones, E., Oliphant, T., Peterson, P., et al.: SciPy: Open Source scientific tools for Python (2001-2010)
Keogh, E.J., Chu, S., Hart, D., Pazzani, M.J.: An online algorithm for segmenting time series. In: Proc. of the 2001 IEEE Int. Conf. on Data Mining, pp. 289–296 (2001)
Kohonen, T.: The self-organizing map. Proc. IEEE 78(9), 1464–1480 (1990)
Lee, H.-K., Kim, J.H.: An HMM-based threshold model approach for gesture recognition. IEEE Trans. Pattern Anal. Mach. Intell. 21, 961–973 (1999)
Lloyd, S.P.: Least squares quantization in PCM. IEEE Trans. Inform. Theor. 28(2), 129–137 (1982)
Martinetz, T., Schulten, K.: A neural-gas network learns topologies. Artificial Neural Networks, 397–402 (1991)
Ritter, H., Robert, H., Röthling, F., Steil, J.J.: Manual intelligence as a Rosetta Stone for robot cognition. In: ISRR (December 2007)
Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal Process. 26, 43–49 (1978)
Stiefmeier, T., Roggen, D.: Gestures are strings: Efficient online gesture spotting and classification using string matching. In: Proc. of 2nd Int. Conf. on Body Area Networks, BodyNets (2007)
Wendemuth, A.: Grundlagen der stochastischen Sprachverarbeitung, Oldenbourg (2004)
Zito, T., Wilbert, N., Wiskott, L., Berkes, P.: Modular toolkit for Data Processing (MDP): a Python data processing frame work. Frontiers in Neuroinformatics 2, 8 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martin, M., Maycock, J., Schmidt, F.P., Kramer, O. (2010). Recognition of Manual Actions Using Vector Quantization and Dynamic Time Warping. In: Graña Romay, M., Corchado, E., Garcia Sebastian, M.T. (eds) Hybrid Artificial Intelligence Systems. HAIS 2010. Lecture Notes in Computer Science(), vol 6076. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13769-3_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-13769-3_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13768-6
Online ISBN: 978-3-642-13769-3
eBook Packages: Computer ScienceComputer Science (R0)