skip to main content
10.1145/1865987.1866012acmconferencesArticle/Chapter ViewAbstractPublication PagesicdscConference Proceedingsconference-collections
research-article

On efficient use of multi-view data for activity recognition

Published:31 August 2010Publication History

ABSTRACT

The focus of the paper is on studying five different methods to combine multi-view data from an uncalibrated smart camera network for human activity recognition. The multi-view classification scenarios studied can be divided to two categories: view selection and view fusion methods. Selection uses a single view to classify, whereas fusion merges multi-view data either on the feature- or label-level. The five methods are compared in the task of classifying human activities in three fully annotated datasets: MAS, VIHASI and HOMELAB, and a combination dataset MAS+VIHASI. Classification is performed based on image features computed from silhouette images with a binary tree structured classifier using 1D CRF for temporal modeling. The results presented in the paper show that fusion methods outperform practical selection methods. Selection methods have their advantages, but they strongly depend on how good of a selection criteria is used, and how well this criteria adapts to different environments. Furthermore, fusion of features outperforms other scenarios within more controlled settings. But the more variability exists in camera placement and characteristics of persons, the more likely improved accuracy in multi-view activity recognition can be achieved by combining candidate labels.

References

  1. }}Muhavi-mas dataset, http://dipersec.king.ac.uk/muhavi-mas.Google ScholarGoogle Scholar
  2. }}Srikanth Cherla, Kaustubh Kulkarni, Amit Kale, and V. Ramasubramanian. Towards fast, view-invariant human action recognition. Computer Vision and Pattern Recognition Workshop, 0:1--8, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  3. }}Robert Collins, Ralph Gross, and Jianbo Shi. Silhouette-based human identification from body shape and gait. In Intl' Conference on Face and Gesture, pages 351--356, October 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. }}A. Farhadi, D. Forsyth, and R. White. Transfer learning in sign language. In Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on, pages 1--8, 17--22 2007.Google ScholarGoogle ScholarCross RefCross Ref
  5. }}Ali Farhadi and Mostafa Kamali Tabrizi. Learning to recognize activities from the wrong view point. In ECCV '08: Proceedings of the 10th European Conference on Computer Vision, pages 154--166, Berlin, Heidelberg, 2008. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. }}Hongzhe Han, Zhiliang Wang, Jiwei Liu, Zhengxi Li, Bin Li, and Zhongtao Han. Adaptive background modeling with shadow suppression. In Proc. of Intelligent Transportation Systems, pages 720--724, 2003.Google ScholarGoogle Scholar
  7. }}John Lafferty, Andrew McCallum, and Fernando Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data, 2001.Google ScholarGoogle Scholar
  8. }}Yijuan Lu, Ira Cohen, Xiang Sean Zhou, and Qi Tian. Feature selection using principal feature analysis. In MULTIMEDIA '07: Proceedings of the 15th international conference on Multimedia, pages 301--304, New York, NY, USA, 2007. ACM. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. }}Andrew Mccallum, Dayne Freitag, and Fernando Pereira. Maximum entropy markov models for information extraction and segmentation. pages 591--598. Morgan Kaufmann, 2000.Google ScholarGoogle Scholar
  10. }}Lawrence R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. In Proceedings of the IEEE, pages 257--286, 1989.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. On efficient use of multi-view data for activity recognition

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICDSC '10: Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart Cameras
          August 2010
          252 pages
          ISBN:9781450303170
          DOI:10.1145/1865987

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 31 August 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate92of117submissions,79%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader