research-article

On efficient use of multi-view data for activity recognition

Authors:
Tommi Määttä

University of Technology, Eindhoven, The Netherlands

University of Technology, Eindhoven, The Netherlands
View Profile

,
Aki Härmä

Philips Research, Eindhoven, The Netherlands

Philips Research, Eindhoven, The Netherlands
View Profile

,
Hamid Aghajan

Stanford University

Stanford University
View Profile

ICDSC '10: Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart CamerasAugust 2010Pages 158–165https://doi.org/10.1145/1865987.1866012

Published:31 August 2010Publication History

ICDSC '10: Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart Cameras

Pages 158–165

ABSTRACT

The focus of the paper is on studying five different methods to combine multi-view data from an uncalibrated smart camera network for human activity recognition. The multi-view classification scenarios studied can be divided to two categories: view selection and view fusion methods. Selection uses a single view to classify, whereas fusion merges multi-view data either on the feature- or label-level. The five methods are compared in the task of classifying human activities in three fully annotated datasets: MAS, VIHASI and HOMELAB, and a combination dataset MAS+VIHASI. Classification is performed based on image features computed from silhouette images with a binary tree structured classifier using 1D CRF for temporal modeling. The results presented in the paper show that fusion methods outperform practical selection methods. Selection methods have their advantages, but they strongly depend on how good of a selection criteria is used, and how well this criteria adapts to different environments. Furthermore, fusion of features outperforms other scenarios within more controlled settings. But the more variability exists in camera placement and characteristics of persons, the more likely improved accuracy in multi-view activity recognition can be achieved by combining candidate labels.

References

}}Muhavi-mas dataset, http://dipersec.king.ac.uk/muhavi-mas.Google Scholar
}}Srikanth Cherla, Kaustubh Kulkarni, Amit Kale, and V. Ramasubramanian. Towards fast, view-invariant human action recognition. Computer Vision and Pattern Recognition Workshop, 0:1--8, 2008.Google ScholarCross Ref
}}Robert Collins, Ralph Gross, and Jianbo Shi. Silhouette-based human identification from body shape and gait. In Intl' Conference on Face and Gesture, pages 351--356, October 2002. Google ScholarDigital Library
}}A. Farhadi, D. Forsyth, and R. White. Transfer learning in sign language. In Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on, pages 1--8, 17--22 2007.Google ScholarCross Ref
}}Ali Farhadi and Mostafa Kamali Tabrizi. Learning to recognize activities from the wrong view point. In ECCV '08: Proceedings of the 10th European Conference on Computer Vision, pages 154--166, Berlin, Heidelberg, 2008. Springer-Verlag. Google ScholarDigital Library
}}Hongzhe Han, Zhiliang Wang, Jiwei Liu, Zhengxi Li, Bin Li, and Zhongtao Han. Adaptive background modeling with shadow suppression. In Proc. of Intelligent Transportation Systems, pages 720--724, 2003.Google Scholar
}}John Lafferty, Andrew McCallum, and Fernando Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data, 2001.Google Scholar
}}Yijuan Lu, Ira Cohen, Xiang Sean Zhou, and Qi Tian. Feature selection using principal feature analysis. In MULTIMEDIA '07: Proceedings of the 15th international conference on Multimedia, pages 301--304, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
}}Andrew Mccallum, Dayne Freitag, and Fernando Pereira. Maximum entropy markov models for information extraction and segmentation. pages 591--598. Morgan Kaufmann, 2000.Google Scholar
}}Lawrence R. Rabiner. A tutorial on hidden markov models and selected applications in speech recognition. In Proceedings of the IEEE, pages 257--286, 1989.Google ScholarCross Ref

Index Terms

On efficient use of multi-view data for activity recognition
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object recognition
      2. Image and video acquisition
2. Software and its engineering
  1. Software organization and properties
    1. Software system structures
      1. Distributed systems organizing principles
        Organizing principles for web applications

Recommendations

Multi-view stacking for activity recognition with sound and accelerometer data

A multi-view stacking based method for sensor fusion in activity recognition.Home-tasks activities recognition from accelerometer and audio signals.Combining heterogeneous wearable sensors using stacked generalization. Many Ambient Intelligence (AmI) ...
Read More
Child-activity recognition from multi-sensor data
MB '10: Proceedings of the 7th International Conference on Methods and Techniques in Behavioral Research

The automatic recognition of child activity using multi-sensor data enables various applications such as child-development monitoring, energy-expenditure estimation, child-obesity prevention, child safety in and around the home, etc. We formulate the ...
Read More
A practical multi-sensor activity recognition system for home-based care

To cope with the increasing number of aging population, a type of care which can help prevent or postpone entry into institutional care is preferable. Activity recognition can be used for home-based care in order to help elderly people to remain at home ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ICDSC '10: Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart Cameras
August 2010
252 pages
ISBN:9781450303170
DOI:10.1145/1865987
General Chair:
Marilyn Wolf
Georgia Tech
,
Program Chairs:
Gian Luca Foresti
University of Udine, Italy
,
Horst Bischof
Graz University of Technology, Austria
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 31 August 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate92of117submissions,79%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 13
  Total Citations
  View Citations
- 114
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

On efficient use of multi-view data for activity recognition

ICDSC '10: Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart Cameras

ABSTRACT

References

Cited By

Index Terms

Recommendations

Multi-view stacking for activity recognition with sound and accelerometer data

Child-activity recognition from multi-sensor data

A practical multi-sensor activity recognition system for home-based care