skip to main content
10.1145/2733373.2806332acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

Learned vs. Hand-Crafted Features for Pedestrian Gender Recognition

Published:13 October 2015Publication History

ABSTRACT

This paper addresses the problem of image features selection for pedestrian gender recognition. Hand-crafted features (such as HOG) are compared with learned features which are obtained by training convolutional neural networks. The comparison is performed on the recently created collection of versatile pedestrian datasets which allows us to evaluate the impact of dataset properties on the performance of features. The study shows that hand-crafted and learned features perform equally well on small-sized homogeneous datasets. However, learned features significantly outperform hand-crafted ones in the case of heterogeneous and unfamiliar (unseen) datasets. Our best model which is based on learned features obtains 79% average recognition rate on completely unseen datasets. We also show that a relatively small convolutional neural network is able to produce competitive features even with little training data.

References

  1. L. Cao, M. Dikmen, Y. Fu, and T. S. Huang. Gender recognition from body. In ACM MM, Canada, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman. Return of the devil in the details: Delving deep into convolutional nets. CoRR, abs/1405.3531, 2014.Google ScholarGoogle Scholar
  3. M. Collins, J. Zhang, P. Miller, and H. Wang. Full body image feature representations for gender profiling. In ICCV, Japan, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  4. N. Dalal and B. Triggs. Histograms of oriented gradients for human detection. In CVPR, USA, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Deng, P. Luo, C. C. Loy, and X. Tang. Pedestrian attribute recognition at far distance. In ACM MM, USA, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. I. J. Goodfellow, Y. Bulatov, J. Ibarz, S. Arnoud, and V. Shet. Multi-digit number recognition from street view imagery using deep convolutional neural networks. CoRR, abs/1312.6082, 2013.Google ScholarGoogle Scholar
  7. J. A. Hanley and B. J. McNeil. The meaning and use of the area under a receiver operating characteristic (roc) curve. Radiology, 1982.Google ScholarGoogle Scholar
  8. G. E. Hinton, O. Vinyals, and J. Dean. Distilling the knowledge in a neural network. In NIPS Deep Learning Workshop, Canada, 2014.Google ScholarGoogle Scholar
  9. Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. CoRR, abs/1408.5093, 2014.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Joachims. Making large scale SVM learning practical. 1999.Google ScholarGoogle Scholar
  11. A. Krizhevsky, I. Sutskever, and G. E. Hinton. Imagenet classification with deep convolutional neural networks. In NIPS, USA, 2012.Google ScholarGoogle Scholar
  12. R. Layne, T. M. Hospedales, S. Gong, et al. Person re-identification by attributes. In BMVC, UK, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  13. Y. LeCun and Y. Bengio. Convolutional networks for images, speech, and time series. The handbook of brain theory and neural networks, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. G. Lowe. Object recognition from local scale-invariant features. In ICCV, Canada, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C.-B. Ng, Y.-H. Tay, and B.-M. Goi. A convolutional neural network for pedestrian gender recognition. In ISNN. Springer, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Ojala, M. Pietik\"ainen, and D. Harwood. A comparative study of texture measures with classification based on featured distributions. Pattern recognition, 1996.Google ScholarGoogle Scholar
  17. A. S. Razavian, H. Azizpour, J. Sullivan, and S. Carlsson. Cnn features off-the-shelf: an astounding baseline for recognition. CoRR, abs/1403.6382, 2014.Google ScholarGoogle Scholar
  18. O. Russakovsky, J. Deng, H. Su, J. Krause, and S. S. et al. ImageNet Large Scale Visual Recognition Challenge. CoRR, abs/1409.0575, 2014.Google ScholarGoogle Scholar
  19. Y. Taigman, M. Yang, M. Ranzato, and L. Wolf. Deepface: Closing the gap to human-level performance in face verification. In CVPR, USA, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Learned vs. Hand-Crafted Features for Pedestrian Gender Recognition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '15: Proceedings of the 23rd ACM international conference on Multimedia
      October 2015
      1402 pages
      ISBN:9781450334594
      DOI:10.1145/2733373

      Copyright © 2015 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 13 October 2015

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • short-paper

      Acceptance Rates

      MM '15 Paper Acceptance Rate56of252submissions,22%Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader