DOI: 10.1145/2911996.2912069
short-paper

Incremental Learning for Fine-Grained Image Recognition

Published: 06 June 2016

ABSTRACT

This paper considers the problem of fine-grained image recognition with a growing vocabulary. In many real-world applications, a new object category or visual concept must be added with just a few images to learn from, so it is crucial to develop a method that can generalize the recognition model from existing classes to new classes. Deep convolutional neural networks are capable of constructing powerful image representations; however, these networks usually rely on a logistic loss function that cannot handle the incremental learning problem. In this paper, we present a new method that can efficiently learn a new class given only a limited number of training examples, which we evaluate on the problems of food and clothing recognition. To illustrate the performance of our proposed method on food recognition: when using only 1.3% of the training examples per category, we achieved about 73% of the F1-score obtained when using all available training data.


Published in

ICMR '16: Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval
June 2016
452 pages
ISBN: 9781450343596
DOI: 10.1145/2911996

Copyright © 2016 ACM


Publisher: Association for Computing Machinery, New York, NY, United States


Acceptance Rates

ICMR '16 paper acceptance rate: 20 of 120 submissions, 17%
Overall acceptance rate: 254 of 830 submissions, 31%
