Annotate. Train. Evaluate. A Unified Tool for the Analysis and Visualization of Workflows in Machine Learning Applied to Object Detection

Storz, Michael; Ritter, Marc; Manthey, Robert; Lietz, Holger; Eibl, Maximilian

doi:10.1007/978-3-642-39342-6_22

Annotate. Train. Evaluate. A Unified Tool for the Analysis and Visualization of Workflows in Machine Learning Applied to Object Detection

Michael Storz¹⁷,
Marc Ritter¹⁷,
Robert Manthey¹⁷,
Holger Lietz¹⁸ &
…
Maximilian Eibl¹⁷

Conference paper

2344 Accesses
3 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8008))

Abstract

The development of classifiers for object detection in images is a complex task that comprises the creation of representative and potentially large datasets from a target object by repetitive and time-consuming intellectual annotations, followed by a sequence of methods to train, evaluate and optimize the generated classifier. This is conventionally achieved by the usage and combination of many different tools. Here, we present a holistic approach to this scenario by providing a unified tool that covers the single development stages in one solution to facilitate the development process. We prove this concept by the example of creating a face detection classifier.

Download to read the full chapter text

Chapter PDF

References

Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes (VOC) Challenge. International Journal of Computer Vision 88(2), 303–338 (2010)
Article Google Scholar
Deng, J., Dong, W., Socher, R., Li, L., Li, K., Fei-Fei, L.: ImageNet: A large-scale hierarchical image database. In: IEEE International Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009)
Google Scholar
Miller, G.A.: WordNet: A Lexical Database for English. Communications of the ACM 38(11), 39–41 (1995)
Article Google Scholar
Jain, A.K., Duin, R.P.W., Gregory, R.L. (eds.): The Oxford Companion to the Mind, 2nd edn., pp. 698–703. Oxford University Press, Oxford (2004)
Google Scholar
Schneiderman, H.A.: Statistical method for 3D object detection applied to faces and cars. PhD Thesis, Carnegie Mellon University (2000)
Google Scholar
Huang, C., Ai, H., Li, Y., Lao, S.: High-Performance Rotation Invariant Multiview Face Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(4), 671–686 (2007)
Article Google Scholar
Phillips, P., Moon, H., Rizvi, S., Rauss, P.: The FERET evaluation methodology for face-recognition algorithms. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(10), 1090–1104 (2000)
Article Google Scholar
Angelova, A., Abu-Mostafam, Y., Perona, P.: Pruning training sets for learning of object categories. In: IEEE International Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, pp. 494–501 (2005)
Google Scholar
Georghiades, A., Belhumeur, P., Kriegman, D.: From few to many: illumination cone models for face recognition under variable lighting and pose. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(6), 643–660 (2001)
Article Google Scholar
Sim, T., Baker, S., Bsat, M.: The CMU Pose, Illumination, and Expression (PIE) database. In: Fifth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 46–51 (2002)
Google Scholar
Gross, R., Matthews, I., Cohn, J., Kanade, T., Baker, S.: Multi-PIE. Journal Image and Vision Computing 28(5), 807–813 (2010)
Article Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Journal Computer Vision and Image Understanding 106(1), 59–70 (2007)
Article Google Scholar
Griffin, G., Holub, A., Perona, P.: Caltech-256 Object Category Dataset. California Institute of Technology. Technical Report 7694 (2007)
Google Scholar
Russell, B., Torralba, A., Murphy, K., Freeman, W.: LabelMe: a database and web-based tool for image annotation. International Journal of Computer Vision 77(1), 157–173 (2008)
Article Google Scholar
Ahn, L., von, D.L.: Labeling images with a computer game. In: Proceedings of the 2004 Conference on Human Factors in Computing Systems, pp. 319–326 (2004)
Google Scholar
Yao, B., Yang, X., Zhu, S.-C.: Introduction to a large-scale general purpose ground truth database: Methodology, annotation tool and benchmarks. In: Yuille, A.L., Zhu, S.-C., Cremers, D., Wang, Y. (eds.) EMMCVPR 2007. LNCS, vol. 4679, pp. 169–183. Springer, Heidelberg (2007)
Chapter Google Scholar
Ladicky, L., Russell, C., Kohli, P., Torr, P.H.S.: Graph cut based inference with co-occurrence statistics. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 239–253. Springer, Heidelberg (2010)
Chapter Google Scholar
Boykov, Y., Veksler, O., Zabih, R.: Fast approximate energy minimization via graph cuts. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(11), 1222–1239 (2001)
Article Google Scholar
Doermann, D., Mihalcik, D.: Tools and Techniques for Video Performance Evaluation. In: Proc. 15th International Conference on Pattern Recognition, vol. 4, pp. 167–170 (2000)
Google Scholar
Lachiche, N., Flach, P.A.: Improving Accuracy and Cost of Two-class and Multi-class Probabilistic Classifiers Using ROC Curves. In: 20th International Conference on Machine Learning, pp. 416–423 (2003)
Google Scholar
Tanner, W.P.J.R., Swets, J.A., Welch, H.W.: A New Theory of Visual Detection. Defense Technical Information Center, Electronic Defense Group, University of Michigan. Technical Reports, p. 42 (1953)
Google Scholar
Metz, C.E.: Receiver operating characteristic analysis: A tool for the quantitative evaluation of observer performance and imaging systems. Journal of the American College Radiology 3(6), 413–422 (2006)
Article Google Scholar
World Meteorological Organization (Eds.): Manual on the Global Data Processing System, part II, Attachments II.7 and II.8. 2010, Updated in 2012. Switzerland, p. 193 (2012)
Google Scholar
Provost, F.J., Fawcett, T.: Robust Classification for Imprecise Environments. Machine Learning 42(3), 203–231 (2001)
Article MATH Google Scholar
Schreiner, C., Zhang, H., Guerrero, C., Torkkola, K., Zhang, K.: A Semi-Automatic Data Annotation Tool for Driving Simulator Data Reduction. In: Driving Simulation Conference, North America, p. 9 (2007)
Google Scholar
Meudt, S., Bigalke, L., Schwenker, F.: Atlas Annotation tool using partially supervised learning and multi-view co-learning in human-computer-interaction scenarios. In: 11th International Conference on Information Science, Signal Processing and their Applications, pp. 1309–1312 (2012)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA Data Mining Software: An Update. SIGKDD Explorations 11(1), 10–18 (2009)
Article Google Scholar
Shafait, F., Reif, M., Kofler, C., Breuel, T.: Pattern Recognition Engineering. In: RapidMiner Community Meeting and Conference, Dortmund, Germany (2010)
Google Scholar
Chang, H.J., Yi, K.M., Yin, S., Kim, S.W., Baek, Y.M., Ahn, H.S., Choi, J.Y.: PIL-EYE: Integrated System for Sustainable Development of Intelligent Visual Surveillance Algorithms. In: IEEE Digital Image Computing: Techniques and Applications, pp. 231–236 (2011)
Google Scholar
Sung, K.K., Poggio, T.: Example-based learning for view-based human face detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(1), 39–51 (1998)
Article Google Scholar
Schapire, R., Freund, Y.: A decision theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences 55(1), 119–139 (1997)
Article MathSciNet MATH Google Scholar
Schneiderman, H.: Learning statistical structure for object detection. In: Petkov, N., Westenberg, M.A. (eds.) CAIP 2003. LNCS, vol. 2756, pp. 434–441. Springer, Heidelberg (2003)
Chapter Google Scholar
Rowley, H., Baluja, S., Kanade, T.: Neural network-based face detection. In: International Conference on Computer Vision and Pattern Recognition, pp. 203–208 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Chair Media Informatics, Technische Universität Chemnitz, Chemnitz, Germany
Michael Storz, Marc Ritter, Robert Manthey & Maximilian Eibl
Professorship on Communications Engineering, Technische Universität Chemnitz, Chemnitz, Germany
Holger Lietz

Authors

Michael Storz
View author publications
You can also search for this author in PubMed Google Scholar
Marc Ritter
View author publications
You can also search for this author in PubMed Google Scholar
Robert Manthey
View author publications
You can also search for this author in PubMed Google Scholar
Holger Lietz
View author publications
You can also search for this author in PubMed Google Scholar
Maximilian Eibl
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

The Open University of Japan, 2-11 Wakaba, 261-8586, Chiba-shi, Mihama-ku, Japan
Masaaki Kurosu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Storz, M., Ritter, M., Manthey, R., Lietz, H., Eibl, M. (2013). Annotate. Train. Evaluate. A Unified Tool for the Analysis and Visualization of Workflows in Machine Learning Applied to Object Detection. In: Kurosu, M. (eds) Human-Computer Interaction. Towards Intelligent and Implicit Interaction. HCI 2013. Lecture Notes in Computer Science, vol 8008. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39342-6_22

Download citation

DOI: https://doi.org/10.1007/978-3-642-39342-6_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39341-9
Online ISBN: 978-3-642-39342-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics