Elsevier

Computers in Industry

Volume 83, December 2016, Pages 46-54
Computers in Industry

Automatic visual inspection: An approach with multi-instance learning

https://doi.org/10.1016/j.compind.2016.09.002Get rights and content

Highlights

  • An approach for quality inspection with multi-instance learning is proposed.

  • Using weakly labeled images reduces the labeling effort in quality inspection.

  • Experiments show that the approach can be effectively used in real-world applications.

Abstract

One of the industrial applications of computer vision is automatic visual inspection. In the last decade, standard supervised learning methods have been used to detect defects in different kind of products. These methods are trained with a set of images where every image has to be manually segmented and labeled by experts in the application domain. These manual segmentations require a large amount of high quality delineations (on pixels), which can be time consuming and often a difficult task. Multi-instance learning (MIL), in contrast to standard supervised classifiers, avoids this task and can, therefore, be trained with weakly labeled images. In this paper, we propose an approach for the automatic visual inspection that uses MIL for defect detection. The approach has been tested with data from three artificial benchmark datasets and three real-world industrial scenarios: inspection of artificial teeth, weld defect detection and fishbone detection. Results show that the proposed approach can be used with weakly labeled images for defect detection on automatic visual inspection systems. This approach is able to increase the area under the receiver-operating characteristic curve (AUC) up to 6.3% compared with the naïve MIL approach of propagating the bag labels.

Introduction

Inspection is the process of determining whether a product deviates from a given set of specifications [1]. Traditionally, the inspection has been done by human operators; however, machine vision is being used to automate this process [1], [2], [3], [4]. An automatic inspection offers many advantages, such as an increased productivity, high standards of product quality in mass production, and the elimination of human errors [2], [5]. An automatic visual inspection system could be developed by one of the following three approaches:

  • Template-matching, where an image of the object under inspection is compared with one or more reference images [6].

  • The design-rule verification approach, that checks for the violation of a set of generic rules [6].

  • Inspection based on machine learning techniques, where the inspection system learns and generalizes the relations between the object features and the defects found in it [7].

The first two approaches have the disadvantage of requiring an explicit definition of rules and thresholds [8]. It leads to trial and error process that could take a long time before the system is ready to perform a specific inspection task. To deal with these issues, the use of machine learning techniques attempts to assimilate the reasoning skills of human inspectors [9].

Many systems have been proposed in the literature for automatic visual inspection at different domains, for example, for the quality control assurance in semiconductor [10], food [9], paper [11], fabric [12], metal [13] and ceramic tiles [14] industries, among others. Most of these systems use different standard supervised learning algorithms, such as decision trees [15], statistical classifiers [16], artificial neural networks [17], and support vector machines [18]. In order to learn to recognize defects in the products to be inspected, these algorithms should be trained with a dataset of images labeled by human operators. The decisions that the operators make about the defects will inevitably be different for some of the products to be inspected due to the subjectivity of the process (inter and intra operator contradictions) [19]. In this way, it is difficult to accurately and consistently assign labels to input images. Even in the industry, it is common having only weakly labeled images where the absence or presence of defects in an image is known, but the location of the defect and its precise delimitation are not available. In fact, this happens because manual annotation of the defects is time consuming, laborious or even impracticable in industrial applications.

We consider a relatively new learning paradigm called multi-instance learning (MIL) [20] applied to the automatic visual inspection process for defect detection. MIL allows to train a classifier with data that is assumed to have some ambiguity in how the labels are assigned, as occurs in visual inspection applications. The basic idea of this learning paradigm is that an object is represented by a bag, which is a set of feature vectors called instances, and the objective is to classify the bag as either positive or negative, in a two-class problem. MIL has been used in several computer vision applications, such as image categorization [21], image retrieval [22], object recognition [23] and target tracking [24], among others.

In contrast to other existing approaches, we propose to model the defect detection problem as a MIL problem, where the images of the objects to be inspected are taken as bags and their potential defects as instances. We tested the proposed approach with three artificial benchmark datasets from the inspection of textured materials and three real-world datasets from the artificial teeth industry, the welding industry and the fish fillet industry. The structure of the remaining parts of the paper is as follows: Section 2 presents the MIL paradigm. Section 3 introduces the proposed approach. Experimental results and their discussion are presented in Section 4. Section 5 depicts our conclusions.

Section snippets

Multi-instance learning

In pattern recognition, a standard supervised classifier learns a model that can be use to predict the class labels of unseen objects. In order to learn that model, each object is represented by a single d-dimensional feature vector xid which has associated a unique class label yi  Ω = {ω1, …, ωC} that specifies the object class. According to this representation, the classifier defines a mapping function from the feature space towards the set of class labels:f(x):dΩ.

In many pattern recognition

Automatic visual inspection with MIL

In order to develop an automatic visual inspection system using the multi-instance learning paradigm, we need first to model the problem using bags and instances. To do that, we propose to use a segmentation approach to locate the potential defects in each image; then color, texture and shape features of them are extracted. Next, we consider an image as a bag, and the low-level features from all the potential defects as the instances. As a result, we can transform the automatic visual

Experiments and results

We perform several experiments to show the usefulness of the MIL approach for automatic visual inspection. We deal with six image datasets, three from real industrial scenarios and three synthetic datasets. Since one goal of this work is to illustrate that using MIL allows to detect defects with a reasonable performance, all the parameters of the algorithms were fixed for all experiments, as described in Table 2. This holds for all MIL algorithms we tested.

Conclusions

In this paper, we present a general MIL-based approach for automated visual inspection. This approach reduces the effort and time used in labeling the image dataset because it is only required to know the absence or presence of defects in the image, but its exact location and segmentation is not needed. It is opposite to existing standard supervised approaches for automated visual inspection that require a large amount of high quality manual delineations (on pixels), which are often time

Acknowledgment

This work was done while Carlos Mera was a visiting Ph.D. student at Pontificia Universidad Católica de Chile, supported by a scholarship from COLCIENCIAS -Departamento Administrativo de Ciencia, Tecnología e Innovación de Colombia. The authors also would like to thank the reviwers for their valuable suggestions and the agreement between Universidad Nacional de Colombia, Sede Medellín, and New Stetic S.A. to provide the images of teeth used in the experiments.

References (55)

  • C.W.M. Yuen et al.

    A hybrid model using genetic algorithm and neural network for classifying garment defects

    Exp. Syst. Appl.

    (2009)
  • D. Mery et al.

    Automated fish bone detection using X-ray imaging

    J. Food Eng.

    (2011)
  • T.G. Dietterich et al.

    Solving the multiple instance problem with axis-parallel rectangles

    Artif. Intell.

    (1997)
  • J. Amores

    Multiple instance classification: review, taxonomy and comparative study

    Artif. Intell.

    (2013)
  • V. Cheplygina et al.

    Multiple instance learning with bag dissimilarities

    Pattern Recognit.

    (2015)
  • D. Li et al.

    Pornographic images recognition based on spatial pyramid partition and multi-instance ensemble learning

    Knowl. Based Syst.

    (2015)
  • R.T. Chin et al.

    Automated visual inspection: a survey

    IEEE Trans. Pattern Anal. Mach. Intell.

    (1982)
  • V. Piuri et al.

    Computational intelligence in industrial quality control

  • D. Sannen

    A Flexible Framework for Learning Visual Quality Inspection: Information Fusion Approaches (Ph.D.)

    (2010)
  • X. Li et al.

    Improving automatic detection of defects in castings by applying wavelet technique

    IEEE Trans. Ind. Electron.

    (2006)
  • A. Kumar

    Computer-vision-based fabric defect detection: a survey

    IEEE Trans. Ind. Electron.

    (2008)
  • S. Chen et al.

    Automated inspection of engineering ceramic grinding surface damage based on image recognition

    Int. J. Adv. Manuf. Technol.

    (2012)
  • D. Sannen et al.

    Learning visual quality inspection from multiple humans using ensembles of classifiers

  • D. Li

    RSSVM-based multi-instance learning for image categorization

    Int. J. Comput. Sci. Issues

    (2012)
  • C. Zhang et al.

    A multiple instance learning approach for content based image retrieval using one-class support vector machine

  • P. Dollar et al.

    Multiple component learning for object detection

  • B. Babenko et al.

    Visual tracking with online multiple instance learning

  • Cited by (29)

    • Development of training image database using web crawling for vision-based site monitoring

      2022, Automation in Construction
      Citation Excerpt :

      In addition, other researchers investigated how to distribute the efforts required for labeling the data. For instance, Mera et al. [44] employed multi-instance learning trained with weakly labeled images. Liu and Golparvar-Fard [45] used a crowdsourcing technique to label the types of the workers' activities, and the technique referred to the collaborative participation of a crowd of people on an online platform (e.g., Amazon Mechanical Turk).

    • Fantastic plastic? An image-based test method to detect aesthetic defects in batches based on reference samples

      2020, Polymer Testing
      Citation Excerpt :

      Profiling different defect types within manufacturing can help in automating AQC for machine vision using quantifiable anomaly descriptors. Many different materials (e.g. leather [10], ceramic [11], stone [12], metals [3,13], and plastic etc. [4]) make AVI subject to real-time and in situ testing and can contribute detection, localization and classification of defects in polymer products. Highly reflective materials (e.g. plastic) gives varying reflectance and is part of the problem for AVI since specular highlights on objects create noise in the images and eradicate homogeneity.

    • A novel convolutional neural network based fault recognition method via image fusion of multi-vibration-signals

      2019, Computers in Industry
      Citation Excerpt :

      Complex feature extraction processes require specialized knowledge strongly and influence the final diagnosis result significantly. Recently, deep learning has been widely used in computer vision [23–25], acoustic recognition [26] and medical images [27,28]. Hinton et al. proposed hierarchical neural network model possessing strong pattern learning ability, for example deep belief network (DBN), which accelerated the development of deep learning [29,30].

    • An intelligent vision-based approach for helmet identification for work safety

      2018, Computers in Industry
      Citation Excerpt :

      They have been applied in face recognition [23,25,56,61], vehicle detection [22,53,65], pedestrian detection [18,42,55]. With the development of this field, automated visual inspections have been used widely in industrial applications [2,8,9,49,52]. For helmet identification, numerous related studies have been conducted.

    View all citing articles on Scopus
    View full text