Automatic visual inspection: An approach with multi-instance learning

doi:10.1016/j.compind.2016.09.002

Computers in Industry

Volume 83, December 2016, Pages 46-54

https://doi.org/10.1016/j.compind.2016.09.002 Get rights and content

Highlights

•
An approach for quality inspection with multi-instance learning is proposed.
•
Using weakly labeled images reduces the labeling effort in quality inspection.
•
Experiments show that the approach can be effectively used in real-world applications.

Abstract

One of the industrial applications of computer vision is automatic visual inspection. In the last decade, standard supervised learning methods have been used to detect defects in different kind of products. These methods are trained with a set of images where every image has to be manually segmented and labeled by experts in the application domain. These manual segmentations require a large amount of high quality delineations (on pixels), which can be time consuming and often a difficult task. Multi-instance learning (MIL), in contrast to standard supervised classifiers, avoids this task and can, therefore, be trained with weakly labeled images. In this paper, we propose an approach for the automatic visual inspection that uses MIL for defect detection. The approach has been tested with data from three artificial benchmark datasets and three real-world industrial scenarios: inspection of artificial teeth, weld defect detection and fishbone detection. Results show that the proposed approach can be used with weakly labeled images for defect detection on automatic visual inspection systems. This approach is able to increase the area under the receiver-operating characteristic curve (AUC) up to 6.3% compared with the naïve MIL approach of propagating the bag labels.

Introduction

Inspection is the process of determining whether a product deviates from a given set of specifications [1]. Traditionally, the inspection has been done by human operators; however, machine vision is being used to automate this process [1], [2], [3], [4]. An automatic inspection offers many advantages, such as an increased productivity, high standards of product quality in mass production, and the elimination of human errors [2], [5]. An automatic visual inspection system could be developed by one of the following three approaches:

•
Template-matching, where an image of the object under inspection is compared with one or more reference images [6].
•
The design-rule verification approach, that checks for the violation of a set of generic rules [6].
•
Inspection based on machine learning techniques, where the inspection system learns and generalizes the relations between the object features and the defects found in it [7].

The first two approaches have the disadvantage of requiring an explicit definition of rules and thresholds [8]. It leads to trial and error process that could take a long time before the system is ready to perform a specific inspection task. To deal with these issues, the use of machine learning techniques attempts to assimilate the reasoning skills of human inspectors [9].

Many systems have been proposed in the literature for automatic visual inspection at different domains, for example, for the quality control assurance in semiconductor [10], food [9], paper [11], fabric [12], metal [13] and ceramic tiles [14] industries, among others. Most of these systems use different standard supervised learning algorithms, such as decision trees [15], statistical classifiers [16], artificial neural networks [17], and support vector machines [18]. In order to learn to recognize defects in the products to be inspected, these algorithms should be trained with a dataset of images labeled by human operators. The decisions that the operators make about the defects will inevitably be different for some of the products to be inspected due to the subjectivity of the process (inter and intra operator contradictions) [19]. In this way, it is difficult to accurately and consistently assign labels to input images. Even in the industry, it is common having only weakly labeled images where the absence or presence of defects in an image is known, but the location of the defect and its precise delimitation are not available. In fact, this happens because manual annotation of the defects is time consuming, laborious or even impracticable in industrial applications.

We consider a relatively new learning paradigm called multi-instance learning (MIL) [20] applied to the automatic visual inspection process for defect detection. MIL allows to train a classifier with data that is assumed to have some ambiguity in how the labels are assigned, as occurs in visual inspection applications. The basic idea of this learning paradigm is that an object is represented by a bag, which is a set of feature vectors called instances, and the objective is to classify the bag as either positive or negative, in a two-class problem. MIL has been used in several computer vision applications, such as image categorization [21], image retrieval [22], object recognition [23] and target tracking [24], among others.

In contrast to other existing approaches, we propose to model the defect detection problem as a MIL problem, where the images of the objects to be inspected are taken as bags and their potential defects as instances. We tested the proposed approach with three artificial benchmark datasets from the inspection of textured materials and three real-world datasets from the artificial teeth industry, the welding industry and the fish fillet industry. The structure of the remaining parts of the paper is as follows: Section 2 presents the MIL paradigm. Section 3 introduces the proposed approach. Experimental results and their discussion are presented in Section 4. Section 5 depicts our conclusions.

Section snippets

Multi-instance learning

In pattern recognition, a standard supervised classifier learns a model that can be use to predict the class labels of unseen objects. In order to learn that model, each object is represented by a single d-dimensional feature vector $x_{i} \in ℝ^{d}$ which has associated a unique class label y_i ∈ Ω = {ω₁, …, ω_C} that specifies the object class. According to this representation, the classifier defines a mapping function from the feature space towards the set of class labels: $f (x) : ℝ^{d} \mapsto Ω .$

In many pattern recognition

Automatic visual inspection with MIL

In order to develop an automatic visual inspection system using the multi-instance learning paradigm, we need first to model the problem using bags and instances. To do that, we propose to use a segmentation approach to locate the potential defects in each image; then color, texture and shape features of them are extracted. Next, we consider an image as a bag, and the low-level features from all the potential defects as the instances. As a result, we can transform the automatic visual

Experiments and results

We perform several experiments to show the usefulness of the MIL approach for automatic visual inspection. We deal with six image datasets, three from real industrial scenarios and three synthetic datasets. Since one goal of this work is to illustrate that using MIL allows to detect defects with a reasonable performance, all the parameters of the algorithms were fixed for all experiments, as described in Table 2. This holds for all MIL algorithms we tested.

Conclusions

In this paper, we present a general MIL-based approach for automated visual inspection. This approach reduces the effort and time used in labeling the image dataset because it is only required to know the absence or presence of defects in the image, but its exact location and segmentation is not needed. It is opposite to existing standard supervised approaches for automated visual inspection that require a large amount of high quality manual delineations (on pixels), which are often time

Acknowledgment

This work was done while Carlos Mera was a visiting Ph.D. student at Pontificia Universidad Católica de Chile, supported by a scholarship from COLCIENCIAS -Departamento Administrativo de Ciencia, Tecnología e Innovación de Colombia. The authors also would like to thank the reviwers for their valuable suggestions and the agreement between Universidad Nacional de Colombia, Sede Medellín, and New Stetic S.A. to provide the images of teeth used in the experiments.

References (55)

T.S. Newman et al.
A survey of automated visual inspection
Comput. Vis. Image Underst.
(1995)
R.T. Chin
Automated visual inspection: 1981 to 1987
Comput. Vis. Graph. Image Process.
(1988)
E.N. Malamas et al.
A survey on industrial vision systems, applications and tools
Image Vis. Comput.
(2003)
J. Derganc et al.
Real-time automated visual inspection of color tablets in pharmaceutical blisters
Real Time Imag.
(2003)
C.-J. Du et al.
Learning techniques used in computer vision for food quality evaluation: a review
J. Food Eng.
(2006)
S.-H. Huang et al.
Automated visual inspection in the semiconductor industry: a survey
Comput. Ind.
(2015)
F. Bianconi et al.
A sequential machine vision procedure for assessing paper impurities
Comput. Ind.
(2014)
J. Molleda et al.
An improved 3D imaging system for dimensional quality inspection of rolled products in the metal industry
Comput. Ind.
(2013)
A.R. Farooq et al.
Dynamic photometric stereo for on line quality control of ceramic tiles
Comput. Ind.
(2005)
D. Unay et al.
Automatic grading of bi-colored apples by multispectral machine vision
Comput. Electron. Agric.
(2011)

C.W.M. Yuen et al.

A hybrid model using genetic algorithm and neural network for classifying garment defects

Exp. Syst. Appl.

(2009)

D. Mery et al.

Automated fish bone detection using X-ray imaging

J. Food Eng.

(2011)

T.G. Dietterich et al.

Solving the multiple instance problem with axis-parallel rectangles

Artif. Intell.

(1997)

J. Amores

Multiple instance classification: review, taxonomy and comparative study

Artif. Intell.

(2013)

V. Cheplygina et al.

Multiple instance learning with bag dissimilarities

Pattern Recognit.

(2015)

D. Li et al.

Pornographic images recognition based on spatial pyramid partition and multi-instance ensemble learning

Knowl. Based Syst.

(2015)

R.T. Chin et al.

Automated visual inspection: a survey

IEEE Trans. Pattern Anal. Mach. Intell.

(1982)

V. Piuri et al.

Computational intelligence in industrial quality control

D. Sannen

A Flexible Framework for Learning Visual Quality Inspection: Information Fusion Approaches (Ph.D.)

(2010)

X. Li et al.

Improving automatic detection of defects in castings by applying wavelet technique

IEEE Trans. Ind. Electron.

(2006)

A. Kumar

Computer-vision-based fabric defect detection: a survey

IEEE Trans. Ind. Electron.

(2008)

S. Chen et al.

Automated inspection of engineering ceramic grinding surface damage based on image recognition

Int. J. Adv. Manuf. Technol.

(2012)

D. Sannen et al.

Learning visual quality inspection from multiple humans using ensembles of classifiers

D. Li

RSSVM-based multi-instance learning for image categorization

Int. J. Comput. Sci. Issues

(2012)

C. Zhang et al.

A multiple instance learning approach for content based image retrieval using one-class support vector machine

P. Dollar et al.

Multiple component learning for object detection

B. Babenko et al.

Visual tracking with online multiple instance learning

Cited by (29)

Application of automation for in-line quality inspection, a zero-defect manufacturing approach
2023, Journal of Manufacturing Systems
Contemporary manufacturing must prioritise the sustainability of its manufacturing processes and systems. Zero Defect Manufacturing (ZDM) focusses on minimising waste of any kind using data-driven technology, hence enhancing the quality of all manufacturing aspects (product, process, service, etc.). Making things right on the first try is the central tenet of ZDM. In recent years, the application of automation for in-line quality inspection systems has begun to attract the interest of both practitioners and academics because of its capability to detect defects in real-time, and thus adapt the system to disturbances. In this work, we provide a systematic review of the literature on current trends in the application of automation for in-line quality inspection with the ultimate objective of achieving ZDM. Additionally, bibliometric and performance analyses have been performed to gain a complete picture of the field. In this work, we have collected bibliometric data from the most widely referred search engines for academic engineering papers, i.e. Scopus, Web of Science, and IEEE Explorer, involving a total of 145 academic publications from 2011 to 2021. Uniquely for this study, we used three research attributes for the analysis of the selected articles, that is, the level of automation, the condition for quality inspection, and the contribution to ZDM dimensions. The literature suggests that there is a lack of research on the use of in-line detection data for the prediction of defects or repair. Based on the results and our interpretation of the literature, an adapted framework of ZDM (Psarommatis et al., 2020a) and multi-layer quality inspection (Azamfirei et al., 2021a) is presented.
Development of training image database using web crawling for vision-based site monitoring
2022, Automation in Construction
Citation Excerpt :
In addition, other researchers investigated how to distribute the efforts required for labeling the data. For instance, Mera et al. [44] employed multi-instance learning trained with weakly labeled images. Liu and Golparvar-Fard [45] used a crowdsourcing technique to label the types of the workers' activities, and the technique referred to the collaborative participation of a crowd of people on an online platform (e.g., Amazon Mechanical Turk).
As most of the state-of-the-art technologies for vision-based monitoring were originated from machine learning or deep learning algorithms, it is crucial to build a large and rich training image database (DB). For this purpose, this paper proposes an automated framework that builds a large, high-quality training DB for construction site monitoring. The framework consists of three main processes: (1) automated construction image collection using web crawling, (2) automated image labeling using an image segmentation model, and (3) fully randomized foreground-background cross-oversampling. Using the developed framework, it was possible to automatically construct a training DB, composed of 5864 images, for the detection of construction objects in 53.5 min. The deep learning model trained by the DB successfully detected construction resources with an average precision of 92.71% and a recall rate of 88.14%. The findings of this study can reduce the time and effort required to develop vision-based site monitoring technologies.
Towards automatic visual inspection: A weakly supervised learning method for industrial applicable object detection
2020, Computers in Industry
Industrial visual detection is an essential part in modern industry for equipment maintenance and inspection. With the recent progress of deep learning, advanced industrial object detectors are built for smart industrial applications. However, deep learning methods are known data-hungry: the processes of data collection and annotation are labor-intensive and time-consuming. It is especially impractical in industrial scenarios to collect publicly available datasets due to the inherent diversity and privacy. In this paper, we explore automation of industrial visual inspection and propose a segmentation-aggregation framework to learn object detectors from weakly annotated visual data. The used minimum annotation is only image-level category labels without bounding boxes. The method is implemented and evaluated on collected insulator images and public PASCAL VOC benchmarks to verify its effectiveness. The experiments show that our models achieve high detection accuracy and can be applied in industry to achieve automatic visual inspection with minimum annotation cost.
Fantastic plastic? An image-based test method to detect aesthetic defects in batches based on reference samples
2020, Polymer Testing
Citation Excerpt :
Profiling different defect types within manufacturing can help in automating AQC for machine vision using quantifiable anomaly descriptors. Many different materials (e.g. leather [10], ceramic [11], stone [12], metals [3,13], and plastic etc. [4]) make AVI subject to real-time and in situ testing and can contribute detection, localization and classification of defects in polymer products. Highly reflective materials (e.g. plastic) gives varying reflectance and is part of the problem for AVI since specular highlights on objects create noise in the images and eradicate homogeneity.
The production of high-end manufactured products requires Aesthetic Quality Control (AQC) in the form of human visual inspection. Manufactures can reduce AQC costs by incorporating semi-automated visual defect detection in units with the existing 3D metrology scans. This paper demonstrates how an image-based test method for defect detection can reduce the workload related to human visual inspection by proposing a median master comparison of batch image series. Our contribution consist of a) contrast enhancing and sorting batch image series for human visual inspection and b) providing a quality index (nQI) incorporated into statistical process control (SPC) for monitoring and controlling the AQC process. Our data shows that the median master differencing together with the nQI is great for classification of defects in batch images series. We introduce a SPC design proposal where individual batches as well as aggregated data can be inspected in synergy with the principles of Six Sigma. Based on Six Sigma control limits we have reduced the number of images in need of review by AQC assessors by a factor of 13.
A novel convolutional neural network based fault recognition method via image fusion of multi-vibration-signals
2019, Computers in Industry
Citation Excerpt :
Complex feature extraction processes require specialized knowledge strongly and influence the final diagnosis result significantly. Recently, deep learning has been widely used in computer vision [23–25], acoustic recognition [26] and medical images [27,28]. Hinton et al. proposed hierarchical neural network model possessing strong pattern learning ability, for example deep belief network (DBN), which accelerated the development of deep learning [29,30].
This paper proposed a novel fault recognition method for rotating machinery on the basis of multi-sensor data fusion and bottleneck layer optimized convolutional neural network (MB-CNN). A conversion method converting vibration signals from multiple sensors to images is proposed that can integrate information to get richer features than vibration signals from single sensor. By this method feature maps of different fault types can be obtained without tedious parameter adjustments. Based on the feature maps from multi-sensor data, a corresponding novel convolutional neural network is also constructed. The constructed network performs the bottleneck layers with an increased number of input features to avoid information lost. The data at the same time node can be fused by the convolutional kernels of which the size matches the number of sensors. Practical examples of diagnosis for the wind power test rig and the centrifugal pump test rig are given in order to verify the effectiveness of the proposed approaches, and prediction accuracy of 99.47% and 97.32% is obtained respectively. Otherwise, the performances of other conventional methods such as deep belief network (DBN), support vector machine (SVM) and artificial neural network (ANN) are evaluated for contrast with the proposed method. As shown in the results, the novel convolutional neural network obtains higher recognition accuracy and faster convergence speed.
An intelligent vision-based approach for helmet identification for work safety
2018, Computers in Industry
Citation Excerpt :
They have been applied in face recognition [23,25,56,61], vehicle detection [22,53,65], pedestrian detection [18,42,55]. With the development of this field, automated visual inspections have been used widely in industrial applications [2,8,9,49,52]. For helmet identification, numerous related studies have been conducted.
Helmets are essential equipments to protect workers from danger during inspection and operation. Considering that some workers would not always obey the regulation, video surveillance systems covering the whole factory and supervisors are needed to monitor whether workers are wearing helmets or not. However, with a large number of surveillance screens, it is difficult to identify any helmet violation behavior during any time, which can lead to severe accidents. With the rapid development of image recognition technologies, computer vision-based inspections have been one of the most important industrial application areas. In this paper, an intelligent vision-based approach for helmet identification is proposed. This approach focuses on monitoring whether workers are wearing helmets or not, at the same time, identifying the colors of helmets. A color-based hybrid descriptor composed of local binary patterns (LBP), hu moment invariants (HMI) and color histograms (CH) is proposed to extract features of helmets with different colors (red, yellow and blue). Then a hierarchical support vector machine (H-SVM) is constructed to classify all features into four classes (red-helmet, yellow-helmet, blue-helmet and non-helmet). This approach is tested on our data set and the average accuracy of helmet identification is 90.3%.

View all citing articles on Scopus

View full text

Automatic visual inspection: An approach with multi-instance learning

Highlights

Abstract

Introduction

Section snippets

Multi-instance learning

Automatic visual inspection with MIL

Experiments and results

Conclusions

Acknowledgment

Comput. Vis. Image Underst.

Comput. Vis. Graph. Image Process.

Image Vis. Comput.

Real Time Imag.

J. Food Eng.

Comput. Ind.

Comput. Ind.

Comput. Ind.

Comput. Ind.

Comput. Electron. Agric.

Exp. Syst. Appl.

J. Food Eng.

Artif. Intell.

Artif. Intell.

Pattern Recognit.

Knowl. Based Syst.

Automated visual inspection: a survey

IEEE Trans. Pattern Anal. Mach. Intell.

Computational intelligence in industrial quality control

A Flexible Framework for Learning Visual Quality Inspection: Information Fusion Approaches (Ph.D.)

Improving automatic detection of defects in castings by applying wavelet technique

IEEE Trans. Ind. Electron.

Computer-vision-based fabric defect detection: a survey

IEEE Trans. Ind. Electron.

Automated inspection of engineering ceramic grinding surface damage based on image recognition

Int. J. Adv. Manuf. Technol.

Learning visual quality inspection from multiple humans using ensembles of classifiers

RSSVM-based multi-instance learning for image categorization

Int. J. Comput. Sci. Issues

A multiple instance learning approach for content based image retrieval using one-class support vector machine

Multiple component learning for object detection

Visual tracking with online multiple instance learning