Classifier ensemble generation and selection with multiple feature representations for classification applications in computer-aided detection and diagnosis on mammography

doi:10.1016/j.eswa.2015.10.014

Expert Systems with Applications

Volume 46, 15 March 2016, Pages 106-121

https://doi.org/10.1016/j.eswa.2015.10.014 Get rights and content

Highlights

•
Novel ensemble classifier framework for improved classification of breast lesions.
•
Ensemble generation algorithm using different types of breast lesion features.
•
Ensemble selection mechanism to find an optimal subset of component classifiers.
•
Impressive classification performance by comparing single classifier based methods.

Abstract

This paper presents a novel ensemble classifier framework for improved classification of mammographic lesions in Computer-aided Detection (CADe) and Diagnosis (CADx) systems. Compared to previously developed classification techniques in mammography, the main novelty of proposed method is twofold: (1) the “combined use” of different feature representations (of the same instance) and data resampling to generate more diverse and accurate base classifiers as ensemble members and (2) the incorporation of a novel “ensemble selection” mechanism to further maximize the overall classification performance. In addition, as opposed to conventional ensemble learning, our proposed ensemble framework has the advantage of working well with both weak and strong classifiers, extensively used in mammography CADe and/or CADx systems. Extensive experiments have been performed using benchmark mammogram dataset to test the proposed method on two classification applications: (1) false-positive (FP) reduction using classification between masses and normal tissues, and (2) diagnosis using classification between malignant and benign masses. Results showed promising results that the proposed method (area under the ROC curve (AUC) of 0.932 and 0.878, each obtained for the aforementioned two classification applications, respectively) impressively outperforms (by an order of magnitude) the most commonly used single neural network (AUC = 0.819 and AUC =0.754) and support vector machine (AUC = 0.849 and AUC = 0.773) based classification approaches. In addition, the feasibility of our method has been successfully demonstrated by comparing other state-of-the-art ensemble classification techniques such as Gentle AdaBoost and Random Forest learning algorithms.

Introduction

Breast cancer is the most common form of cancer among women and is the second leading cause of death (Kopans, 2007). To reduce the workload of radiologists and to improve the specificity and sensitivity in detection of breast cancer, two different types of automated screening systems are being developed (Suri & Rangayyan, 2006): (1) Computer-aided Detection (CADe) and (2) Computer-aided Diagnosis (CADx). Table 1 provides a brief review of CADe and CADx systems. Current CADe and/or CADx systems have been clearly shown to be quite sensitive in its ability to detect cancer, but one of their main drawbacks is the high number of FPs (defined in Table 1) (Suri and Rangayyan, 2006, Sampat, 2005). Hence, high FP rate for mass detection and diagnosis remains to be one of the major problems to be resolved in CADe/CADx study (Suri and Rangayyan, 2006, Sampat, 2005, Tang et al., 2009).

In typical CADe (or CADx) systems, classifier design is one of the key steps for determining FP rates (Suri and Rangayyan, 2006, Sampat, 2005). Thus far, research efforts have mostly been focused on the design of the single classifier in both CADe and CADx systems (Suri and Rangayyan, 2006, Sampat, 2005, Tang et al., 2009, Chan et al., 1999). It should be noted that there are two critical limitations within the classifier design process in mammogram images. First, the large variability in the appearance of mass patterns (Cheng et al., 2005, Velikova et al., 2013) – due to its irregular size, obscured borders, and complex mixtures of margin types – makes classification task quite difficult. Second, research in mammography is characterized by a restricted training data, due to cost, time, and availability to patient medical information and mammography images (Suri and Rangayyan, 2006, Bilska-Wolak and Floyd, 2004). On the other hand, the number of available features (arising from the integration of multiple heterogeneous feature types) is large (Cheng et al., 2005, Jesneck et al., 2006, Wei et al., 1997) (typically, in the thousands) relative to the number of training samples, so-called curse of dimensionality (Kuncheva, 2004). For these reasons, a single classifier design may face a great challenge in achieving a level of FP reduction that meets the requirement of clinical applications.

In this paper, to overcome the aforementioned limitations, we propose a new and novel ensemble classifier framework for classification applications (explained in Table 1) in mammographic CADe and CADx. This paper improves and extends preliminary work presented in Choi, Kim, Plataniotis, and Ro, (2012). In particular, this paper presents a new ensemble selection approach for selecting an optimal subset of base classifiers, aiming to further improve generalized (testing) classification performances. An improved ensemble generation technique is also outlined in the paper by introducing an advanced mechanism that allows the use of strong classifiers extensively used in mammography computer-aided detection and diagnosis systems. In addition, more insightful discussion of our ensemble generation on the local learner hypothesis viewpoint is provided. Moreover, we report integrated experimental results that are more extensive and rigorous in the following aspects: (1) additional assessment of our proposed ensemble classification on computer-aided diagnosis application; (2) the comparison of other state-of-the-art ensemble classification techniques; (3) comprehensive analysis using more classifier models.

The contents of the paper are organized as follows: Section 2 reviews previous work on classification of breast masses on mammograms in CADe and CADx systems. In Section 3, the region-of-interests (ROIs) segmentation and feature extraction methods used in our study are briefly described. Section 4 explains in detail the proposed ensemble classification framework. Section 5 contains the details of the image databases, and experimental setup and condition. In Section 6, we present a series of experimental results to demonstrate the effectiveness of the proposed method. Finally, concluding remarks are provided in Section 7.

Section snippets

Related work

In past years, considerable research efforts have been directed to classifier design aiming at classification applications in mammography. Wei et al. (1997) used global and local texture features extracted from manually selected ROIs of digitized mammograms, and linear discriminant analysis (LDA) to classify the masses from normal glandular tissues to minimize FP detections. Sahiner et al. (1996) proposed a convolution neural network (NN) for the task of discriminating between masses and normal

ROI segmentation and feature extraction

In typical CADe/CADx systems, segmentation of ROIs and feature extraction for generated ROIs are prerequisite steps prior to performing classification of ROIs (Suri & Rangayyan, 2006). Hence, in this section, we will briefly describe the segmentation algorithm and types of mammographic mass features used in our study before explaining our ensemble classifier. As recommended in Wei et al. (1997), Sahiner et al. (1996), Mudigonda et al., (2001), to perform a more realistic assessment of a

Proposed ensemble classifier system

Fig. 2 provides an overview of the proposed ensemble classifier framework. As shown in Fig. 2, this framework largely consists of three parts: (1) ensemble generation, (2) ensemble selection, and (3) ensemble fusion (or combination). Each of the three steps will be described in more detail in the following sections.

Data set and performance evaluation

The public Digital Database for Screening Mammography (DDSM) database (DB) (Heath, Bowyer, Kopans, Moore, & Kegelmeyer et al., 2000) was in our evaluation study. For data consistency purposes, all images were collected from the same type of scanner and resolution. We chose the scanner type Howtek 960 because a large number of cases are digitized by this type (Heath et al., 2000). All images collected from the DDSM were subsampled to 200 μm and quantized to 8 bits per pixel for computational

Evaluating classification of mass and normal tissues in CADe

The proposed ensemble classifier framework was tested on Dataset 1 described in Section 5. It should be noted that nine types of features each marked either E or E/X in the “Usage” column in Table 2 were used as different feature representations in this assessment [i.e., K (defined in Fig. 4) was set to 9]. As for base classifiers, SVM which utilizes a Radial Basis Function (Chang & Lin, 2011) (as kernel) and NN with back-propagation training algorithm (Setiono, 2001) was used.

We compared the

Discussion and conclusion

Note that several methods for classification algorithms have been developed as expert and intelligent systems in mammography (Diaz-Huerta et al., 2014, Junior et al., 2013, Nanni et al., 2012, Krishnan et al., 2010, Verma et al., 2010). However, most of these classification methods have been focused on the study of application of “the single classifier based solutions”. It has been widely accepted in Suri and Rangayyan (2006), Nishikawa (2007) and Tang et al. (2009) that mammographic mass

Acknowledgements

This work was partially supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. 2015R1A2A2A01005724).

References (69)

BriaA. et al.
Learning from unbalanced data: a cascade-based approach for detecting clustered microcalcifications
Medical Image Analysis
(2014)
ConstantinidisA.S. et al.
A new multi-expert decision combination algorithms and its application to the detection of circumscribed masses in digital mammograms
Pattern Recognition
(2001)
Diaz-HuertaC.C. et al.
Quantitative analysis of morphological techniques for automatic classification of micro-calcifications in digitized mammograms
Expert Systems with Applications
(2014)
FreundY. et al.
A decision-theoretic generalization of on-line learning and an application to boosting
Journal of Computer and System Sciences.
(1997)
GeorgiouH. et al.
Multi-scaled morphological features for the characterization of mammographic masses using statistical classification schemes
Artificial Intelligence in Medicine
(2007)
JainA. et al.
Score normalization in multimodal biometric systems
Pattern Recognition
(2005)
LiX. et al.
AdaBoost with SVM-based component classifiers
Engineering Applications of Artificial Intelligence
(2008)
LiX.Z. et al.
Background intensity independent texture features for assessing breast cancer risk in screening mammograms
Pattern Recognition Letters
(2013)
MavroforakisM.E. et al.
Mammographic masses characterization based on localized texture and dataset fractal analysis using linear, neural and support vector machine classifiers
Artificial Intelligence in Medicine
(2006)
NanniL. et al.
A very high performance system to discriminate tissues in mammograms as benign and malignant
Expert Systems with Applications
(2012)

NishikawaR.M.

Current status and future directions of computer-aided diagnosis in mammography

Computerized Medical Imaging and Graphics

(2007)

OliverA. et al.

Automatic microcalcification and cluster detection for digital and digitised mammograms

Knowlede-Based Systems

(2012)

RutaD. et al.

Classifier selection for majority voting

Information Fusion

(2005)

SampatM.P.

Computer-aided detection and diagnosis in mammography

VelikovaM. et al.

On the interplay of machine learning and background knowledge in image interpretation by Bayesian networks

Artificial Intelligence in Medicine

(2013)

VermaB.

Novel network architecture and learning algorithm for the classification of mass abnormalities in in digitized mammograms

Artificial Intelligence in Medicine

(2008)

AlimogluF. et al.

Combining multiple representations and classifiers for pen-based handwritten digit recognition

Turkish Journal of Electrical Engineering and Computer Sciences

(2001)

AsyaliM.H. et al.

Gene expression profile classification: a review

Current Bioinformatics

(2006)

Bilska-WolakA.O. et al.

Tolerance to missing data using a likelihood ratio based classifier for computer-aided classification of breast cancer

Physics in Medicine & Biology

(2004)

BreimanL.

Random forests

Machine Learning

(2001)

CatariousD.M. et al.

Incorporation of an iterative, linear segmentation routine into a mammographic mass CAD system

Medical Physics

(2004)

ChanH.P. et al.

Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers

Medical Physics

(1999)

ChangC.-C. et al.

LIBSVM: a library for support vector machines

Transactions on Intelligent Systems and Technology

(2011)

ChengH.D. et al.

Approaches for automated detection and classification of masses in mammograms

Pattern Recognition

(2005)

ChoiJ.Y. et al.

Multiresolution local binary pattern texture analysis combined with variable selection for application to false positive reduction in computer-aided detection of breast masses on mammograms

Physics in Medicine & Biology

(2012)

ChoiJ.Y. et al.

Combining multiple feature representations and adaboost ensemble learning for reducing false-positive detection in computer-aided detection of masses on mammograms

IEEE Engineering in Medicine and Biology Conference (EMBC)

(2012)

DietterichT.G.

An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization

Machine Learning

(2000)

DominguezA.R. et al.

Detection of masses in mammograms via statistically based enhancement, multilevel-thresholding segmentation, and region selection

Computerized Medical Imaging and Graphics

(2008)

DominguezA.R. et al.

Toward breast cancer diagnosis based on automated segmentation of masses in mammograms

Pattern Recognition

(2009)

EltonsyN.H. et al.

A concentric morphology model for the detection of masses in mammography

IEEE Transactions on Medical Imaging

(2007)

FriedmanJ. et al.

Additive logistic regression: a statistical view of boosting

Annals of Statistics

(2000)

GaltierV. et al.

AdaBoost parallelization on PC clusters with virtual shared memory for fast feature selection

HeathM. et al.

The digital database for screening mammography

HongB.W. et al.

Segmentation of regions of interest in mammograms in a topographic approach

IEEE Transactions on Information Technology in Biomedicine

(2010)

Cited by (49)

Computer-aided breast cancer detection and classification in mammography: A comprehensive review
2023, Computers in Biology and Medicine
Cancer is the second cause of mortality worldwide and it has been identified as a perilous disease. Breast cancer accounts for $\sim$ 20% of all new cancer cases worldwide, making it a major cause of morbidity and mortality. Mammography is an effective screening tool for the early detection and management of breast cancer. However, the identification and interpretation of breast lesions is challenging even for expert radiologists. For that reason, several Computer-Aided Diagnosis (CAD) systems are being developed to assist radiologists to accurately detect and/or classify breast cancer. This review examines the recent literature on the automatic detection and/or classification of breast cancer in mammograms, using both conventional feature-based machine learning and deep learning algorithms. The review begins with a comparison of algorithms developed specifically for the detection and/or classification of two types of breast abnormalities, micro-calcifications and masses, followed by the use of sequential mammograms for improving the performance of the algorithms. The available Food and Drug Administration (FDA) approved CAD systems related to triage and diagnosis of breast cancer in mammograms are subsequently presented. Finally, a description of the open access mammography datasets is provided and the potential opportunities for future work in this field are highlighted. The comprehensive review provided here can serve both as a thorough introduction to the field but also provide indicative directions to guide future applications.
A review on image-based approaches for breast cancer detection, segmentation, and classification
2021, Expert Systems with Applications
The breast cancer as the most life-threatening disease among the woman has emerged in the worldwide. It is supposed that the early testing and treatment for breast cancer detection would be avoided the surgeries and increase the survival rate. A variety of research studies have motivated to improve the diagnostic methods for early diagnosis of breast cancer. This study investigates the automatic and semi-automatic image-based approaches for breast cancer diagnosis. The scope of this research has limited to the images based diagnosis application journal that are published between 2016 and 2020 years. The principles and associated risk factors for diagnosis the breast cancer and existing imaging techniques are presented. The steps of diagnosis including preprocessing, segmentation, extracting tumor features, and tumor classification are investigated. The publicly available datasets for breast imaging are briefly introduced as well. The application issues, challenges of breast imaging technologies and future directions are discussed. Based on the detailed study, most proposed methods use one type of imaging modalities, however, the doctor need to investigate the multiple imaging techniques to accurate diagnosis and effective treatment. Moreover, handling the multiple imaging require the processing of big data using a cluster computing framework.
Feature discovery in NIR spectroscopy based Rocha pear classification
2021, Expert Systems with Applications
Non-invasive techniques for automatic fruit classification are gaining importance in the global agro-industry as they allow for optimizing harvesting, storage, management, and distribution decisions. Visible, near infra-red (NIR) diffuse reflectance spectroscopy is one of the most employed techniques in such fruit classification. Typically, after the acquisition of a fruit reflectance spectrum the wavelength domain signal is preprocessed and a classifier is designed. Up to now, little or no work considered the problem of feature generation and selection of the reflectance spectrum. This work aims at filling this gap, by exploiting a feature engineering phase before the classifier. The usual approach where the classifier is fed directly with the reflectances measured at each wavelength is contrasted with the proposed division of the spectra into bands and their characterization in wavelength, frequency, and wavelength-frequency domains. Feature selection is also applied for optimizing efficiency, predictive accuracy, and for mitigating over-training. A total of 3050 Rocha pear samples from different origins and harvest years are considered. Statistical tests of hypotheses on classification results of soluble solids content – a predictor of both fruit sweetness and ripeness – show that the proposed preliminary phase of feature engineering outperforms the usual direct approach both in terms of accuracy and in the number of necessary features. Moreover, the method allows for the identification of features that are physical chemistry meaningful.
Comparison of segmentation-free and segmentation-dependent computer-aided diagnosis of breast masses on a public mammography dataset
2021, Journal of Biomedical Informatics
Citation Excerpt :
We propose that these discrepancies result from some combination of differences in the segmentation techniques used, parameter tuning on small datasets in the original work, and implementation choices. Due to the importance of characteristics of the lesion margin in differentiating benign and malignant tumors, many existing CADx methods have been based on obtaining mathematical descriptions of the tumor outline [7,12–20]. Such segmentation-dependent techniques require accurate segmentation of the lesion margin in order to extract image features.
To compare machine learning methods for classifying mass lesions on mammography images that use predefined image features computed over lesion segmentations to those that leverage segmentation-free representation learning on a standard, public evaluation dataset.
We apply several classification algorithms to the public Curated Breast Imaging Subset of the Digital Database for Screening Mammography (CBIS-DDSM), in which each image contains a mass lesion. Segmentation-free representation learning techniques for classifying lesions as benign or malignant include both a Bag-of-Visual-Words (BoVW) method and a Convolutional Neural Network (CNN). We compare classification performance of these techniques to that obtained using two different segmentation-dependent approaches from the literature that rely on specific combinations of end classifiers (e.g. linear discriminant analysis, neural networks) and predefined features computed over the lesion segmentation (e.g. spiculation measure, morphological characteristics, intensity metrics).
We report area under the receiver operating characteristic curve (A_Z) values for malignancy classification on CBIS-DDSM for each technique. We find average A_Z values of 0.73 for a segmentation-free BoVW method, 0.86 for a segmentation-free CNN method, 0.75 for a segmentation-dependent linear discriminant analysis of Rubber-Band Straightening Transform features, and 0.58 for a hybrid rule-based neural network classification using a small number of hand-designed features.
We find that malignancy classification performance on the CBIS-DDSM dataset using segmentation-free BoVW features is comparable to that of the best segmentation-dependent methods we study, but also observe that a common segmentation-free CNN model substantially and significantly outperforms each of these (p < 0.05). These results reinforce recent findings suggesting that representation learning techniques such as BoVW and CNNs are advantageous for mammogram analysis because they do not require lesion segmentation, the quality and specific characteristics of which can vary substantially across datasets. We further observe that segmentation-dependent methods achieve performance levels on CBIS-DDSM inferior to those achieved on the original evaluation datasets reported in the literature. Each of these findings reinforces the need for standardization of datasets, segmentation techniques, and model implementations in performance assessments of automated classifiers for medical imaging.
Chaos enhanced grey wolf optimization wrapped ELM for diagnosis of paraquat-poisoned patients
2019, Computational Biology and Chemistry
Citation Excerpt :
Feature selection (FS) is an effective approach to figure out the high dimensional space of features. Due to it efficiently reducing the redundant features to improve the accuracy of identification, FS has been applied to the wide range of fields such as text classification (Ghareb et al., 2016), emotion recognition (Atkinson and Campos, 2016), medical diagnosis (Choi et al., 2016; Sheikhpour et al., 2016) and so on. In this study, we present an efficiently and effective diagnosis framework based on gas chromatography coupled with mass spectrometry (GC–MS), Enhanced grey wolf optimization (EGWO) and ELM together, namely GEE.
Paraquat (PQ) poisoning seriously harms the health of humanity. An effective diagnostic method for paraquat poisoned patients is a crucial concern. Nevertheless, it's difficult to identify the patients with low intake of PQ or delayed treatment. Here, a new efficient diagnostic approach to integrate machine learning and gas chromatography-mass spectrometry (GC–MS), named GEE, is proposed to identify the PQ poisoned patients. First, GC–MS provides the original data that efficiently identified the paraquat-poisoned patients. According to the high dimensionality of the original data, in the second stage, the chaos enhanced grey wolf optimization (EGWO) is adopted to search the optimal feature sets to improve the accuracy of identification. Finally, the extreme learning machine (ELM) is used to identify the PQ poisoned patients. To efficiently evaluate the proposed method, four measures were used in our experiments and comparisons were made with six other methods. The PQ-poisoned patients and robust volunteers can be well identified by GEE and the values of AUC, accuracy, sensitivity and specificity were 95.14%, 93.89%, 94.44% and 95.83%, respectively. Our experimental results demonstrated that GEE had better performance and might serve as a novel candidate diagnosis of PQ-poisoned patients.
Unlocking the Potential: The Crucial Role of Data Preprocessing in Big Data Analytics
2023, 2023 1st DMIHER International Conference on Artificial Intelligence in Education and Industry 4.0, IDICAIEI 2023

View all citing articles on Scopus

^¹: Present address: Department of Biomedical Engineering, Jungwon University, Chungcheongbuk-do, Republic of Korea.

View full text

Classifier ensemble generation and selection with multiple feature representations for classification applications in computer-aided detection and diagnosis on mammography

Highlights

Abstract

Introduction

Section snippets

Related work

ROI segmentation and feature extraction

Proposed ensemble classifier system

Data set and performance evaluation

Evaluating classification of mass and normal tissues in CADe

Discussion and conclusion

Acknowledgements

Medical Image Analysis

Pattern Recognition

Expert Systems with Applications

Journal of Computer and System Sciences.

Artificial Intelligence in Medicine

Pattern Recognition

Engineering Applications of Artificial Intelligence

Pattern Recognition Letters

Artificial Intelligence in Medicine

Expert Systems with Applications

Computerized Medical Imaging and Graphics

Knowlede-Based Systems

Information Fusion

Artificial Intelligence in Medicine

Artificial Intelligence in Medicine

Combining multiple representations and classifiers for pen-based handwritten digit recognition

Turkish Journal of Electrical Engineering and Computer Sciences

Gene expression profile classification: a review

Current Bioinformatics

Tolerance to missing data using a likelihood ratio based classifier for computer-aided classification of breast cancer

Physics in Medicine & Biology

Random forests

Machine Learning

Incorporation of an iterative, linear segmentation routine into a mammographic mass CAD system

Medical Physics

Classifier design for computer-aided diagnosis: Effects of finite sample size on the mean performance of classical and neural network classifiers

Medical Physics

LIBSVM: a library for support vector machines

Transactions on Intelligent Systems and Technology

Approaches for automated detection and classification of masses in mammograms

Pattern Recognition

Multiresolution local binary pattern texture analysis combined with variable selection for application to false positive reduction in computer-aided detection of breast masses on mammograms

Physics in Medicine & Biology

Combining multiple feature representations and adaboost ensemble learning for reducing false-positive detection in computer-aided detection of masses on mammograms

IEEE Engineering in Medicine and Biology Conference (EMBC)

An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization

Machine Learning

Detection of masses in mammograms via statistically based enhancement, multilevel-thresholding segmentation, and region selection

Computerized Medical Imaging and Graphics

Toward breast cancer diagnosis based on automated segmentation of masses in mammograms

Pattern Recognition

A concentric morphology model for the detection of masses in mammography

IEEE Transactions on Medical Imaging

Additive logistic regression: a statistical view of boosting

Annals of Statistics

AdaBoost parallelization on PC clusters with virtual shared memory for fast feature selection

The digital database for screening mammography

Segmentation of regions of interest in mammograms in a topographic approach

IEEE Transactions on Information Technology in Biomedicine