Mutant reduction based on dominance relation for weak mutation testing

doi:10.1016/j.infsof.2016.05.001

Information and Software Technology

Volume 81, January 2017, Pages 82-96

https://doi.org/10.1016/j.infsof.2016.05.001

Abstract

Context: As a fault-based testing technique, mutation testing is effective at evaluating the quality of existing test suites. However, the large number of mutants results in a high computational cost. Mutant reduction is therefore of great importance for improving the efficiency of mutation testing.

Objective: We aim to reduce mutants for weak mutation testing based on the dominance relation between mutant branches.

Method: In our method, a new program is formed by inserting mutant branches into the original program. By analyzing the dominance relation between mutant branches in the new program, the non-dominated mutant branches are obtained, and the mutants corresponding to them are the mutants retained after reduction.

Results: The proposed method is applied to ten benchmark programs and six classes from open-source projects. The experimental results show that our method reduces over 80% of mutants on average, which greatly improves the efficiency of mutation testing.

Conclusion: We conclude that the dominance relation between mutant branches is important and useful for reducing mutants in mutation testing.

Introduction

Software testing, which seeks existing defects or faults in software before it is released to the market, is an important way to improve software quality. Mutation testing is commonly used to evaluate the quality of existing test suites and to guide testers in improving them [1]. Compared with test suites that satisfy structural coverage criteria, test suites that are mutation adequate can reveal more faults [2]. Mutation testing has attracted widespread attention from researchers and developers in both academia and industry.

Mutation testing is a fault-based technique [3], [4], and the related concepts are as follows. A mutant is generated by making a simple syntactic change to the original program. A rule used to perform such a syntactic change is called a mutation operator. If a test datum causes a mutant and its original program to produce different outputs, the mutant is said to be killed. A mutant is equivalent if it cannot be killed by any test datum. The adequacy of mutation testing, called the mutation score, is generally defined as the ratio of the number of killed mutants to the total number of non-equivalent mutants.
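As a minimal illustration of these definitions, the following Python sketch computes a mutation score for a toy program. The functions `original`, `mutant_sub`, `mutant_mul` and `mutant_commuted`, and the helpers `is_killed` and `mutation_score`, are hypothetical names chosen for this example and do not come from the paper; the equivalent mutant is hand-written for illustration.

```python
def original(a, b):
    return a + b

def mutant_sub(a, b):
    # Arithmetic operator replacement: '+' becomes '-'.
    return a - b

def mutant_mul(a, b):
    # Arithmetic operator replacement: '+' becomes '*'.
    return a * b

def mutant_commuted(a, b):
    # Operands swapped: equivalent for numbers, because a + b == b + a.
    return b + a

def is_killed(mutant, tests):
    """A mutant is killed if some test datum makes its output differ."""
    return any(mutant(*t) != original(*t) for t in tests)

def mutation_score(mutants, equivalent, tests):
    """Number of killed mutants divided by number of non-equivalent mutants."""
    candidates = [m for m in mutants if m not in equivalent]
    killed = sum(1 for m in candidates if is_killed(m, tests))
    return killed / len(candidates)

MUTANTS = [mutant_sub, mutant_mul, mutant_commuted]
EQUIVALENT = [mutant_commuted]

if __name__ == "__main__":
    # (2, 2) kills mutant_sub but not mutant_mul, since 2 + 2 == 2 * 2.
    print(mutation_score(MUTANTS, EQUIVALENT, [(2, 2)]))
    # Adding (1, 3) also kills mutant_mul.
    print(mutation_score(MUTANTS, EQUIVALENT, [(2, 2), (1, 3)]))
```

The equivalent mutant is excluded from the denominator, so a test suite is not penalized for a mutant that no test datum could kill.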

To reduce the execution cost of traditional mutation testing, Howden first proposed weak mutation testing [5]. Instead of checking a mutant after executing the whole program, weak mutation testing checks a mutant immediately after executing the mutated statement.
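The difference between the two checks can be sketched in Python as follows. The toy program, the chosen mutation (`x * 2` replaced by `x + 2`) and the helper names `run`, `weakly_killed` and `strongly_killed` are assumptions made for this illustration only.

```python
def run(x, mutated):
    """Return (state right after the mutated statement, final output)."""
    # Original statement: y = x * 2; mutated statement: y = x + 2.
    y = x + 2 if mutated else x * 2
    state_after_statement = y
    output = y > 10
    return state_after_statement, output

def weakly_killed(x):
    # Weak mutation: compare program states immediately after the statement.
    return run(x, False)[0] != run(x, True)[0]

def strongly_killed(x):
    # Traditional (strong) mutation: compare the final outputs.
    return run(x, False)[1] != run(x, True)[1]

if __name__ == "__main__":
    for x in (2, 3, 6):
        print(x, weakly_killed(x), strongly_killed(x))
```

For `x = 3` the intermediate values differ (6 versus 5) but both runs return `False`, so the mutant is killed under the weak criterion only; for `x = 6` it is killed under both, and for `x = 2` under neither.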

In mutation testing, mutants are employed to reflect possible real faults in software under test [6], [7], [8]. Many lines of code (LOCs), complicated statements, and a variety of data types [9] greatly increase the number of mutants. This results in a high computational cost, so mutant reduction is both important and necessary. Although several techniques for mutant reduction have been proposed [10], [11], [12], [13], [14], [15], [16], their efficiency needs to be further improved.

Just et al. focused on the COR and ROR mutation operators to identify redundant mutants [13]. Kaminski et al. sought a subset of relational operators that subsumes the others to reduce mutants [14]. Focusing only on a subset of mutation operators opens new research directions [13], [14]. Papadakis and Malevris transformed the problem of killing mutants into the problem of covering mutant branches in the new program, and generated test data by conventional approaches [17]. Although this is relatively efficient, a large number of unreduced mutants will inevitably add high complexity to the new program.
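The idea of turning mutant killing into branch coverage can be sketched as follows. This is only an illustration in the spirit of the transformation attributed to [17]: the exact instrumentation form, the function name `max_with_mutant_branches` and the `covered` set are assumptions, not the authors' implementation.

```python
def max_with_mutant_branches(a, b, covered):
    """Original logic 'return a if a > b else b' with mutant branches inserted.

    'covered' is a set recording which mutant branches were taken. Taking a
    mutant branch means the original and mutated predicates disagree, i.e.
    the corresponding mutant is killed under weak mutation.
    """
    # Mutant branch for '>' replaced by '>='.
    if (a > b) != (a >= b):
        covered.add(">=")
    # Mutant branch for '>' replaced by '<'.
    if (a > b) != (a < b):
        covered.add("<")
    # Mutant branch for '>' replaced by '=='.
    if (a > b) != (a == b):
        covered.add("==")
    # Original statement, left unchanged.
    return a if a > b else b

if __name__ == "__main__":
    for a, b in [(3, 3), (5, 2), (2, 5)]:
        covered = set()
        result = max_with_mutant_branches(a, b, covered)
        print((a, b), result, sorted(covered))
```

Each additional mutant adds one more branch to the instrumented program, which is why the number of unreduced mutants directly affects the complexity of the new program.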

In previous work on dominance analysis, Marre and Bertolino employed the subsumption relation between entities in a ddgraph (a simplified control flow graph) to seek a minimal set of entities, named the spanning set, so as to reduce the number of entities that need to be covered [18]. In addition, Ghiduk and Girgis identified the non-dominated nodes in a control flow graph (CFG) by analyzing the dominance relation between nodes [19]. Both methods operate on the original entities (nodes) for structural coverage testing. In contrast, we analyze the dominance relation between mutant branches, which are instrumented branches transformed from mutants using the method presented by Papadakis and Malevris for weak mutation testing. Our aim is to reduce the number of mutants and to improve the efficiency of testing.

Considering all the traditional (method-level) mutation operators, we first construct mutant branches based on the statements before and after mutation, and form the new program by fusing all mutant branches into the original program using the method proposed by Papadakis and Malevris [17]. Then, we identify redundant mutants according to the dominated mutant branches, which are found by manual analysis with the aid of the dominance relation graph. Mutants associated with the non-dominated mutant branches are retained. The test data that cover the non-dominated mutant branches can also cover all the mutant branches, i.e., kill all the mutants before reduction in weak mutation testing.
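The reduction step can be sketched in Python under two loudly stated assumptions. First, a mutant branch b_i is taken to dominate b_j when every test datum covering b_i also covers b_j, which is our reading of the text above rather than the paper's formal definition. Second, the manual analysis is replaced here by exhaustive enumeration over a small input domain, purely for illustration. All names (`ORIGINAL`, `MUTANTS`, `cover_sets`, `non_dominated`, `pick_tests`, `covers_all`) are hypothetical.

```python
from itertools import product

# Original predicate 'a > b' and its relational-operator mutants.
ORIGINAL = lambda a, b: a > b
MUTANTS = {
    ">=": lambda a, b: a >= b,
    "<": lambda a, b: a < b,
    "<=": lambda a, b: a <= b,
    "==": lambda a, b: a == b,
    "!=": lambda a, b: a != b,
    "true": lambda a, b: True,
    "false": lambda a, b: False,
}

def cover_sets(domain):
    """For each mutant branch, the inputs on which it is covered."""
    inputs = list(product(domain, repeat=2))
    return {name: frozenset(t for t in inputs if ORIGINAL(*t) != m(*t))
            for name, m in MUTANTS.items()}

def non_dominated(cover):
    """Keep branches that no other coverable branch strictly dominates.

    Branches with identical cover sets dominate each other, so only the
    first of them is kept. Branches that are never covered are skipped.
    """
    kept = []
    for j in cover:
        strictly_dominated = any(cover[i] and cover[i] < cover[j] for i in cover)
        duplicate_of_kept = any(cover[i] == cover[j] for i in kept)
        if cover[j] and not strictly_dominated and not duplicate_of_kept:
            kept.append(j)
    return kept

def pick_tests(cover, kept):
    """One test datum per retained branch."""
    return [min(cover[name]) for name in kept]

def covers_all(cover, tests):
    """Check that the tests cover every coverable mutant branch."""
    return all(any(t in cover[name] for t in tests)
               for name in cover if cover[name])

if __name__ == "__main__":
    cover = cover_sets(range(-2, 3))
    kept = non_dominated(cover)
    tests = pick_tests(cover, kept)
    print(kept, tests, covers_all(cover, tests))
```

On this toy predicate, three of the seven mutant branches remain, and one test datum per retained branch covers all seven. The figure applies to this example only and says nothing about the reduction rates reported in the paper.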

The basic idea of defining the dominance relation between mutant branches and applying it to reduce mutants was initially reported, with examples on several small programs, at the 2nd Chinese Search Based Software Engineering (CSBSE’2013) workshop [20]. Since that two-page abstract is preliminary, we have extended the idea in the following four directions:

(1) defining four concepts: mutant branch, dominance relation, non-dominated branch, and dominance relation graph;
(2) presenting two theorems on how to form the non-dominated mutant branch set and identify the non-dominated mutants;
(3) providing an example throughout the whole paper to intuitively demonstrate the above work;
(4) evaluating the proposed method by applying it to ten benchmark programs and six classes from open-source projects with various sizes and complexities.

The main contributions of this paper are as follows:

• A method of reducing mutants is proposed for weak mutation testing, which is conducted by analyzing the dominance relation between mutant branches in the new program.

• Four definitions for identifying the dominance relation between mutant branches are provided, and the dominance relation graph is given to describe all the dominance relations in the new program.

• Two theorems for determining the non-dominated mutant branches are given, so as to reduce redundant mutants.

• The proposed method is applied to ten benchmark programs and six classes from open-source projects, and the experimental results suggest that our method reduces over 80% of mutants.

Section snippets

Related work

Reducing mutants is an effective way to save computational cost in mutation testing, and weak mutation testing is a technique aimed at saving execution time. In addition, statements in a program are correlated, and correlation analysis is helpful to mutation testing. This section reviews the related work on these three aspects.

The proposed method of reducing mutants

This section describes the method of reducing mutants by the dominance relation. In this method, we first construct mutant branches based on the statements before and after mutation, and a new program is formed by fusing all the mutant branches into the original program. Then, we analyze the dominance relation between mutant branches in the new program. Finally, we obtain the non-dominated mutant branches which correspond to the mutants after reduction.
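Once the dominance relations are known, selecting the non-dominated mutant branches is a small graph problem. The sketch below assumes the dominance relation graph is given as an adjacency mapping in which an edge from b_i to b_j means that b_i dominates b_j; this representation, the edge direction, the branch names and the functions `reachable` and `non_dominated` are assumptions for illustration and may differ from the graph defined in the paper.

```python
def reachable(graph, start):
    """All branches dominated by 'start', directly or transitively."""
    seen, stack = set(), [start]
    while stack:
        node = stack.pop()
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return seen

def non_dominated(graph):
    """Branches dominated by no branch outside their own mutual-dominance group.

    When several branches dominate one another, one representative
    (the smallest name) is kept for the group.
    """
    nodes = set(graph) | {n for targets in graph.values() for n in targets}
    reach = {n: reachable(graph, n) for n in nodes}
    kept = set()
    for j in nodes:
        dominators = {i for i in nodes if i != j and j in reach[i]}
        # Keep j only if every dominator of j is itself dominated by j.
        if all(i in reach[j] for i in dominators):
            kept.add(min(dominators | {j}))
    return sorted(kept)

if __name__ == "__main__":
    # Hypothetical graph: b1 dominates b2 and b3, b2 dominates b4,
    # and b5 and b6 dominate each other.
    graph = {
        "b1": ["b2", "b3"],
        "b2": ["b4"],
        "b3": [],
        "b4": [],
        "b5": ["b6"],
        "b6": ["b5"],
    }
    kept = non_dominated(graph)
    print(kept, 1 - len(kept) / len(graph))
```

In this hypothetical graph, two of the six mutant branches are retained, and the mutants associated with them would be the ones kept after reduction.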

Experiments

This section presents an experimental study to validate the effectiveness of the proposed method. First, the research questions are raised. Then, the experimental process is given. Finally, the experimental results on benchmark programs and open-source classes are analyzed.

Threats to the validity

This section presents several threats to the validity of our experiments and the methods of addressing them.

Construct validity: Determining the dominance relation between mutant branches is of considerable importance to mutant reduction. In the experiments, we determine the dominance relation by manual analysis. Testers with different skills and different degrees of familiarity with a program may give different analysis results for the same pair of branches. To reduce this kind of threat, we

Conclusions

In mutation testing of complex software, a large number of mutants are generated, which makes the computational cost too high for practical use. Reducing mutants has proved to be an effective way to improve the efficiency of mutation testing. However, no efficient approach is yet available that can remove a large number of mutants while maintaining a high mutation score.

We focus on mutant reduction in weak mutation testing, and propose

Acknowledgement

This work is jointly supported by the National Natural Science Foundation of China (Nos. 61375067 and 61203304) and the Natural Science Foundation of Jiangsu Province (No. BK2012566). We would like to thank Dr. Edward C. Mignot (formerly a Professor at Shandong University) for polishing this paper.

References (54)

Y. Jia et al.
Higher order mutation testing
Inform. Softw. Technol.
(2009)
F. Lammermann et al.
Evaluating evolutionary testability for structure-oriented testing with software measurements
Appl. Soft Comput.
(2008)
J.J. Dominguez-Jimenez et al.
Evolutionary mutation testing
Inform. Softw. Technol.
(2011)
M. Polo et al.
Decreasing the cost of mutation testing with second-order mutants
Softw. Test. Verif. Rel.
(2008)
G. Fraser et al.
Mutation-driven generation of unit tests and oracles
IEEE Trans. Softw. Eng.
(2012)
A.J. Offutt et al.
An experimental evaluation of data flow and mutation testing
Software
(1996)
Y. Jia et al.
An analysis and survey of the development of mutation testing
IEEE Trans. Softw. Eng.
(2011)
W.E. Howden
Weak mutation testing and completeness of test sets
IEEE Trans. Softw. Eng.
(1982)
J. Chen et al.
Research on software fault injection testing
Chinese J. Softw.
(2009)
J.H. Andrews et al.
Is mutation an appropriate tool for testing experiments
Proceedings of International Conference on Software Engineering
(2005)

J.H. Andrews et al.
Using mutation analysis for assessing and comparing testing coverage criteria
IEEE Trans. Softw. Eng.
(2006)
R. Just et al.
Using state infection conditions to detect equivalent mutants and speed up mutation analysis
Proceedings of Dagstuhl Seminar 13021 Symbolic Methods in Testing
(2013)
R. Just et al.
Using non-redundant mutation operators and test suite prioritization to achieve efficient and scalable mutation analysis
Proceedings of 23rd International Symposium on Software Reliability Engineering
(2012)
R. Just et al.
Do redundant mutants affect the effectiveness and efficiency of mutation analysis
Proceedings of IEEE 5th International Conference on Software Testing, Verification and Validation
(2012)
G. Kaminski et al.
Better predicate testing
Proceedings of 6th International Workshop on Automation of Software Test
(2011)
M.F. Lau et al.
An extended fault class hierarchy for specification-based testing
ACM Trans. Softw. Eng. Methodol.
(2005)
R.H. Untch et al.
Mutation analysis using mutant schemata
Proceedings of ACM SIGSOFT International Symposium on Software Testing and Analysis
(1993)
M. Papadakis et al.
Automatically performing weak mutation with the aid of symbolic execution, concolic testing and search-based testing
Softw. Qual. J.
(2011)
M. Marre et al.
Using spanning sets for coverage testing
IEEE Trans. Softw. Eng.
(2003)
A.S. Ghiduk et al.
Using genetic algorithms and dominance concepts for generating reduced test data
Informatica
(2010)
D. Gong et al.
Mutant reduction based on the dominant relation
Abstract of 2nd Chinese Search Based Software Engineering (CSBSE’2013)
(2013)
A.T. Acree
On Mutation
(1980)
T.A. Budd
Mutation Analysis of Program Test Data
(1980)
A.P. Mathur et al.
An Empirical Comparison of Mutation and Data Flow Based Test Adequacy Criteria
Technical Report
(1993)
S. Hussain
Mutation Clustering
(2008)
A.P. Mathur
Performance, effectiveness, and reliability issues in software testing
Proceedings of Computer Software and Applications Conference
(1991)
A.J. Offutt et al.
An experimental evaluation of selective mutation
Proceedings of 15th International Conference on Software Engineering
(1993)
