ABSTRACT
There has been a great deal of interest in defect prediction: using prediction models trained on historical data to help focus quality-control resources in ongoing development. Since most new projects do not have historical data, there is interest in cross-project prediction: using data from one project to predict defects in another. Sadly, results in this area have largely been disheartening. Most experiments in cross-project defect prediction report poor performance, using the standard measures of precision, recall, and F-score. We argue that these IR-based measures, while broadly applicable, are not well suited for the quality-control settings in which defect prediction models are actually used. Specifically, these measures are computed at a single threshold setting (typically a cutoff on the predicted probability of defectiveness returned by a logistic regression model). In practice, however, software quality-control processes choose from a range of time-and-cost vs. quality trade-offs: How many files shall we test? How many shall we inspect? We therefore argue that measures based on a variety of trade-offs, viz., inspecting or testing 5%, 10%, or 20% of files, would be more suitable. We study cross-project defect prediction from this perspective. We find that cross-project prediction performance is no worse than within-project performance, and substantially better than random prediction!
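To make the contrast between the two styles of evaluation concrete, the sketch below (a minimal illustration, not the paper's exact experimental protocol; the data, feature counts, and helper name recall_at_budget are hypothetical) trains a logistic regression on one "source" project and scores a "target" project, then reports both a conventional fixed-threshold F-score and the share of defective files caught when only the top 5%, 10%, or 20% of files, ranked by predicted defect probability, are inspected.

```python
# Sketch: threshold-based vs. inspection-budget evaluation of a defect predictor.
# All data here are synthetic; real studies would use per-file process/code metrics.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)

# Hypothetical source-project (training) and target-project (test) data:
# rows are files, columns are metrics, labels mark defective files.
X_src, y_src = rng.normal(size=(500, 6)), rng.integers(0, 2, 500)
X_tgt, y_tgt = rng.normal(size=(300, 6)), rng.integers(0, 2, 300)

model = LogisticRegression(max_iter=1000).fit(X_src, y_src)
prob = model.predict_proba(X_tgt)[:, 1]  # predicted probability of defectiveness

# Conventional IR-style measure, taken at the default 0.5 threshold.
print("F-score at threshold 0.5:", f1_score(y_tgt, prob >= 0.5))

def recall_at_budget(y_true, scores, budget):
    """Fraction of all defective files found in the top `budget` share of files,
    ranked by predicted defect probability (most suspicious first)."""
    order = np.argsort(-scores)
    k = max(1, int(round(budget * len(scores))))  # files we can afford to inspect
    caught = y_true[order[:k]].sum()
    return caught / max(1, y_true.sum())

# Budget-based measures at several cost/quality trade-off points.
for p in (0.05, 0.10, 0.20):
    print(f"defects caught inspecting top {p:.0%} of files:",
          round(recall_at_budget(y_tgt, prob, p), 2))
```

Under this budget-based view, a predictor is judged by how well it ranks files for a fixed inspection or testing effort, rather than by how well its probabilities happen to split at one arbitrary cutoff.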