Skip to main content

Rank Aggregation Based Multi-filter Feature Selection Method for Software Defect Prediction

  • Conference paper
  • First Online:
Advances in Cyber Security (ACeS 2020)

Abstract

With the variety of different filter methods, selecting the most appropriate filter method which gives the best performance is a difficult task. Filter rank selection and stability problems make the selection of filter methods in SDP a hard choice. The best approach is to independently apply a mixture of filter methods and evaluate the results. This study presents a novel rank aggregation-based multi-filter feature selection method to address high dimensionality and filter rank selection problems in software defect prediction. The proposed method combines the rank list generated by individual filter methods from the software defect dataset using a rank aggregation mechanism into a single aggregated rank list. The proposed method aims to resolve the filter selection problem by using multiple filter methods of diverse computational characteristics to produce a more stable (non-disjoint) and complete feature rank list better than individual filter methods employed. The effectiveness of the proposed method was evaluated by applying with Decision Tree (DT) and Naïve Bayes (NB) models on defect datasets from NASA repository. From the experimental results, the proposed method had a superior effect (positive) on the prediction performance of selected prediction models than other experimented methods. This makes the combining of individual filter rank methods a viable solution to the filter rank selection problem and enhancement of prediction models in SDP.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Basri, S., Almomani, M.A., Imam, A.A., Thangiah, M., Gilal, A.R., Balogun, A.O.: The organisational factors of software process improvement in small software industry: comparative study. In: Saeed, F., Mohammed, F., Gazem, N. (eds.) IRICT 2019. AISC, vol. 1073, pp. 1132–1143. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-33582-3_106

    Chapter  Google Scholar 

  2. Mojeed, H.A., Bajeh, A.O., Balogun, A.O., Adeleke, H.O.: Memetic Approach for Multi-Objective Overtime Planning in Software Engineering projects. J. Eng. Sci. Technol. 14, 3213–3233 (2019)

    Google Scholar 

  3. Bowes, D., Hall, T., Petrić, J.: Software defect prediction: do different classifiers find the same defects? Softw. Qual. J. 26(2), 525–552 (2017). https://doi.org/10.1007/s11219-016-9353-3

    Article  Google Scholar 

  4. Chen, X., Shen, Y., Cui, Z., Ju, X.: Applying feature selection to software defect prediction using multi-objective optimization. In: 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC), vol. 2, pp. 54–59. IEEE (2017)

    Google Scholar 

  5. Gao, K., Khoshgoftaar, T.M., Wang, H., Seliya, N.: Choosing software metrics for defect prediction: an investigation on feature selection techniques. Softw. Prac. Exp. 41, 579–606 (2011)

    Article  Google Scholar 

  6. Iqbal, A., Aftab, S., Matloob, F.: Performance analysis of resampling techniques on class imbalance issue in software defect prediction. Int. J. Inf. Technol. Comput. Sci 11, 44–54 (2019)

    Google Scholar 

  7. Balogun, A.O., Bajeh, A.O., Orie, V.A., Yusuf-Asaju, W.A.: Software defect prediction using ensemble learning: an ANP based evaluation method. FUOYEJET 3, 50–55 (2018)

    Article  Google Scholar 

  8. Balogun, A.O., Basri, S., Abdulkadir, S.J., Adeyemo, V.E., Imam, A.A., Bajeh, A.O.: Software defect prediction: analysis of class imbalance and performance stability. J. Eng. Sci. Technol. 14, 3294–3308 (2019)

    Google Scholar 

  9. Mabayoje, M.A., Balogun, A.O., Jibril, H.A., Atoyebi, J.O., Mojeed, H.A., Adeyemo, V.E.: Parameter tuning in KNN for software defect prediction: an empirical analysis. Jurnal Teknologi dan Sistem Komputer 7, 297–303 (2019). https://doi.org/10.14710/jtsiskom.2020.13669

    Article  Google Scholar 

  10. Jimoh, R., Balogun, A., Bajeh, A., Ajayi, S.: A PROMETHEE based evaluation of software defect predictors. JCSA 25, 106–119 (2018)

    Google Scholar 

  11. Kondo, M., Bezemer, C.-P., Kamei, Y., Hassan, A.E., Mizuno, O.: The impact of feature reduction techniques on defect prediction models. Empirical Softw. Eng. 24(4), 1925–1963 (2019). https://doi.org/10.1007/s10664-018-9679-5

    Article  Google Scholar 

  12. Lessmann, S., Baesens, B., Mues, C., Pietsch, S.: Benchmarking classification models for software defect prediction: a proposed framework and novel findings. IEEE Trans. Softw. Eng. 34, 485–496 (2008)

    Article  Google Scholar 

  13. Li, L., Lessmann, S., Baesens, B.: Evaluating software defect prediction performance: an updated benchmarking study. arXiv preprint arXiv:1901.01726 (2019)

  14. Mabayoje, M.A., Balogun, A.O., Bello, S.M., Atoyebi, J.O., Mojeed, H.A., Ekundayo, A.H.: Wrapper feature selection based heterogeneous classifiers for software defect prediction. AUJET 2, 1–1 (2019)

    Google Scholar 

  15. Ameen, A.O., Balogun, A.O., Usman, G., Fashoto, G.S.: Heterogeneous ensemble methods based on filter feature selection. Comput. Inf. Syst. Dev. Inform. J. 7, 63–78 (2016)

    Google Scholar 

  16. Muthukumaran, K., Rallapalli, A., Murthy, N.B.: Impact of feature selection techniques on bug prediction models. In: Proceedings of the 8th India Software Engineering Conference, pp. 120–129 (2015)

    Google Scholar 

  17. Rathore, S.S., Gupta, A.: A comparative study of feature-ranking and feature-subset selection techniques for improved fault prediction. In: Proceedings of the 7th India Software Engineering Conference, p. 7. ACM (2014)

    Google Scholar 

  18. Rodríguez, D., Ruiz, R., Cuadrado-Gallego, J., Aguilar-Ruiz, J.: Detecting fault modules applying feature selection to classifiers. In: 2007 IEEE International Conference on Information Reuse and Integration, pp. 667–672. IEEE (2007)

    Google Scholar 

  19. Wahono, R.S., Suryana, N., Ahmad, S.: Metaheuristic optimization based feature selection for software defect prediction. J. Softw. 9, 1324–1333 (2014)

    Article  Google Scholar 

  20. Balogun, A.O., Basri, S., Abdulkadir, S.J., Hashim, A.S.: Performance analysis of feature selection methods in software defect prediction: a search method approach. Appl. Sci. 9, 2764 (2019)

    Article  Google Scholar 

  21. Balogun, A.O., Basri, S., Mahamad, S., Abdulkadir, S.J., Almomani, M.A., Adeyemo, V.E., Al-Tashi, Q., Mojeed, H.A., Imam, A.A., Bajeh, A.O.: Impact of feature selection methods on the predictive performance of software defect prediction models: an extensive empirical study. Symmetry 12, 1147 (2020)

    Article  Google Scholar 

  22. Ghotra, B., McIntosh, S., Hassan, A.E.: A large-scale study of the impact of feature selection techniques on defect classification models. In: 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 146–157. IEEE (2017)

    Google Scholar 

  23. Xu, Z., Liu, J., Yang, Z., An, G., Jia, X.: The impact of feature selection on defect prediction performance: an empirical comparison. In: 2016 IEEE 27th International Symposium on Software Reliability Engineering (ISSRE), pp. 309–320. IEEE (2016)

    Google Scholar 

  24. Akintola, A.G., Balogun, A.O., Lafenwa-Balogun, F., Mojeed, H.A.: Comparative analysis of selected heterogeneous classifiers for software defects prediction using filter-based feature selection methods. FUOYEJET 3, 134–137 (2018)

    Article  Google Scholar 

  25. Yu, Q., Jiang, S., Zhang, Y.: The performance stability of defect prediction models with class imbalance: an empirical study. IEICE Trans. Inf. Sys. 100, 265–272 (2017)

    Article  Google Scholar 

  26. Shepperd, M., Song, Q., Sun, Z., Mair, C.: Data quality: Some comments on the nasa software defect datasets. IEEE Trans. Softw. Eng. 39, 1208–1215 (2013)

    Article  Google Scholar 

  27. James, G., Witten, D., Hastie, T., Tibshirani, R.: An Introduction to Statistical Learning. Springer (2013). https://doi.org/10.1007/978-1-4614-7138-7

  28. Kuhn, M., Johnson, K.: Applied Predictive Modeling. Springer, Berlin/Hedielberg (2013). https://doi.org/10.1007/978-1-4614-6849-3

  29. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM Sig. Exp. 11, 10–18 (2009)

    Article  Google Scholar 

  30. Ghotra, B., McIntosh, S., Hassan, A.E.: A large-scale study of the impact of feature selection techniques on defect classification models. In: Proceedings of 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 146–157. IEEE (2017)

    Google Scholar 

Download references

Acknowledgement

This research/paper was fully supported by Universiti Teknologi PETRONAS, under the Yayasan Universiti Teknologi PETRONAS (YUTP) Research Grant Scheme (YUTP-FRG/015LC0240) and (YUTP-FRG/015LC0297).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdullateef O. Balogun .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Balogun, A.O. et al. (2021). Rank Aggregation Based Multi-filter Feature Selection Method for Software Defect Prediction. In: Anbar, M., Abdullah, N., Manickam, S. (eds) Advances in Cyber Security. ACeS 2020. Communications in Computer and Information Science, vol 1347. Springer, Singapore. https://doi.org/10.1007/978-981-33-6835-4_25

Download citation

  • DOI: https://doi.org/10.1007/978-981-33-6835-4_25

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-33-6834-7

  • Online ISBN: 978-981-33-6835-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics