Abstract
With the variety of different filter methods, selecting the most appropriate filter method which gives the best performance is a difficult task. Filter rank selection and stability problems make the selection of filter methods in SDP a hard choice. The best approach is to independently apply a mixture of filter methods and evaluate the results. This study presents a novel rank aggregation-based multi-filter feature selection method to address high dimensionality and filter rank selection problems in software defect prediction. The proposed method combines the rank list generated by individual filter methods from the software defect dataset using a rank aggregation mechanism into a single aggregated rank list. The proposed method aims to resolve the filter selection problem by using multiple filter methods of diverse computational characteristics to produce a more stable (non-disjoint) and complete feature rank list better than individual filter methods employed. The effectiveness of the proposed method was evaluated by applying with Decision Tree (DT) and Naïve Bayes (NB) models on defect datasets from NASA repository. From the experimental results, the proposed method had a superior effect (positive) on the prediction performance of selected prediction models than other experimented methods. This makes the combining of individual filter rank methods a viable solution to the filter rank selection problem and enhancement of prediction models in SDP.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Basri, S., Almomani, M.A., Imam, A.A., Thangiah, M., Gilal, A.R., Balogun, A.O.: The organisational factors of software process improvement in small software industry: comparative study. In: Saeed, F., Mohammed, F., Gazem, N. (eds.) IRICT 2019. AISC, vol. 1073, pp. 1132–1143. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-33582-3_106
Mojeed, H.A., Bajeh, A.O., Balogun, A.O., Adeleke, H.O.: Memetic Approach for Multi-Objective Overtime Planning in Software Engineering projects. J. Eng. Sci. Technol. 14, 3213–3233 (2019)
Bowes, D., Hall, T., Petrić, J.: Software defect prediction: do different classifiers find the same defects? Softw. Qual. J. 26(2), 525–552 (2017). https://doi.org/10.1007/s11219-016-9353-3
Chen, X., Shen, Y., Cui, Z., Ju, X.: Applying feature selection to software defect prediction using multi-objective optimization. In: 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC), vol. 2, pp. 54–59. IEEE (2017)
Gao, K., Khoshgoftaar, T.M., Wang, H., Seliya, N.: Choosing software metrics for defect prediction: an investigation on feature selection techniques. Softw. Prac. Exp. 41, 579–606 (2011)
Iqbal, A., Aftab, S., Matloob, F.: Performance analysis of resampling techniques on class imbalance issue in software defect prediction. Int. J. Inf. Technol. Comput. Sci 11, 44–54 (2019)
Balogun, A.O., Bajeh, A.O., Orie, V.A., Yusuf-Asaju, W.A.: Software defect prediction using ensemble learning: an ANP based evaluation method. FUOYEJET 3, 50–55 (2018)
Balogun, A.O., Basri, S., Abdulkadir, S.J., Adeyemo, V.E., Imam, A.A., Bajeh, A.O.: Software defect prediction: analysis of class imbalance and performance stability. J. Eng. Sci. Technol. 14, 3294–3308 (2019)
Mabayoje, M.A., Balogun, A.O., Jibril, H.A., Atoyebi, J.O., Mojeed, H.A., Adeyemo, V.E.: Parameter tuning in KNN for software defect prediction: an empirical analysis. Jurnal Teknologi dan Sistem Komputer 7, 297–303 (2019). https://doi.org/10.14710/jtsiskom.2020.13669
Jimoh, R., Balogun, A., Bajeh, A., Ajayi, S.: A PROMETHEE based evaluation of software defect predictors. JCSA 25, 106–119 (2018)
Kondo, M., Bezemer, C.-P., Kamei, Y., Hassan, A.E., Mizuno, O.: The impact of feature reduction techniques on defect prediction models. Empirical Softw. Eng. 24(4), 1925–1963 (2019). https://doi.org/10.1007/s10664-018-9679-5
Lessmann, S., Baesens, B., Mues, C., Pietsch, S.: Benchmarking classification models for software defect prediction: a proposed framework and novel findings. IEEE Trans. Softw. Eng. 34, 485–496 (2008)
Li, L., Lessmann, S., Baesens, B.: Evaluating software defect prediction performance: an updated benchmarking study. arXiv preprint arXiv:1901.01726 (2019)
Mabayoje, M.A., Balogun, A.O., Bello, S.M., Atoyebi, J.O., Mojeed, H.A., Ekundayo, A.H.: Wrapper feature selection based heterogeneous classifiers for software defect prediction. AUJET 2, 1–1 (2019)
Ameen, A.O., Balogun, A.O., Usman, G., Fashoto, G.S.: Heterogeneous ensemble methods based on filter feature selection. Comput. Inf. Syst. Dev. Inform. J. 7, 63–78 (2016)
Muthukumaran, K., Rallapalli, A., Murthy, N.B.: Impact of feature selection techniques on bug prediction models. In: Proceedings of the 8th India Software Engineering Conference, pp. 120–129 (2015)
Rathore, S.S., Gupta, A.: A comparative study of feature-ranking and feature-subset selection techniques for improved fault prediction. In: Proceedings of the 7th India Software Engineering Conference, p. 7. ACM (2014)
Rodríguez, D., Ruiz, R., Cuadrado-Gallego, J., Aguilar-Ruiz, J.: Detecting fault modules applying feature selection to classifiers. In: 2007 IEEE International Conference on Information Reuse and Integration, pp. 667–672. IEEE (2007)
Wahono, R.S., Suryana, N., Ahmad, S.: Metaheuristic optimization based feature selection for software defect prediction. J. Softw. 9, 1324–1333 (2014)
Balogun, A.O., Basri, S., Abdulkadir, S.J., Hashim, A.S.: Performance analysis of feature selection methods in software defect prediction: a search method approach. Appl. Sci. 9, 2764 (2019)
Balogun, A.O., Basri, S., Mahamad, S., Abdulkadir, S.J., Almomani, M.A., Adeyemo, V.E., Al-Tashi, Q., Mojeed, H.A., Imam, A.A., Bajeh, A.O.: Impact of feature selection methods on the predictive performance of software defect prediction models: an extensive empirical study. Symmetry 12, 1147 (2020)
Ghotra, B., McIntosh, S., Hassan, A.E.: A large-scale study of the impact of feature selection techniques on defect classification models. In: 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 146–157. IEEE (2017)
Xu, Z., Liu, J., Yang, Z., An, G., Jia, X.: The impact of feature selection on defect prediction performance: an empirical comparison. In: 2016 IEEE 27th International Symposium on Software Reliability Engineering (ISSRE), pp. 309–320. IEEE (2016)
Akintola, A.G., Balogun, A.O., Lafenwa-Balogun, F., Mojeed, H.A.: Comparative analysis of selected heterogeneous classifiers for software defects prediction using filter-based feature selection methods. FUOYEJET 3, 134–137 (2018)
Yu, Q., Jiang, S., Zhang, Y.: The performance stability of defect prediction models with class imbalance: an empirical study. IEICE Trans. Inf. Sys. 100, 265–272 (2017)
Shepperd, M., Song, Q., Sun, Z., Mair, C.: Data quality: Some comments on the nasa software defect datasets. IEEE Trans. Softw. Eng. 39, 1208–1215 (2013)
James, G., Witten, D., Hastie, T., Tibshirani, R.: An Introduction to Statistical Learning. Springer (2013). https://doi.org/10.1007/978-1-4614-7138-7
Kuhn, M., Johnson, K.: Applied Predictive Modeling. Springer, Berlin/Hedielberg (2013). https://doi.org/10.1007/978-1-4614-6849-3
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The WEKA data mining software: an update. ACM Sig. Exp. 11, 10–18 (2009)
Ghotra, B., McIntosh, S., Hassan, A.E.: A large-scale study of the impact of feature selection techniques on defect classification models. In: Proceedings of 2017 IEEE/ACM 14th International Conference on Mining Software Repositories (MSR), pp. 146–157. IEEE (2017)
Acknowledgement
This research/paper was fully supported by Universiti Teknologi PETRONAS, under the Yayasan Universiti Teknologi PETRONAS (YUTP) Research Grant Scheme (YUTP-FRG/015LC0240) and (YUTP-FRG/015LC0297).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Balogun, A.O. et al. (2021). Rank Aggregation Based Multi-filter Feature Selection Method for Software Defect Prediction. In: Anbar, M., Abdullah, N., Manickam, S. (eds) Advances in Cyber Security. ACeS 2020. Communications in Computer and Information Science, vol 1347. Springer, Singapore. https://doi.org/10.1007/978-981-33-6835-4_25
Download citation
DOI: https://doi.org/10.1007/978-981-33-6835-4_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-6834-7
Online ISBN: 978-981-33-6835-4
eBook Packages: Computer ScienceComputer Science (R0)