A Deep Learning-Based Framework for the Classification of Non-functional Requirements

Sabir, Maliha; Banissi, Ebad; Child, Mike

doi:10.1007/978-3-030-72651-5_56

Maliha Sabir ORCID: orcid.org/0000-0001-5657-9929

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1366)

Included in the following conference series:

World Conference on Information Systems and Technologies

Abstract

State-of-the-art solutions to the classification of non-functional requirements (NFRs) are mostly based on supervised machine learning models, which require a considerable amount of time for feature engineering. Deep learning, on the other hand, does not require features to be defined explicitly. This research aims to design and develop an automatic system that classifies non-functional requirements into multiple classes using deep learning techniques. Specifically, we investigate the design and application of four neural network models: Artificial Neural Network, Convolutional Neural Network, Long Short-term Memory, and Gated Recurrent Unit, to classify non-functional requirements into five classes: reliability, usability, efficiency, maintainability, and portability. However, these models require a large annotated corpus and are prone to overfitting. To address this, we propose a novel framework for text augmentation. This technique uses a sort-and-concatenate approach that merges two sentences belonging to the same class, doubling the size of the data while preserving the domain vocabulary. We compare our results with the state-of-the-art Easy Data Augmentation (EDA) approach.

Our findings indicate that the NFR classification models improved when trained with fine-tuned word embeddings and the CUSTOM augmentation approach. Interestingly, the Convolutional Neural Network turned out to be an outstanding learner, with accuracy rising from 60% to 96% compared to the first approach. This simple text augmentation approach can add value for tasks where domain-specific terminology plays an essential role.
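
The sort-and-concatenate idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state the sort key or the pairing rule, so the choices below are assumptions made only for illustration: sentences are sorted by word count within each class, and each sentence is merged with its successor in that order (wrapping around at the end). The function name `sort_and_concatenate` and the sample requirements are hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def sort_and_concatenate(samples):
    """Return the original (text, label) pairs plus one merged sentence per original.

    Assumption (not from the paper): within each class, sentences are sorted by
    word count, and each one is concatenated with its successor in that order,
    wrapping around at the end of the list.
    """
    by_label = defaultdict(list)
    for text, label in samples:
        by_label[label].append(text)

    augmented = list(samples)
    for label, texts in by_label.items():
        if len(texts) < 2:
            continue  # a class with a single sentence has no partner to merge with
        # Sort by word count, with the text itself as a deterministic tie-breaker.
        ordered = sorted(texts, key=lambda t: (len(t.split()), t))
        for i, text in enumerate(ordered):
            partner = ordered[(i + 1) % len(ordered)]
            augmented.append((f"{text} {partner}", label))
    return augmented


# Hypothetical requirement sentences, two per class.
requirements = [
    ("The system shall recover from a crash within 5 seconds.", "reliability"),
    ("The system shall be available 99% of the time.", "reliability"),
    ("The interface shall be usable by novice users.", "usability"),
    ("Help text shall be shown on every screen.", "usability"),
]

augmented = sort_and_concatenate(requirements)
print(len(requirements), "->", len(augmented))
```

Because every new sample is built only from existing sentences of the same class, no out-of-domain words are introduced, which is the property the abstract highlights. Under these assumptions, a class containing a single sentence is left unaugmented.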

Notes

1. https://vectors.nlpl.eu/repository/

Author information

Authors and Affiliations

London South Bank University, 103 Borough Road, London, SE1 0AA, UK
Maliha Sabir, Ebad Banissi & Mike Child

Corresponding authors

Correspondence to Maliha Sabir, Ebad Banissi or Mike Child.

Editor information

Editors and Affiliations

ISEG, University of Lisbon, Lisbon, Portugal
Álvaro Rocha
College of Engineering, The Ohio State University, Columbus, OH, USA
Hojjat Adeli
Institute of Data Science and Digital Technologies, Vilnius University, Vilnius, Lithuania
Gintautas Dzemyda
DCT, Universidade Portucalense, Porto, Portugal
Fernando Moreira
Department of Information Sciences, University of Sheffield, Lisbon, Portugal
Ana Maria Ramalho Correia

About this paper

Cite this paper

Sabir, M., Banissi, E., Child, M. (2021). A Deep Learning-Based Framework for the Classification of Non-functional Requirements. In: Rocha, Á., Adeli, H., Dzemyda, G., Moreira, F., Ramalho Correia, A.M. (eds) Trends and Applications in Information Systems and Technologies. WorldCIST 2021. Advances in Intelligent Systems and Computing, vol 1366. Springer, Cham. https://doi.org/10.1007/978-3-030-72651-5_56

DOI: https://doi.org/10.1007/978-3-030-72651-5_56
Published: 29 March 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-72650-8
Online ISBN: 978-3-030-72651-5
eBook Packages: Intelligent Technologies and Robotics (R0)
