Abstract
State-of-the-art solutions to the classification of Non-functional requirements are mostly based on supervised machine learning models, requiring a reasonable amount of time in feature engineering. Deep learning, on the other hand, does not need to define features explicitly. This research aims to design and develop an automatic system to classify Non-functional requirements in multiple classes based on deep learning techniques. Specifically, we investigate the design and application of four neural network models; Artificial Neural Network, Convolutional Neural Network, Long Short-term Memory, and Gated Recurrent Unit to classify Non-functional requirements into five classes: reliability, usability efficiency, maintainability, and portability. However, these models require a large, annotated corpus and prone to overfitting. To address this, we proposed a novel framework for text augmentation. This technique uses a sort and concatenates approach to merge two sentences belonging to the same class to generate a two-time increase in data size yet preserving the domain vocabulary. We have compared our results with the state-of-the-art Easy data augmentation approach.
Our findings indicate that the NFRs classification model improved when trained with fine-tuned word embedding and CUSTOM augmentation approach. Interestingly Convolutional Neural Network turned out to be an outstanding learner with a jump in accuracy from 60% to 96% compared to the first approach. This simple text augmentation approach can add value for tasks where domain-specific terminologies play an essential role.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sabir, M., Chrysoulas, C., Banissi, E.: Multi-label classifier to deal with misclassification in non-functional requirements. In: Rocha, Á., Adeli, H., Reis, L.P., Costanzo, S., Orovic, I., Moreira, F. (eds.) Trends and Innovations in Information Systems and Technologies, pp. 486–493. Springer International Publishing, Cham (2020)
Griffiths, T.L., Steyvers, M., Tenenbaum, J.B.: Topics in semantic representation. Psychol. Rev. 114, 211–244 (2007). https://doi.org/10.1037/0033-295X.114.2.211
Lu, X., Zheng, B., Velivelli, A., Zhai, C.: Enhancing text categorization with semantic-enriched representation and training data augmentation. J. Am. Med. Inform. Assoc. 13, 526–535 (2006). https://doi.org/10.1197/jamia.M2051
Baker, C., Deng, L., Chakraborty, S., Dehlinger, J.: Automatic multi-class non-functional software requirements classification using neural networks. In: 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC), pp. 610–615. IEEE, Milwaukee, WI, USA (2019)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv:1301.3781 [cs] (2013)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM. 60, 84–90 (2017). https://doi.org/10.1145/3065386
Cagli, E., Dumas, C., Prouff, E.: Convolutional neural networks with data augmentation against jitter-based countermeasures. In: Fischer, W., Homma, N. (eds.) Cryptographic Hardware and Embedded Systems – CHES 2017, pp. 45–68. Springer International Publishing, Cham (2017)
Wong, S.C., Gatt, A., Stamatescu, V., McDonnell, M.D.: Understanding data augmentation for classification: when to warp? arXiv:1609.08764 [cs] (2016)
Amodei, D., Olah, C., Steinhardt, J., Christiano, P., Schulman, J., Mané, D.: Concrete Problems in AI Safety. arXiv:1606.06565 [cs] (2016)
Graves, A.: Generating sequences with recurrent neural networks. arXiv:1308.0850 [cs] (2014)
Shorten, C., Khoshgoftaar, T.M.: A survey on image data augmentation for deep learning. J Big Data. 6, 60 (2019). https://doi.org/10.1186/s40537-019-0197-0
Perera, P., Patel, V.M.: Deep transfer learning for multiple class novelty detection. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11536–11544. IEEE, Long Beach, CA, USA (2019)
Zeman, D.: CoNLL 2017 shared task: multilingual parsing from raw text to universal dependencies. In: Proceedings of the CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, p. 19. Association for Computational Linguistics, Vancouver, Canada (2017)
Quijas, J.: Analysing the Effects of Data Augmentation and Free Parameters for Text Classification with Recurrent Convolutional Neural Networks 54 (2017)
Coulombe, C.: Text Data Augmentation Made Simple By Leveraging NLP Cloud APIs 33 (2018)
Wei, J., Zou, K.: EDA: Easy data augmentation techniques for boosting performance on text classification tasks. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 6381–6387. Association for Computational Linguistics, Hong Kong, China (2019)
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. arXiv:1509.01626 [cs] (2016)
Kobayashi, S.: Contextual augmentation: data augmentation by words with paradigmatic relations. In: Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Short Papers), pp. 452–457. Association for Computational Linguistics, New Orleans, Louisiana (2018)
Porter, N.D., Verdery, A.M., Gaddis, S.M.: Enhancing big data in the social sciences with crowdsourcing: data augmentation practices, techniques, and opportunities. PLoS ONE 15, e0233154 (2020). https://doi.org/10.1371/journal.pone.0233154
Rosario, R.R.: A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Statistics 210 (2017)
Abulaish, M., Sah, A.K.: A Text data augmentation approach for improving the performance of CNN. In: 2019 11th International Conference on Communication Systems & Networks (COMSNETS), pp. 625–630. IEEE, Bengaluru, India (2019)
Inoue, H.: Data augmentation by pairing samples for images classification. arXiv:1801.02929 [cs, stat] (2018)
Liang, D., Yang, F., Zhang, T., Yang, P.: Understanding mixup training methods. IEEE Access. 6, 58774–58783 (2018). https://doi.org/10.1109/ACCESS.2018.2872698
Summers, C., Dinneen, M.J.: Improved mixed-example data augmentation. arXiv:1805.11272 [cs] (2019)
Takahashi, R., Matsubara, T., Uehara, K.: Data augmentation using random image cropping and patching for deep CNNs. IEEE Trans. Circuits Syst. Video Technol. 30, 2917–2931 (2020). https://doi.org/10.1109/TCSVT.2019.2935128
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Sabir, M., Banissi, E., Child, M. (2021). A Deep Learning-Based Framework for the Classification of Non-functional Requirements. In: Rocha, Á., Adeli, H., Dzemyda, G., Moreira, F., Ramalho Correia, A.M. (eds) Trends and Applications in Information Systems and Technologies . WorldCIST 2021. Advances in Intelligent Systems and Computing, vol 1366. Springer, Cham. https://doi.org/10.1007/978-3-030-72651-5_56
Download citation
DOI: https://doi.org/10.1007/978-3-030-72651-5_56
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-72650-8
Online ISBN: 978-3-030-72651-5
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)