Abstract
The field of human–computer interaction greatly benefits from the significant role of speech emotion recognition (SER), which finds applications across various domains. However, practical applications of SER still face certain challenges. One such challenge is the variation in emotional expressions among individuals, while another issue arises from the presence of indistinguishable emotions, which can impact the stability of SER systems. This study investigates the application of variants of the Bacterial Foraging Optimization Algorithm (BFOA) in the domain of SER. Experiments are conducted on multiple emotion datasets, including Emo-DB, SAVEE, and SUBESCO, to evaluate the effectiveness of the proposed variants. The findings of this study emphasize the potential of BFOA variants as powerful tools for SER.
Similar content being viewed by others
References
Chen, L., Su, W., Feng, Y., Wu, M., She, J., Hirota, K.: Two-layer fuzzy multiple random forest for speech emotion recognition in human–robot interaction. Inf. Sci. 509, 150–163 (2020)
Panigrahi, S., Palo, H.: Analysis and recognition of emotions from voice samples using ant colony optimization algorithm. Lect. Not. Electr. Eng. 814, 219–231 (2022)
Vijaya Lakshmi, T.R., Sastry, P.N., Rajinikanth, T.: Feature optimization to recognize Telugu handwritten characters by implementing DE and PSO techniques. In: Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications: FICTA 2016, Vol. 2, pp. 397–405. Springer (2017)
Vijaya Lakshmi, T.R., Krishna Reddy, C.V.: Cancer prediction with gene expression profiling and differential evolution. Signal Image Video Process. 17(5), 1855–1861 (2023)
Deng, W., Yao, R., Zhao, H., Yang, X., Li, G.: A novel intelligent diagnosis method using optimal LS-SVM with improved PSO algorithm. Soft Comput. 23, 2445–2462 (2019)
Lakshmi, T.R.V.: Reduction of features to identify characters from degraded historical manuscripts. Alex. Eng. J. 57(4), 2393–2399 (2018)
Vijaya Lakshmi, T.R., Sastry, P.N., Rajinikanth, T.: Feature selection to recognize text from palm leaf manuscripts. Signal Image Video Process. 12, 223–229 (2018)
Huang, S., Dang, H., Jiang, R., Hao, Y., Xue, C., Gu, W.: Multi-layer hybrid fuzzy classification based on SVM and improved PSO for speech emotion recognition. Electronics (Switzerland) 10, 23 (2021)
Narmatha, P., Gupta, S., Lakshmi, T.R.V., Manikavelan, D.: Skin cancer detection from dermoscopic images using deep Siamese domain adaptation convolutional neural network optimized with honey badger algorithm. Biomed. Signal Process. Control 86, 105264 (2023)
Lakshmi, T.R.V., Reddy, C.V.K., Kora, P., Swaraja, K., Meenakshi, K., Kumari, C.U., Reddy, L.P.: Classification of multi-spectral data with fine-tuning variants of representative models. Multimed. Tools Appl. (2023). https://doi.org/10.1007/s11042-023-16291-z
Passino, K.M.: Biomimicry of bacterial foraging for distributed optimization and control. IEEE Control Syst. Mag. 22(3), 52–67 (2002)
Mishra, S., Bhende, C.: Bacterial foraging technique-based optimized active power filter for load compensation. IEEE Trans. Power Deliv. 22(1), 457–465 (2006)
Berlin database of emotional speech. http://emodb.bilderbar.info/index-1280.html
Jackson, P., Haq, S.: Surrey audio-visual expressed emotion (savee) database. University of Surrey, Guildford (2014)
Sultana, S., Rahman, M.S., Selim, M.R., Iqbal, M.Z.: SUST bangla emotional speech corpus (SUBESCO): an audio-only emotional speech corpus for bangla. PLoS ONE 16(4), e0250173 (2021)
Lakshmi, T.R.V., Krishna Reddy, C.V.: Classification of skin lesions by incorporating drop-block and batch normalization layers in representative CNN models. Arab. J. Sci. Eng. (2023). https://doi.org/10.1007/s13369-023-08131-x
Liogienė, T., Tamulevičius, G.: Multi-stage recognition of speech emotion using sequential forward feature selection. Sci. J. Riga Tech. Univ. Electr. Control Commun. Eng. 10, 35–41 (2016)
Demircan, S., Kahramanli, H.: Application of fuzzy c-means clustering algorithm to spectral features for emotion classification from speech. Neural Comput. Appl. 29, 59–66 (2018)
Funding
No funding was provided to carry out this work.
Author information
Authors and Affiliations
Contributions
TRVL designed the SER framework and took the lead to conduct simulations. CVKR aided in interpreting the results and worked on initial draft and proof of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Vijaya Lakshmi, T.R., Krishna Reddy, C.V. Modeling and simulation of bacterial foraging variants: acoustic feature selection and classification. SIViP 18, 607–613 (2024). https://doi.org/10.1007/s11760-023-02783-w
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11760-023-02783-w