“A speech recognizer” a tool to recognize the high clarity speech signal based on existing speech using ISCA

Velammal, M. Navaneetha; Kumar, P. Nirmal

doi:10.1007/s10470-018-1275-5

“A speech recognizer” a tool to recognize the high clarity speech signal based on existing speech using ISCA

Published: 13 August 2018

Volume 98, pages 41–58, (2019)
Cite this article

Analog Integrated Circuits and Signal Processing Aims and scope Submit manuscript

M. Navaneetha Velammal¹ &
P. Nirmal Kumar²

178 Accesses
Explore all metrics

Abstract

In this research a new way introduced for solving the underdetermined blind speech signal separation problem when the number of observation is less than the sources for which the ICA is no longer applicable, which enhance the time complexity for separation of signal. To resolve that, Improved Sparse Component Analysis (ISCA) is introduced to exploit the sparse nature of TF domain, which adopt a two-step processing that contains mixing matrix estimation followed by separation of source. This ISCA is based on fuzzy c-means with Particle swarm optimization (PSO) algorithm for mixed matrix Estimation. In our work PSO is used to separate the accurate voice signal from the random mixed signal by finding the best optima solution in the cluster part. Then the source signal separation is carried out based on the shortest path. These initial processing is done and verified by Mat lab and hardware description language is generated using HDL coder and it is synthesized using Xilinx ISE. The final result illustrates that the proposed system has an improved performance in terms of SNR, Efficiency and Accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Hybridization of spectral filtering with particle swarm optimization for speech signal enhancement

Article 24 November 2015

Single-channel speech separation using combined EMD and speech-specific information

Article 23 October 2017

Analysis of Optimized Spectral Subtraction Method for Single Channel Speech Enhancement

Article 10 September 2022

References

Dupont, S., & Luettin, J. (2000). Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia, 2(3), 141–151.
Article Google Scholar
Liutkus, A., Badeau, R., & Richard, G. (2011). Gaussian processes for underdetermined source separation. IEEE Transactions on Signal Processing, 59(7), 3155–3167.
Article MathSciNet MATH Google Scholar
Vincent, E., Araki, S., Theis, F., Nolte, G., Bofill, P., Sawada, H., et al. (2012). The signal separation evaluation Campaign (2007–2010): Achievements and remaining challenges. Signal Processing Journal, 92(8), 1928–1936.
Article Google Scholar
Kokkinakis, K., & Asoke Nandi, K. (2006). Multichannel blind deconvolution for source separation in convolutive mixtures of speech. IEEE Transactions on Audio, Speech and Language Processing, 14(1), 200–212.
Article Google Scholar
Daniel, W. E., Schobben, C., & Sommen, P. (2002). A frequency domain blind signal separation method based on decorrelation. IEEE Transactions on Signal Processing, 50(8), 1855–1865.
Article Google Scholar
Koldovsky, Z., & Tichavsky, P. (2011). Time-domain blind separation of audio sources on the basis of a complete ICA decomposition of an observation space. IEEE Transactions on Audio, Speech and Language Processing, 19(2), 406–416.
Article Google Scholar
Yilmaz, O., & Rickard, S. (2004). Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing, 52(7), 1830–1847.
Article MathSciNet MATH Google Scholar
Gribonval, R., Benaroya, L., Vincent, E., & Févotte, C. (2003). Proposals for performance measurement in source separation. In Proceedings of 4th international symposium on independent component analysis and blind signal separation (ICA2003) (pp. 763–768).
Lin, Q.-H., Yin, F.-L., Mei, T.-M., & Liang, H. (2006). A blind source separation based method for speech encryption. IEEE Transactions on Circuits and Systems I: Regular Papers, 53(6), 1320–1328.
Article MathSciNet MATH Google Scholar
Rojas, F., Puntonet, C. G., Górriz, J. M., & Valenzuela, O. (2005). Assessing the performance of several fitness functions in a genetic algorithm for nonlinear separation of sources. Advances in natural computation (pp. 863–872). Berlin Heidelberg: Springer.
Chapter Google Scholar
André Takahata, K., Everton Nadalin, Z., Ferrari, R., Duarte, L. T., Suyama, R., Renato Lopes, R., et al. (2012). Unsupervised processing of geophysical signals: A review of some key aspects of blind de convolution and blind source separation. IEEE Signal Processing Magazine, 29(4), 27–35.
Article Google Scholar
Fuentes, B., Badeau, R., & Richard, G. (2012). Blind harmonic adaptive decomposition applied to supervised source separation. In Proceedings of the IEEE 20th European signal processing conference (EUSIPCO) (pp. 2654–2658).
Diago Sorianoa, C., Suyamab, R., & Attux, R. (2011). Blind extraction of chaotic sources from mixtures with stochastic signals based on recurrence quantification analysis. Conference on Digital Signal Processing, 21, 417–426.
Article Google Scholar
Virtanen, T. (2007). Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria. IEEE Transactions on Audio, Speech and Language Processing, 15(3), 1066–1074.
Article Google Scholar
Sawada, H., Mukai, R., Araki, S., & Makino, S. (2004). A robust and precise method for solving the permutation problem of frequency-domain blind source separation. IEEE Transactions on Speech and Audio Processing, 12(5), 530–538.
Article Google Scholar
Araki, S., Sawada, H., Mukai, R., & Makino, S. (2007). Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors. Signal Processing, 87(8), 1833–1847.
Article MATH Google Scholar
Bofill, P., & Zibulevsky, M. (2001). Underdetermined blind source separation using sparse representations. Signal Processing, 81(11), 2353–2362.
Article MATH Google Scholar
Abrard, F., & Deville, Y. (2005). A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources. Signal Processing, 85(7), 1389–1403.
Article MATH Google Scholar
Puigt, M., & Deville, Y. (2005). Time–frequency ratio-based blind separation methods for attenuated and time-delayed sources. Mechanical Systems and Signal Processing, 19(6), 1348–1379.
Article Google Scholar
Reju, V. G., Koh, S. N., & Soon, I. Y. (2009). An algorithm for mixing estimation in instantaneous blind source separation. Signal Processing, 89(9), 1762–1773.
Article MATH Google Scholar
Sha, Z.-C., Huang, Z.-T., Zhou, Yi-Yu., & Wang, F.-H. (2013). Frequency-hopping signals sorting based on underdetermined blind source separation. IET Communications, 7(14), 1456–1464.
Article Google Scholar
Xie, S., Yang, L., Yang, J.-M., Zhou, G., & Xiang, Y. (2012). Time-Frequency Approach to Underdetermined Blind Source Separation. IEEE Transactions on Neural Networks and Learning Systems, 23(2), 306–316.
Article Google Scholar
Bao, G., Zhongfu Ye, X., & Zhou, Y. (2013). A compressed sensing approach to blind separation of speech mixture based on a two-layer sparsity model. IEEE Transactions on Audio, Speech and Language Processing, 21(5), 899–906.
Article Google Scholar
Alshabrawy, O. S. & Hassanien, A. E. (2014). Underdetermined blind separation of mixtures of an unknown number of sources with additive white and pink noises. In Proceedings of the Fifth international conference on innovations in bio-inspired computing and applications IBICA 2014 (pp. 241–250). Cham: Springer.
Zhang, L., Yang, J., Kaiwang, L., & Zhang, Q. (2014). Modified subspace method based on convex model for underdetermined blind speech separation. IEEE Transactions on Consumer Electronics, 60(2), 225–232.
Article Google Scholar
Abrard, F., & Deville, Y. (2005). A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources. Signal Process, 85(7), 1389–1403.
Article MATH Google Scholar
Puigt, M., & Deville, Y. (2005). Time–frequency ratio-based blind separation methods for attenuated and time-delayed sources. Mechanical Systems and Signal Processing, 19(6), 1348–1379.
Article Google Scholar
Araki, S., Sawada H. & Makino, S. (2007). Blind speech separation in a meeting situation with maximum SNR beamformers. In IEEE international conference on acoustics, speech and signal processing, 2007. ICASSP 2007 (pp. 1–41).
Pavlidi, D., Griffin, A., Puigt, M., & Mouchtaris, A. (2013). Real-time multiple sound source localization and counting using a circular microphone array. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 21(10), 2193–2206.
Google Scholar

Download references

Author information

Authors and Affiliations

Electronics and Communication Engineering Department, Francis Xavier Engineering College, Tirunelveli, Tamilnadu, 627003, India
M. Navaneetha Velammal
Electronics and Communication Engineering, Anna University, Chennai, Tamilnadu, 600025, India
P. Nirmal Kumar

Authors

M. Navaneetha Velammal
View author publications
You can also search for this author in PubMed Google Scholar
P. Nirmal Kumar
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Navaneetha Velammal.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Velammal, M.N., Kumar, P.N. “A speech recognizer” a tool to recognize the high clarity speech signal based on existing speech using ISCA. Analog Integr Circ Sig Process 98, 41–58 (2019). https://doi.org/10.1007/s10470-018-1275-5

Download citation

Received: 30 January 2018
Revised: 03 May 2018
Accepted: 24 July 2018
Published: 13 August 2018
Issue Date: 15 January 2019
DOI: https://doi.org/10.1007/s10470-018-1275-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

“A speech recognizer” a tool to recognize the high clarity speech signal based on existing speech using ISCA

Abstract

Access this article

Similar content being viewed by others

Hybridization of spectral filtering with particle swarm optimization for speech signal enhancement

Single-channel speech separation using combined EMD and speech-specific information

Analysis of Optimized Spectral Subtraction Method for Single Channel Speech Enhancement

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

“A speech recognizer” a tool to recognize the high clarity speech signal based on existing speech using ISCA

Abstract

Access this article

Similar content being viewed by others

Hybridization of spectral filtering with particle swarm optimization for speech signal enhancement

Single-channel speech separation using combined EMD and speech-specific information

Analysis of Optimized Spectral Subtraction Method for Single Channel Speech Enhancement

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation