
Binary classifier for identification of stammering instances in Hindi speech data

International Journal of Speech Technology

Abstract

This paper reports experiments on building a binary classifier that identifies stammering in Hindi speech data. We train several Sequential CNN models, varying parameters such as color, image size, and training data shape to tune classification performance. Our experimental pipeline converts speech samples into spectrograms using Librosa and trains the Sequential CNN classifier on the resulting image data using TensorFlow Lite. Our classification models achieve more than 95% accuracy on this task.
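The pipeline described in the abstract (speech sample, spectrogram, fixed-size image, Sequential CNN) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the 16 kHz sample rate, the mel scale, the 128 x 128 grayscale image size, and the layer layout are all hypothetical choices, since this page gives no such details. The sketch builds the model with `tf.keras` and does not show any TensorFlow Lite conversion step.

```python
import numpy as np


def load_mel_spectrogram_db(wav_path, sr=16000, n_mels=128):
    """Load a speech clip and return its mel spectrogram in decibels.

    The sample rate and mel-band count are assumed values.
    """
    import librosa  # imported lazily: only needed when reading real audio

    y, sr = librosa.load(wav_path, sr=sr)  # resample to a common rate
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)


def spectrogram_to_image(spec_db, size=(128, 128)):
    """Scale a dB spectrogram to [0, 1] and resize it to a fixed shape.

    Nearest-neighbour index sampling keeps this step NumPy-only; it stands
    in for whatever image export and resizing the real pipeline uses.
    """
    spec_db = np.asarray(spec_db, dtype=np.float32)
    lo, hi = spec_db.min(), spec_db.max()
    scaled = (spec_db - lo) / (hi - lo) if hi > lo else np.zeros_like(spec_db)
    rows = np.linspace(0, scaled.shape[0] - 1, size[0]).round().astype(int)
    cols = np.linspace(0, scaled.shape[1] - 1, size[1]).round().astype(int)
    image = scaled[np.ix_(rows, cols)]
    return image[..., np.newaxis]  # add a single grayscale channel


def build_binary_cnn(input_shape=(128, 128, 1)):
    """Small Sequential CNN with one sigmoid output (stammer vs. fluent).

    The layer sizes are illustrative, not taken from the paper.
    """
    import tensorflow as tf  # imported lazily: heavy dependency

    model = tf.keras.Sequential([
        tf.keras.layers.Input(shape=input_shape),
        tf.keras.layers.Conv2D(16, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Conv2D(32, 3, activation="relu"),
        tf.keras.layers.MaxPooling2D(),
        tf.keras.layers.Flatten(),
        tf.keras.layers.Dense(64, activation="relu"),
        tf.keras.layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model
```

In use, each clip would pass through `load_mel_spectrogram_db` and `spectrogram_to_image`, the resulting arrays would be stacked into a batch of shape `(n, 128, 128, 1)` with 0/1 labels, and the batch would be passed to `model.fit`. Changing `size` or the channel count corresponds to the kind of image-size and color adjustments the abstract mentions.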




Data availability

This article is part of an ongoing doctoral project. To avoid data breaches and protect intellectual property, the researchers plan to make the research dataset available (https://shivamdwivedi.com/resources) upon reasonable request once the thesis is complete.


Acknowledgements

First and foremost, we wish to express our profound gratitude to our research subjects. Their dedication and active involvement not only made the data collection drive a success but also enriched this research with invaluable speech data. Their unwavering support has been the bedrock upon which this work stands. Equally essential to the completion of this work was the guidance of Dr. Anil Thakur. His insights and direction have been instrumental in shaping our research. Additionally, we are deeply indebted to Dr. Sukomal Pal, whose discerning critiques and constructive feedback have been invaluable in refining our approach and processes. To all of you who have been part of this journey with us, we extend our heartfelt thanks.

Author information

Corresponding author

Correspondence to Shivam Dwivedi.

Ethics declarations

Disclosures

Participants were recruited on a voluntary basis, and none received monetary compensation for their involvement. The research project received no external funding; all research-related expenses were borne by the authors. The authors declare no conflict of interest pertaining to this research. No external entity was involved in the study's design, data collection, analysis, interpretation of results, or the decision to publish. The findings presented derive solely from data collected and analyzed by the authors, independent of any external influence.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.


About this article


Cite this article

Dwivedi, S., Ghosh, S. & Dwivedi, S. Binary classifier for identification of stammering instances in Hindi speech data. Int J Speech Technol 26, 765–774 (2023). https://doi.org/10.1007/s10772-023-10046-9



  • DOI: https://doi.org/10.1007/s10772-023-10046-9
