Paper The following article is Open access

Content Classification With Electroglottograph

, and

Published under licence by IOP Publishing Ltd
, , Citation Pengfei Chen et al 2020 J. Phys.: Conf. Ser. 1544 012191 DOI 10.1088/1742-6596/1544/1/012191

1742-6596/1544/1/012191

Abstract

Electroglottograph(EGG) is a physiological signal collected from the throat which reflects the vocal cord movement. EGG signals can be still collected from patients without speaking ability or from the extremely noisy environments. Additionally, the trends of the vocal cord movement will be distinctive for long enough Chinese sentences with different contents. Therefore, it is valuable and possible to carry out the research of applying only the EGG signals for content classification or recognition. In this paper, a content classification method with EGG was proposed, which consists of an EGG feature extraction module and a classification network based on LSTM(Long Short-Term Memory) units. The EGG feature extraction module was composed of three parts: the voiced segments extraction, the feature extraction and the F0 smoothing. The classification network was made of a three-layer bidirectional LSTM encoder. This method achieved 91.12% accuracy on the validation set in 20-class content classification experiment, which provides the reference for further study in content classification and recognition with EGG signals.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.
10.1088/1742-6596/1544/1/012191