ISCA Archive Interspeech 2007

Score fusion for articulatory feature detection

Brian M. Ore, Raymond E. Slyh

Articulatory Features (AFs) describe how the speech organs are used to produce speech sounds. Research has shown that incorporating this information into speech recognizers can improve system performance. This paper considers English AF detection using Gaussian Mixture Models (GMMs) and Multi-Layer Perceptrons (MLPs). The scores from the GMM- and MLP-based detectors are fused using a second MLP, resulting in an average reduction of 8.24% in equal error rate compared to the individual systems. These detector outputs are used to form the feature set for a Hidden Markov Model (HMM) phone recognizer. Monophone models built on the proposed feature set are shown to perform comparably to triphone models trained on Mel-Frequency Cepstral Coefficients (MFCCs).
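
The equal error rate (EER) is the metric the abstract uses to compare detectors. As a rough illustration only, a minimal sketch of one common way to estimate it from per-frame detector scores follows. The function name `equal_error_rate`, the threshold sweep over observed scores, and the convention that a higher score means "feature present" are assumptions made for this example; the paper's own scoring procedure is not described on this page.

```python
import numpy as np

def equal_error_rate(scores, labels):
    """Estimate the EER of a binary detector (illustrative sketch).

    scores: detector outputs, higher meaning "feature present" (assumed).
    labels: 1/True where the feature is truly present, 0/False otherwise.
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels, dtype=bool)
    # Sweep every observed score as a candidate decision threshold.
    thresholds = np.unique(scores)
    # False rejection rate: present frames scored below the threshold.
    frr = np.array([(scores[labels] < t).mean() for t in thresholds])
    # False acceptance rate: absent frames scored at or above the threshold.
    far = np.array([(scores[~labels] >= t).mean() for t in thresholds])
    # The EER is taken where the two error rates are closest to equal.
    i = np.argmin(np.abs(far - frr))
    return (far[i] + frr[i]) / 2.0

# Toy data: one absent frame scores higher than one present frame.
toy_scores = [0.1, 0.6, 0.4, 0.9]
toy_labels = [0, 0, 1, 1]
eer = equal_error_rate(toy_scores, toy_labels)
```

Under this sketch, the same function could be applied to the scores of a GMM detector, an MLP detector, and a fused detector to compare them on equal footing. With finitely many frames the two error curves may not cross exactly, which is why the midpoint at the closest threshold is returned.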


doi: 10.21437/Interspeech.2007-514

Cite as: Ore, B.M., Slyh, R.E. (2007) Score fusion for articulatory feature detection. Proc. Interspeech 2007, 1845-1848, doi: 10.21437/Interspeech.2007-514

@inproceedings{ore07_interspeech,
  author={Brian M. Ore and Raymond E. Slyh},
  title={{Score fusion for articulatory feature detection}},
  year=2007,
  booktitle={Proc. Interspeech 2007},
  pages={1845--1848},
  doi={10.21437/Interspeech.2007-514}
}