Elsevier

Journal of Voice

Volume 29, Issue 5, September 2015, Pages 519-529
Journal of Voice

An Examination of the Relationship Between Electroglottographic Contact Quotient, Electroglottographic Decontacting Phase Profile, and Acoustical Spectral Moments

Portions of this manuscript were presented at the 43rd Annual Symposium: Care of the Professional Voice, Philadelphia, PA. May 29, 2014.
https://doi.org/10.1016/j.jvoice.2014.10.016Get rights and content

Summary

Objectives

To date, only a few studies have examined the possible relationship between electroglottographic (EGG) data and spectral characteristics of the voice. This study examined the possible association between EGG signal data (contact quotient [CQ] and decontacting phase profile) and spectral moments of the acoustic signal (spectral mean, spectral standard deviation (SD), spectral skewness, and spectral kurtosis). Furthermore, the possible effects of gender on these measurements were analyzed.

Methods

Sustained vowel /ɑ/ productions were obtained from 48 normophonic individuals (24 adult males and 24 adult females). The central 1-second portions of the acoustic vowel samples were analyzed for spectral moments, and the EGG signal was analyzed for CQ (CQEGG), fundamental frequency (F0), and decontacting phase profile.

Results

Across all subjects, the spectral characteristics of the voice (in particular, spectral SD, skewness, and kurtosis) are significantly related to changes in the relative duration of vocal fold contact (as measured via CQEGG). In addition, significant effects of the profile of the EGG decontacting phase (ie, concave down/“knee” vs concave up/“no knee”) on spectral SD were also observed, as well as a strong trend for decontacting phase profile to influence the spectral mean.

Discussion

Although the degree of vocal fold contact and differences in decontacting phase profile may have an influence on the spectral characteristics of the acoustic voice signal, the strength of correlations between CQEGG values and measures of spectral moments only accounted for approximately 13–16% of the variation in spectral distribution characteristics. These results stress the importance of the transformative role of the supraglottal vocal tract in producing an acoustic output that maintains some of the characteristics of the glottal source, but which modifies the source characteristics in ways not completely accounted for by single parameters such as CQEGG or EGG profile.

Introduction

When analyzing vocal fold movement during phonation, adult males and females have been described as exhibiting physiological differences throughout the glottal cycle.1 During the contact phase of vibration, men have been reported to approximate more complete glottal closure than women,2, 3 corresponding to a greater relative duration of vocal fold closure in men versus women.3, 4 In contrast to observations for adult male speakers, multiple researchers have noted that a majority of adult females produce comfortable pitch and loudness phonation with incomplete vocal fold closure (posterior glottal gap) during modal register phonation,5, 6, 7, 8 resulting in a relative increase in the duration of the open phase and a possible tendency for a subtle breathy voice quality in some women. In addition, adult females have been described as having less abrupt contacting and decontacting behavior than adult males.9

One method for objectively analyzing vocal fold contact during the phonatory cycle is electroglottography (EGG).10, 11 EGG is a noninvasive method of measuring vocal fold contact during voice production4 that provides a physiological correlate of vocal fold vibration.12 To obtain the EGG signal, two electrodes are placed on either side of the neck at the level of the vocal folds, and a small high-frequency current is passed between the electrodes.4 When the current is transmitted during vocal fold vibration, the EGG signal varies in response to the fluctuations in electrical impedance that are induced by changes in the contact area of the vocal folds.13 The resulting EGG waveform can be analyzed for contact quotient (CQEGG—the relative duration of vocal fold contact during the phonatory cycle14). In addition, modeling studies have shown that a description of the quality of vocal fold contact during the contacting and decontacting phases may be made from the EGG waveform.15, 16

Previous research has described key characteristics of adult male versus female EGG waveforms. As expected, men have been observed to produce greater CQEGGs than women3, 17 although this observation may depend on the method used for computation of the CQEGG.18 In addition, description of the EGG waveform profile has indicated that many adult male EGG waveforms are characterized as having a concave-down profile or knee in the decontacting phase1, 2, 19 that corresponds to a bulging or skirting motion of the vocal fold(s) in the vertical dimension,1 resulting in a tendency for increased vocal fold contact per glottal cycle.2 Although the knee tends to occur in the majority of adult male EGG waveforms, the presence of a decontacting knee has also been reported in a smaller percentage of adult female EGG waveforms at habitual speaking pitch/frequency levels17, 20, 21 (eg, Awan and Awan17 recently reported that the knee was present in 76% of the males and 36% of the females that they studied).

In humans, most of the acoustic energy is generated by the cessation of the transglottal airflow.22 The relative duration of the closed phase during a glottal cycle has a direct influence on the shape of the glottal airflow pulse and hence the generation of stronger high-frequency partials in the radiated sound source spectrum.23, 24 Therefore, it may be expected that differences in the relative duration and the quality or type of vocal fold contact during the glottal cycle (as discussed in the aforementioned literature review regarding adult male vs female EGG waveforms) will have an effect on the spectral characteristics of the acoustic voice signal. Holmberg et al25 examined the relationships between aerodynamic, acoustic, and EGG measures of voice in consonant-vowel syllable and vowel production. Results showed weak but significant negative relationships between EGG adduction quotient and harmonic-to-harmonic (H1-H2) and harmonic-to-formant (H1-F1 and H1-F3) amplitude differences, whereas weak positive correlations were observed between H1-F3 noise and F1-F3 noise. These authors hypothesized that gradual closing movements of a somewhat abducted membranous portion of the vocal folds would result in relatively sinusoidal glottal waveforms and small adduction quotients and that the corresponding acoustic spectrum would be characterized by a first harmonic with relatively high amplitude, a steep overall spectral slope, and an attenuated F1 peak amplitude (because of a relatively wide F1 bandwidth). However, it was reported that strong relationships (r's > 0.70) between EGG adduction quotients and the aforementioned spectral characteristics were observed in ≤35% of the subjects. Winkler et al26 examined the effect of aging on EGG, acoustic, and perceptual analyses of the voice. Results indicated a characteristic spectral distribution in the /ɑ/ vowel of male voices. Long-term average spectra (LTAS) of the male voices perceived as old had relatively little harmonic energy between 2 and 4 kHz, whereas younger male voices showed distinct spectral harmonics up to 4 kHz. The spectral characteristics for the young males coincided with EGG waveforms that showed a steep increase of vocal fold contact in the closing and a more extended opening phase with the instant of glottal opening clearly visible in the EGG signal as a knee. In contrast, the EGG waveforms for the older males were nearly sinusoidal with reduced steepness of the increasing signal slope. Bestebreurtje and Schutte27 examined EGG and spectral characteristics of the belting singing style in a single subject study. Results indicated that the loud bright sound of the belting style is achieved by enhanced higher harmonics in the acoustic spectrum and a corresponding raising of the CQEGG above 52% (the authors speculated that this was an apparent threshold value for belting).

In addition to the examination of isolated spectral harmonic amplitudes, one of the methods that may be used to summarize the spectral characteristics of the acoustic voice signal is by examining the spectral moments of the LTAS.28 In the computation of spectral moments, the spectral values are treated as a random probability distribution, with the first spectral moment being the spectral mean (a measure of the spectrum's central tendency in hertz; the center of gravity); the second spectral moment the standard deviation (SD; a measure linked with the variance around first moment); the third spectral moment being skewness (a dimensionless measure reflecting the average symmetry of the spectral distribution; positive skewness coincides with a higher concentration of energy in the lower spectral regions and negative vice versa); and the fourth spectral moment being kurtosis (a dimensionless measure of the shape of the spectrum, with positive values representative of a compact spectrum and energy concentrated within a very small frequency range versus low or negative values indicative of a flattened spectrum; may be interpreted via peakedness (width of the peak), as well as tail weight (heavy vs light tails), and the presence of infrequent extreme deviations).29, 30, 31, 32, 33 Spectral moments analyses have had various applications to speech/voice analysis and have been used to assess the impact of vocal effort on voice characteristics31; to distinguish between typical versus disordered voice subjects during continuous speech34; to evaluate the impact of unilateral vocal fold paralysis on pre versus post-therapy voice characteristics35; to provide a voice treatment outcomes measure36; to aid in the differential diagnosis of adductor spasmodic dysphonia versus muscle tension dysphonia patients37; to discriminate between hypokinetic dysarthric and neurologically normal subjects38; and to assess the effects of cochlear implant on speech characteristics.39

Investigation of the possible relationships between physiological measures of voice (such as EGG) and the acoustic output of the speech/voice mechanism is valuable in our attempts to gain a deeper understanding of normal voice production.25 Because acoustic measures are readily available, low cost, noninvasive, and familiar to researchers and clinicians alike, they are probably the most frequently used “objective” measures of voice. Research that can establish the relationship(s) between acoustic and physiological measures of voice are important in demonstrating the validity of these methods. Because EGG is a noninvasive (and hence easily obtained) physiological correlate of vocal fold vibration, it is important to relate acoustic measures (spectral or time based) to data from the EGG waveform and thus possibly gain a better understanding of the causal relationship between the quality of vocal fold vibration and the resulting sound output. Spectral-based measures (such as spectral moments) are of particular interest because (1) they have been reported to be effective in documenting voice types that are problematic for traditional perturbation measures (eg, strained voice) and (2) (although not the focus of this study) they have been shown to be applicable to the analysis of continuous speech samples. Therefore, the purpose of this study was to examine the possible association between EGG characteristics (including CQEGG and EGG profile) and spectral characteristics referred to as spectral moments from corresponding acoustic voice signals. In addition, possible gender differences in EGG and spectral characteristics of the voice were also investigated.

Section snippets

Methodology

The Bloomsburg University of Pennsylvania Institutional Review Board approved the methodology used in this study.

Results

All statistical analyses were computed using SPSS version. 16.0 (SPSS Inc., Chicago, IL). Analyses were focused on the second trial of the sustained vowel /ɑ/ because (1) preliminary analyses had indicated no significant difference between trials for measures of CQEGG, F0, or the various spectral moments and (2) ratings of EGG profile (knee vs no knee) were obtained from the second trial productions. Because the shift in harmonics due to subject-to-subject differences in F0 may influence

Discussion

The results of this study show that, when evaluated across gender groups, the spectral characteristics of the voice (in particular, spectral SD, skewness, and kurtosis) are significantly related to changes in the relative duration of vocal fold contact (as measured via CQEGG). However, when analyzed within gender subgroups, these correlations were observed to be weaker and nonsignificant. In addition, significant effects of the profile of the EGG decontacting phase (ie, concave down/“knee” vs

Conclusions

The results of this study clearly show that degree of vocal fold contact and differences in decontacting phase profile have an effect on the spectral characteristics of the voice. However, the strength of correlations between CQEGG values and measures of spectral moments tended to be relatively weak and only accounted for approximately 13–16% of the variation in spectral distribution characteristics. These findings are consistent with the relatively weak relationships between CQEGG and spectral

Acknowledgment

This study was partially supported by the European Social Fund project OP VK CZ.1.07/2.3.00/30.0004 ‘POST-UP’ (CTH).

References (64)

  • K. Tanner et al.

    Spectral moments of the long-term average spectrum: sensitive indices of voice change after therapy?

    J Voice

    (2005)
  • H. Kraemer et al.

    Measures of clinical significance

    J Am Acad Child Adolesc Psychiatry

    (2003)
  • F. Alipour et al.

    Measures of spectral slope using an excised larynx model

    J Voice

    (2012)
  • S.N. Awan et al.

    Effects of vocal intensity and vowel type on cepstral analysis of voice

    J Voice

    (2012)
  • C.T. Herbst et al.

    Using electroglottographic real-time feedback to control posterior glottal adduction during phonation

    J Voice

    (2010)
  • I. Titze

    Physiologic and acoustic differences between male and female voices

    J Acoust Soc Am

    (1989)
  • Y. Chen et al.

    Electroglottographic evaluation of gender and vowel effects during modal and vocal fry phonation

    J Speech, Lang Hear Res

    (2002)
  • H.M. Hanson et al.

    Glottal characteristics of male speakers: acoustic correlates and comparison with female data

    J Acoust Soc Am

    (1999)
  • M.B. Higgins et al.

    Gender differences in vocal fold contact computed from electroglottographic signals: the influence of measurement criteria

    J Acoust Soc Am

    (2002)
  • H.M. Hanson

    Glottal characteristics of female speakers: acoustic correlates

    J Acoust Soc Am

    (1997)
  • E.B. Holmberg et al.

    Glottal airflow and transglottal air pressure measurements for male and female speakers in soft, normal, and loud voice

    J Acoust Soc Am

    (1988)
  • P. Fabre

    Un procédé électrique percuntané d’inscription de l'accolement glottique au cours de la phonation: glottographie de haute fréquence; premiers résultats

    Bull Acad Natl Med

    (1957)
  • R. Scherer et al.

    Electroglottography and direct measurement of vocal fold contact area

  • R. Orlikoff

    Assessment of the dynamics of vocal fold contact from the electroglottogram: data from normal male subjects

    J Speech Hear Res

    (1991)
  • M. Rothenberg et al.

    Monitoring vocal fold abduction through vocal fold contact area

    J Speech Hear Res

    (1988)
  • D.G. Childers et al.

    A model for vocal fold vibratory motion, contact area, and the electroglottogram

    J Acoust Soc Am

    (1986)
  • C. Herbst et al.

    A comparison of different methods to measure the EGG contact quotient

    Logoped Phoniatr Vocol

    (2006)
  • M.P. Robb et al.

    Gender comparisons of children’s vocal fold contact behavior

    J Acoust Soc Am

    (1990)
  • C.T. Herbst et al.

    Electroglottographic wavegrams: a technique for visualizing vocal fold dynamics noninvasively

    J Acoust Soc Am

    (2010)
  • L.A. Cecconello

    Qualitative and quantitative aspects of the electroglottography in singers with different glottic contact patterns

  • G. Fant

    Quarterly progress and status report: glottal source and excitation analysis

    STL-QPSR

    (1979)
  • B. Doval et al.

    The spectrum of glottal flow models

    Acta Acustica

    (2006)
  • Cited by (9)

    • Automated Quantification of Inflection Events in The Electroglottographic Signal

      2023, Journal of Voice
      Citation Excerpt :

      Additionally, Awan et al.6 discussed the relationship between the presence and absence of this knee and the waveform spectrum. Although many authors refer to this knee during the decontacting phase [6,19,28,31–34], to our knowledge, no quantitative method to describe this kind of event has been proposed to date. A strong relationship has been suggested between the concave-down knee and the minimum of the first derivative of the EGG signal (dEGG) [8,23,35–37].

    • Electroglottography – An Update

      2020, Journal of Voice
    • Automated Electroglottographic Inflection Events Detection. A Pilot Study

      2016, Journal of Voice
      Citation Excerpt :

      There is an agreement in the literature that certain aspects of the contacting and the decontacting phases of the vocal folds can be quickly and easily described by electroglottographic (EGG) signal. Since 1957,1 electroglottography has been widely used in the clinic and in voice research considering its main quantitative parameters: contact quotient,2–12 open quotient,13–15 and fundamental frequency extraction.7,16–20 Throughout the years, several authors have suggested different approaches to obtain these parameters.4,21–25

    • Relationship between the Electroglottographic Signal and Vocal Fold Contact Area

      2016, Journal of Voice
      Citation Excerpt :

      Because of its noninvasive nature, electroglottography is a popular method for assessing VF vibration in vivo. The EGG signal can be subjected to either qualitative analysis, such as assessing the presence of a “knee” in the EGG waveform as a putative indicator of VF bulging,52 or to quantitative analysis. A widely used quantitative parameter derived from the EGG signal is the EGG contact quotient CQEGG.

    View all citing articles on Scopus
    View full text