On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement

Rehr, Robert; Gerkmann, Timo

doi:10.1109/TASLP.2017.2778151

Computer Science > Sound

arXiv:1703.05003 (cs)

[Submitted on 15 Mar 2017 (v1), last revised 16 Jan 2018 (this version, v2)]

Title:On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement

Authors:Robert Rehr, Timo Gerkmann

View PDF

Abstract:For enhancing noisy signals, machine-learning based single-channel speech enhancement schemes exploit prior knowledge about typical speech spectral structures. To ensure a good generalization and to meet requirements in terms of computational complexity and memory consumption, certain methods restrict themselves to learning speech spectral envelopes. We refer to these approaches as machine-learning spectral envelope (MLSE)-based approaches.
In this paper we show by means of theoretical and experimental analyses that for MLSE-based approaches, super-Gaussian priors allow for a reduction of noise between speech spectral harmonics which is not achievable using Gaussian estimators such as the Wiener filter. For the evaluation, we use a deep neural network (DNN)-based phoneme classifier and a low-rank nonnegative matrix factorization (NMF) framework as examples of MLSE-based approaches. A listening experiment and instrumental measures confirm that while super-Gaussian priors yield only moderate improvements for classic enhancement schemes, for MLSE-based approaches super-Gaussian priors clearly make an important difference and significantly outperform Gaussian priors.

Comments:	10 pages, 9 figures
Subjects:	Sound (cs.SD)
Cite as:	arXiv:1703.05003 [cs.SD]
	(or arXiv:1703.05003v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.1703.05003
Journal reference:	IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 2, pp. 357-366, Feb. 2018
Related DOI:	https://doi.org/10.1109/TASLP.2017.2778151

Submission history

From: Robert Rehr [view email]
[v1] Wed, 15 Mar 2017 08:33:38 UTC (3,114 KB)
[v2] Tue, 16 Jan 2018 11:16:40 UTC (3,553 KB)

Computer Science > Sound

Title:On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:On the Importance of Super-Gaussian Speech Priors for Machine-Learning Based Speech Enhancement

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators