Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement

OU Shifeng; SONG Peng; GAO Ying

doi:10.1049/cje.2018.09.009

OU Shifeng, SONG Peng, GAO Ying. Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement[J]. Chinese Journal of Electronics, 2018, 27(6): 1214-1220. DOI: 10.1049/cje.2018.09.009

Abstract

The estimation of noise Power spectral density (PSD) is a crucial issue for speech enhancement because it strongly affects the quality and intelligibility of the enhanced speech. Most existing noise PSD estimators employ Gaussian speech priors, which, however, have been shown to be inconsistent with real speech. We derive an effective solution to the problem of estimating noise PSD in the Minimum mean square error (MMSE) sense when the speech component is modeled by a Laplacian distribution. In addition, a soft decision technique, instead of hard Voice activity detection (VAD), is incorporated into our algorithm, which automatically makes the estimate unbiased without requiring bias compensation. The performance of the proposed method is tested with several objective and subjective measures under various stationary and nonstationary noise environments. The results confirm that our method achieves good performance for all the tested noise conditions and Signal-to-noise ratio (SNR) settings.
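To make the soft decision idea concrete, the following is a minimal sketch of one frame of noise PSD tracking in which a speech presence probability weights the update instead of a hard VAD switch. It is not the Laplacian-based estimator derived in the paper: for simplicity it assumes complex Gaussian models for both speech and noise, and the function name `soft_decision_noise_update` and the parameters `xi_h1`, `prior_h1`, and `alpha` (with their default values) are hypothetical choices made only for illustration.

```python
import numpy as np


def soft_decision_noise_update(noisy_power, prev_noise_psd,
                               xi_h1=10 ** (15 / 10), prior_h1=0.5, alpha=0.8):
    """One frame of soft-decision noise PSD tracking (Gaussian sketch).

    noisy_power    : |Y(k)|^2 for each frequency bin of the current frame
    prev_noise_psd : noise PSD estimate from the previous frame
    xi_h1          : assumed fixed a priori SNR under speech presence (illustrative)
    prior_h1       : assumed prior probability of speech presence (illustrative)
    alpha          : assumed recursive smoothing factor (illustrative)
    """
    noisy_power = np.asarray(noisy_power, dtype=float)
    prev_noise_psd = np.asarray(prev_noise_psd, dtype=float)

    # A posteriori SNR, computed against the previous noise PSD estimate.
    gamma = noisy_power / np.maximum(prev_noise_psd, 1e-12)

    # Likelihood ratio p(Y|H1) / p(Y|H0) under complex Gaussian models:
    # exp(gamma * xi / (1 + xi)) / (1 + xi). The exponent is clipped to
    # avoid floating-point overflow for very strong speech bins.
    exponent = np.minimum(gamma * xi_h1 / (1.0 + xi_h1), 50.0)
    ratio = (prior_h1 / (1.0 - prior_h1)) * np.exp(exponent) / (1.0 + xi_h1)

    # Posterior speech presence probability (the "soft decision").
    p_speech = ratio / (1.0 + ratio)

    # Soft-weighted noise periodogram: trust the observation when speech is
    # likely absent, fall back on the previous estimate when it is likely present.
    noise_periodogram = (1.0 - p_speech) * noisy_power + p_speech * prev_noise_psd

    # Recursive smoothing over time.
    new_noise_psd = alpha * prev_noise_psd + (1.0 - alpha) * noise_periodogram
    return new_noise_psd, p_speech
```

In this sketch a bin with very high power relative to the current noise estimate gets a speech presence probability near one, so the noise estimate is left almost unchanged, while a low-power bin pulls the estimate toward the observed periodogram. A hard VAD would instead make a binary choice between these two behaviours.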