Laplacian Speech Model and Soft Decision Based MMSE Estimator for Noise Power Spectral Density in Speech Enhancement
-
Graphical Abstract
-
Abstract
The estimation of noise Power spectral density (PSD) is a very crucial issue for speech enhancement as a result of its significant effect on the quality and intelligibility of the enhanced speech. Most of the existing estimators for noise PSD try to employ Gaussian speech priors, which, however, have been proven inconsistent with the reality. We derived an effective solution to this problem of estimating noise PSD in the Minimum mean square error (MMSE) sense when the speech component is modeled by a Laplacian distribution. Meanwhile, the soft decision technique instead of the hard Voice activity detection (VAD) is evolved into our algorithm, which can automatically makes the estimation unbiased without requiring a bias compensation. The performance of the proposed method is tested by several objective and subjective measures under various stationary and nonstationary noise environments. The results confirm that our method achieves good performance for all the noise conditions and Signalnoise-ratio (SNR) settings.
-
-