Model-Based Speech Enhancement in the Modulation Domain

File: IEEEtran_Gaussring_Final.pdf
Description: Accepted version
Size: 1.1 MB
Format: Adobe PDF
Title: Model-Based Speech Enhancement in the Modulation Domain
Authors: Wang, Y
Brookes, DM
Item Type: Journal Article
Abstract: This paper presents an algorithm for modulation-domain speech enhancement using a Kalman filter. The proposed estimator jointly models the estimated dynamics of the spectral amplitudes of speech and noise to obtain an MMSE estimate of the speech amplitude spectrum, under the assumption that the speech and noise are additive in the complex domain. In order to include the dynamics of noise amplitudes with those of speech amplitudes, we propose a statistical “Gaussring” model that comprises a mixture of Gaussians whose centres lie on a circle in the complex plane. The performance of the proposed algorithm is evaluated using the perceptual evaluation of speech quality (PESQ) measure, the segmental SNR (segSNR) measure and the short-time objective intelligibility (STOI) measure. For speech quality measures, the proposed algorithm is shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms. Speech recognition experiments also show that the Gaussring-model-based algorithm performs well for two types of noise.
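As a rough illustration of the "Gaussring" idea described in the abstract, the following sketch draws samples from a mixture of Gaussians whose centres lie on a circle in the complex plane. It is not the authors' implementation: the equal mixture weights, the equally spaced centres, the isotropic components, and the names `gaussring_centres`, `sample_gaussring`, `radius`, `sigma` and `num_components` are all assumptions made for this example.

```python
import cmath
import math
import random


def gaussring_centres(radius, num_components):
    """Return centres equally spaced on a circle (spacing is an assumption)."""
    return [cmath.rect(radius, 2.0 * math.pi * k / num_components)
            for k in range(num_components)]


def sample_gaussring(radius, sigma, num_components, num_samples, rng):
    """Draw samples from an equal-weight mixture of isotropic complex Gaussians.

    sigma is the standard deviation of the real and imaginary parts separately.
    Equal weights and isotropic components are assumptions of this sketch.
    """
    centres = gaussring_centres(radius, num_components)
    samples = []
    for _ in range(num_samples):
        centre = rng.choice(centres)  # pick one mixture component uniformly
        noise = complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
        samples.append(centre + noise)
    return samples


rng = random.Random(0)  # fixed seed so the run is repeatable
samples = sample_gaussring(radius=1.0, sigma=0.1, num_components=8,
                           num_samples=5000, rng=rng)
mean_amplitude = sum(abs(z) for z in samples) / len(samples)
```

In this sketch the sample amplitudes concentrate near `radius` while the phases are spread around the circle by the choice of component.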
Issue Date: 25-Dec-2017
Date of Acceptance: 18-Dec-2017
URI: http://hdl.handle.net/10044/1/55599
DOI: https://dx.doi.org/10.1109/TASLP.2017.2786863
ISSN: 2329-9304
Publisher: Institute of Electrical and Electronics Engineers
Start Page: 580
End Page: 594
Journal / Book Title: IEEE/ACM Transactions on Audio, Speech and Language Processing
Volume: 26
Issue: 3
Copyright Statement: © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Keywords: Science & Technology
Technology
Acoustics
Engineering, Electrical & Electronic
Engineering
Speech enhancement
modulation-domain Kalman filter
statistical modelling
minimum mean-square error (MMSE) estimator
SPECTRAL AMPLITUDE ESTIMATOR
AUDIO SIGNAL ENHANCEMENT
SQUARE ERROR ESTIMATION
NOISE
INTELLIGIBILITY
SUPPRESSION
RECEPTION
CHANNELS
QUALITY
PRIORS
Publication Status: Published
Appears in Collections: Electrical and Electronic Engineering
Faculty of Engineering