47
IRUS TotalDownloads
Model-Based Speech Enhancement in the Modulation Domain
File | Description | Size | Format | |
---|---|---|---|---|
![]() | Accepted version | 1.1 MB | Adobe PDF | View/Open |
Title: | Model-Based Speech Enhancement in the Modulation Domain |
Authors: | Wang, Y Brookes, DM |
Item Type: | Journal Article |
Abstract: | This paper presents an algorithm for modulationdomain speech enhancement using a Kalman filter. The proposed estimator jointly models the estimated dynamics of the spectral amplitudes of speech and noise to obtain an MMSE estimation of the speech amplitude spectrum with the assumption that the speech and noise are additive in the complex domain. In order to include the dynamics of noise amplitudes with those of speech amplitudes, we propose a statistical “Gaussring” model that comprises a mixture of Gaussians whose centres lie in a circle on the complex plane. The performance of the proposed algorithm is evaluated using the perceptual evaluation of speech quality (PESQ) measure, segmental SNR (segSNR) measure and shorttime objective intelligibility (STOI) measure. For speech quality measures, the proposed algorithm is shown to give a consistent improvement over a wide range of SNRs when compared to competitive algorithms. Speech recognition experiments also show that the Gaussring model based algorithm performs well for two types of noise. |
Issue Date: | 25-Dec-2017 |
Date of Acceptance: | 18-Dec-2017 |
URI: | http://hdl.handle.net/10044/1/55599 |
DOI: | https://dx.doi.org/10.1109/TASLP.2017.2786863 |
ISSN: | 2329-9304 |
Publisher: | Institute of Electrical and Electronics Engineers |
Start Page: | 580 |
End Page: | 594 |
Journal / Book Title: | IEEE/ACM Transactions on Audio, Speech and Language Processing |
Volume: | 26 |
Issue: | 3 |
Copyright Statement: | © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. |
Keywords: | Science & Technology Technology Acoustics Engineering, Electrical & Electronic Engineering Speech enhancement modulation-domain Kalman filter statistical modelling minimum mean-square error (MMSE) estimator SPECTRAL AMPLITUDE ESTIMATOR AUDIO SIGNAL ENHANCEMENT SQUARE ERROR ESTIMATION NOISE INTELLIGIBILITY SUPPRESSION RECEPTION CHANNELS QUALITY PRIORS |
Publication Status: | Published |
Appears in Collections: | Electrical and Electronic Engineering Faculty of Engineering |