Modulation-domain Kalman filtering for monaural blind speech denoising and dereverberation
File(s)Dionelis2019.pdf (2.61 MB)
Accepted version
Author(s)
Dionelis, Nikolaos
Brookes, David
Type
Journal Article
Abstract
We describe a monaural speech enhancement algorithm based on modulation-domain Kalman filtering to blindly track the time-frequency log-magnitude spectra of speech and reverberation. We propose an adaptive algorithm that performs blind joint denoising and dereverberation, while accounting for the inter-frame speech dynamics, by estimating the posterior distribution of the speech log-magnitude spectrum given the log-magnitude spectrum of the noisy reverberant speech. The Kalman filter update step models the non-linear relations between the speech, noise and reverberation log-spectra. The Kalman filtering algorithm uses a signal model that takes into account the reverberation parameters of the reverberation time, T60, and the direct-to-reverberant energy ratio (DRR) and also estimates and tracks the T60 and the DRR in every frequency bin to improve the estimation of the speech log-spectrum. The proposed algorithm is evaluated in terms of speech quality, speech intelligibility and dereverberation performance for a range of reverberation parameters and reverberant speech to noise ratios, in different noises, and is also compared to competing denoising and dereverberation techniques. Experimental results using noisy reverberant speech demonstrate the effectiveness of the enhancement algorithm.
Date Issued
2019-04
Date Acceptance
2019-01-06
Citation
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019, 27 (4), pp.799-214
ISSN
2329-9290
Publisher
Institute of Electrical and Electronics Engineers
Start Page
799
End Page
214
Journal / Book Title
IEEE/ACM Transactions on Audio, Speech and Language Processing
Volume
27
Issue
4
Copyright Statement
© 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Identifier
https://ieeexplore.ieee.org/document/8624447
Subjects
Science & Technology
Technology
Acoustics
Engineering, Electrical & Electronic
Engineering
Speech enhancement
dereverberation
Kalman filtering
minimum mean-square error (MMSE) estimation
WEIGHTED PREDICTION ERROR
ENHANCEMENT
NOISE
COMPENSATION
MODEL
RECOGNITION
ALGORITHM
MASKING
Publication Status
Published
Date Publish Online
2019-01-23