Modulation-domain multichannel Kalman filtering for speech enhancement
File(s)
accepted_double.pdf (1.01 MB)
Accepted version
Author(s)
Xue, Wei
Moore, Alastair
Brookes, DM
Naylor, Patrick
Type
Journal Article
Abstract
Compared with single-channel speech enhancement methods, multichannel methods can utilize spatial information to design optimal filters. Although some filters adaptively consider second-order signal statistics, the temporal evolution of the speech spectrum is usually neglected. By using linear prediction (LP) to model the interframe temporal evolution of speech, single-channel Kalman filtering (KF) based methods have been developed for speech enhancement. In this paper, we derive a multichannel KF (MKF) that jointly uses both interchannel spatial correlation and interframe temporal correlation for speech enhancement. We perform LP in the modulation domain and, by incorporating the spatial information, derive an optimal MKF gain in the short-time Fourier transform domain. We show that the proposed MKF reduces to the conventional multichannel Wiener filter if the LP information is discarded. Furthermore, we show that, under an appropriate assumption, the MKF is equivalent to a concatenation of the minimum variance distortionless response (MVDR) beamformer and a single-channel modulation-domain KF, and we therefore present an alternative implementation of the MKF. Experiments conducted on a public head-related impulse response database demonstrate the effectiveness of the proposed method.
Date Issued
2018-10-01
Date Acceptance
2018-06-04
Citation
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018, 26 (10), pp.1833-1847
URI
http://hdl.handle.net/10044/1/60712
URL
https://ieeexplore.ieee.org/document/8375666
DOI
https://www.dx.doi.org/10.1109/TASLP.2018.2845665
ISSN
2329-9290
Publisher
Association for Computing Machinery (ACM)
Start Page
1833
End Page
1847
Journal / Book Title
IEEE/ACM Transactions on Audio, Speech and Language Processing
Volume
26
Issue
10
Copyright Statement
© 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor
Engineering & Physical Science Research Council (EPSRC)
Identifier
https://ieeexplore.ieee.org/document/8375666
Grant Number
EP/M026698/1
Subjects
Science & Technology
Technology
Acoustics
Engineering, Electrical & Electronic
Engineering
Speech enhancement
microphone arrays
Kalman filtering
modulation domain
MVDR BEAMFORMER
NOISE
DEREVERBERATION
SINGLE
LCMV
Publication Status
Published
Date Publish Online
2018-06-08