Speech enhancement in the modulation domain

File Description SizeFormat 
Wang-Y-2016-PhD-Thesis.pdfThesis8.15 MBAdobe PDFView/Open
Title: Speech enhancement in the modulation domain
Authors: Wang, Yu
Item Type: Thesis or dissertation
Abstract: The goal of a speech enhancement algorithm is to reduce or eliminate background noise without distorting the speech signal. Although speech enhancement is important for practical scenarios, it is a difficult task especially when the noisy speech signal is only available from a single channel. Although many single-channel speech algorithms have been proposed that can improve the Signal-to-Noise Ratio (SNR) of the noisy speech, in some cases dramatically, they also introduce speech distortion and spurious tonal artefacts known as musical noise. There has been evidence, both physiological and psychoacoustic, to support the significance of the modulation domain, i.e. the temporal modulation of the acoustic spectral components, to speech enhancement. In this thesis three methods for implementing single-channel speech enhancement in the modulation domain have been proposed. The goal in all three cases is to take advantage of prior knowledge about the temporal modulation of short-time spectral amplitudes. The first method is to post-process the output of a conventional single-channel speech enhancement algorithm using a modulation domain Kalman filter. The second method performs enhancement directly in the modulation domain based on the assumption that the temporal sequence of spectral amplitudes within each frequency bin lies within a low dimensional subspace. The third method uses a modulation-domain Kalman filter to perform enhancement using two alternative distribution families for the speech and noise amplitude prior distributions. The performance of the proposed enhancement algorithms is assessed by measuring the SNR and speech quality (using the Perceptual Evaluation of Speech Quality (PESQ) metric) of the enhanced speech. It is found that, for a range of noise types, the proposed algorithms give consistent improvements in both metrics.
Content Version: Open Access
Issue Date: Aug-2015
Date Awarded: Jan-2016
URI: http://hdl.handle.net/10044/1/43966
Supervisor: Brookes, Mike
Department: Electrical and Electronic Engineering
Publisher: Imperial College London
Qualification Level: Doctoral
Qualification Name: Doctor of Philosophy (PhD)
Appears in Collections:Electrical and Electronic Engineering PhD theses

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Creative Commonsx