A Weighted STOI Intelligibility Metric Based On Mutual Information
File(s)
Paper_v37.pdf (545.82 KB)
Accepted version
Author(s)
Lightburn, L
Brookes, D
Type
Conference Paper
Abstract
It is known that the information required for the intelligibility of a speech signal is distributed non-uniformly in time. In this paper we propose WSTOI, a modified version of STOI, a speech intelligibility metric. With WSTOI, the contribution of each time-frequency cell is weighted by an estimate of its intelligibility content. This estimate is equal to the mutual information between two hypothetical signals at either end of a simplified model of human communication. Listening tests show that the modification improves the prediction accuracy of STOI at all performance levels on both long and short utterances. An improvement was observed across all tested noise types and suppression algorithms.
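The weighting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `weighted_cell_score`, the normalisation by the sum of the weights, and all numeric values are assumptions made for illustration. The per-cell scores and the weights (which in WSTOI would come from the mutual-information estimate) are treated here as given inputs.

```python
# Illustrative sketch only: combines per-cell scores on a (bands x frames)
# grid into one number using per-cell weights. Normalising by the weight sum
# is an assumption, not a detail taken from the paper.

def weighted_cell_score(cell_scores, cell_weights):
    """Return the weighted average of per-cell scores."""
    if len(cell_scores) != len(cell_weights):
        raise ValueError("scores and weights must have the same number of bands")
    numerator = 0.0
    denominator = 0.0
    for score_row, weight_row in zip(cell_scores, cell_weights):
        if len(score_row) != len(weight_row):
            raise ValueError("scores and weights must have the same number of frames")
        for score, weight in zip(score_row, weight_row):
            numerator += weight * score
            denominator += weight
    if denominator <= 0:
        raise ValueError("weights must sum to a positive value")
    return numerator / denominator


# Hypothetical 2-band x 3-frame grid of per-cell scores.
scores = [[0.9, 0.5, 0.1],
          [0.8, 0.4, 0.2]]

# Uniform weights reduce the result to a plain average over all cells.
uniform = [[1.0, 1.0, 1.0],
           [1.0, 1.0, 1.0]]

# Hypothetical weights that emphasise the earlier frames.
informative = [[3.0, 1.0, 0.0],
               [2.0, 1.0, 0.0]]

unweighted_result = weighted_cell_score(scores, uniform)
weighted_result = weighted_cell_score(scores, informative)
```

With uniform weights the function returns the plain mean of the cell scores; with non-uniform weights, cells given more weight dominate the result, which is the effect the abstract describes.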
Date Issued
2016-03-25
Date Acceptance
2015-12-21
Citation
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2016, pp. 5365-5369
Publisher
IEEE
Start Page
5365
End Page
5369
Journal / Book Title
2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Copyright Statement
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor
Engineering and Physical Sciences Research Council (EPSRC)
Grant Number
EP/M026698/1
Source
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Publication Status
Published
Start Date
2016-03-20
Finish Date
2016-03-25
Coverage Spatial
Shanghai