Estimation of room acoustic parameters: the ACE challenge
File(s)ACE_journ_final_no_header_export.pdf (4.56 MB)
Accepted version
Author(s)
Eaton, DJ
Gaubitch, ND
Moore, AH
Naylor, PA
Type
Journal Article
Abstract
Reverberation Time (T60) and Direct-to-Reverberant Ratio (DRR) are important parameters which together can characterize sound captured by microphones in non-anechoic rooms. These parameters are important in speech processing applications such as speech recognition and dereverberation. The values of T60 and DRR can be estimated directly from the Acoustic Impulse Response (AIR) of the room. In practice, the AIR is
not normally available, in which case these parameters must be estimated blindly from the observed speech in the microphone signal. The Acoustic Characterization of Environments (ACE) Challenge aimed to determine the state-of-the-art in blind acoustic parameter estimation and also to stimulate research in this area. A summary of the ACE Challenge, and the corpus
used in the challenge is presented together with an analysis of the results. Existing algorithms were submitted alongside novel contributions, the comparative results for which are presented in this paper. The challenge showed that T60 estimation is a mature field where analytical approaches dominate whilst DRR estimation is a less mature field where machine learning approaches are currently more successful.
not normally available, in which case these parameters must be estimated blindly from the observed speech in the microphone signal. The Acoustic Characterization of Environments (ACE) Challenge aimed to determine the state-of-the-art in blind acoustic parameter estimation and also to stimulate research in this area. A summary of the ACE Challenge, and the corpus
used in the challenge is presented together with an analysis of the results. Existing algorithms were submitted alongside novel contributions, the comparative results for which are presented in this paper. The challenge showed that T60 estimation is a mature field where analytical approaches dominate whilst DRR estimation is a less mature field where machine learning approaches are currently more successful.
Date Issued
2016-06-07
Date Acceptance
2016-05-27
Citation
IEEE Transactions on Audio Speech and Language Processing, 2016, 24 (10), pp.1681-1693
ISSN
2329-9290
Publisher
Institute of Electrical and Electronics Engineers
Start Page
1681
End Page
1693
Journal / Book Title
IEEE Transactions on Audio Speech and Language Processing
Volume
24
Issue
10
Copyright Statement
© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor
Commission of the European Communities
Engineering & Physical Science Research Council (EPSRC)
Grant Number
609465
n/a
Publication Status
Published