Automatic speaker segmentation using multiple features and distance measures: a comparison of three approaches
File(s)ICME_2006_Margarita_Kotti_b.pdf (120.08 KB)
Accepted version
Author(s)
Kotti, Margarita
Martins, Luís Gustavo PM
Benetos, Emmanouil
Cardoso, Jaime S
Kotropoulos, Constantine
Type
Conference Paper
Abstract
This paper addresses the problem of unsupervised speaker change detection. Three systems based on the Bayesian Information Criterion (BIC) are tested. The first system investigates the AudioSpectrumCentroid and the AudioWaveformEnvelope features, implements a dynamic thresholding followed by a fusion scheme, and finally applies BIC. The second method is a real-time one that uses a metric-based approach employing the line spectral pairs and the BIC to validate a potential speaker change point. The third method consists of three modules. In the first module, a measure based on second-order statistics is used; in the second module, the Euclidean distance and T2 Hotelling statistic are applied; and in the third module, the BIC is utilized. The experiments are carried out on a dataset created by concatenating speakers from the TIMIT database, that is referred to as the TIMIT data set. A comparison between the performance of the three systems is made based on t-statistics. © 2006 IEEE.
Date Issued
2006-07
Citation
IEEE International Conference on Multimedia and Expo, 2006, pp.1101-1104
ISBN
1424403677
Publisher
IEEE
Start Page
1101
End Page
1104
Journal / Book Title
IEEE International Conference on Multimedia and Expo
Copyright Statement
© 2006 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Description
14.08.13 KB. Ok to add accepted version to Spiral. IEEE
Source
ICME 2006
Source Place
Ontario, Canada
Start Date
2006-07-09
Finish Date
2006-07-12
Coverage Spatial
Toronto, Canada