Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. Department of Surgery and Cancer
  4. Department of Surgery and Cancer
  5. A neural network approach to audio-assisted movie dialogue detection
 
  • Details
A neural network approach to audio-assisted movie dialogue detection
File(s)
NEUROCOMPUTING_Elsevier_2008_Margarita_Kotti.pdf (221.21 KB)
Accepted version
Author(s)
Kotti, Margarita
Benetos, Emmanouil
Kotropoulos, Constantine
Pitas, Ioannis
Type
Journal Article
Abstract
A novel framework for audio-assisted dialogue detection based on indicator functions and neural networks is investigated. An indicator function defines that an actor is present at a particular time instant. The cross-correlation function of a pair of indicator functions and the magnitude of the corresponding cross-power spectral density are fed as input to neural networks for dialogue detection. Several types of artificial neural networks, including multilayer perceptrons (MLPs), voted perceptrons, radial basis function networks, support vector machines, and particle swarm optimization-based MLPs are tested. Experiments are carried out to validate the feasibility of the aforementioned approach by using ground-truth indicator functions determined by human observers on six different movies. A total of 41 dialogue instances and another 20 non-dialogue instances are employed. The average detection accuracy achieved is high, ranging between 84.78 % ± 5.499 % and 91.43 % ± 4.239 %. © 2007 Elsevier B.V. All rights reserved.
Date Issued
2007-12
Citation
Neurocomputing, 2007, 71 (1-3), pp.157-166
URI
http://hdl.handle.net/10044/1/11712
DOI
https://www.dx.doi.org/10.1016/j.neucom.2007.08.006
ISSN
0925-2312
Publisher
Elsevier
Start Page
157
End Page
166
Journal / Book Title
Neurocomputing
Volume
71
Issue
1-3
Copyright Statement
© 2007 Elsevier B.V. All rights reserved. NOTICE: this is the author’s version of a work that was accepted for publication in Neurocomputing. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in NEUROCOMPUTING, Vol.:71, Issue:1-3, (2007) DOI: 10.1016/j.neucom.2007.08.006
License URL
http://www.rioxx.net/licenses/all-rights-reserved
Description
07.08.13 KB. Ok to add the accepted version to Spiral, Elsevier says ok while mandate is not enforced.
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback