Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. National Heart and Lung Institute
  4. National Heart and Lung Institute
  5. A deep learning method for pathological voice detection using convolutional deep belief networks
 
  • Details
A deep learning method for pathological voice detection using convolutional deep belief networks
OA Location
https://strathprints.strath.ac.uk/64290/
Author(s)
Wu, Huiyi
Soraghan, John
Lowit, Anja
Di-Caterina, Gaetano
Type
Conference Paper
Abstract
Automatically detecting pathological voice disorders such as vocal cord paralysis or Reinke’s edema is an important medical classification problem. While deep learning techniques have achieved significant progress in the speech recognition field, there has been less research work in the area of pathological voice disorders detection. A novel system for pathological voice detection using Convolutional Neural Network (CNN) as the basic architecture is presented in this work. The novel system uses spectrograms of normal and pathological speech recordings as the input to the network. Initially Convolutional Deep Belief Network (CDBN) are used to pre-train the weights of CNN system. This acts as a generative model to explore the structure of the input data using statistical methods. Then a CNN is trained using supervised back-propagation learning algorithm to fine tune the weights. Results show that a small amount of data can be used to achieve good results in classification with this deep learning approach. A performance analysis of the novel method is provided using real data from the Saarbrucken Voice database.
Date Acceptance
2018-09-01
Citation
Interspeech 2018, pp.446-450
URI
http://hdl.handle.net/10044/1/104141
URL
http://dx.doi.org/10.21437/interspeech.2018-1351
DOI
https://www.dx.doi.org/10.21437/interspeech.2018-1351
Publisher
ISCA
Start Page
446
End Page
450
Journal / Book Title
Interspeech 2018
Copyright Statement
© 2019 The Author(s).
Identifier
http://dx.doi.org/10.21437/interspeech.2018-1351
Source
Interspeech 2018
Publication Status
Published online
Start Date
2018-09-02
Finish Date
2018-09-06
Coverage Spatial
Hyderabad, India
Date Publish Online
2018-09-02
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback