Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • About
  • Communities & Collections
  • Advanced Search
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. Department of Brain Sciences
  4. Uncovering the potential for a weakly supervised end-to-end model in recognising speech from patient with post-stroke aphasia
 
  • Details
Uncovering the potential for a weakly supervised end-to-end model in recognising speech from patient with post-stroke aphasia
File(s)
2023.clinicalnlp-1.24.pdf (350.49 KB)
Published version
Author(s)
Sanguedolce, G
Naylor, PA
Geranmayeh, F
Type
Conference Paper
Abstract
Post-stroke speech and language deficits (aphasia) significantly impact patients' quality of life. Many with mild symptoms remain undiagnosed, and the majority do not receive the intensive doses of therapy recommended, due to healthcare costs and/or inadequate services. Automatic Speech Recognition (ASR) may help overcome these difficulties by improving diagnostic rates and providing feedback during tailored therapy. However, its performance is often unsatisfactory due to the high variability in speech errors and scarcity of training datasets. This study assessed the performance of Whisper, a recently released end-to-end model, in patients with post-stroke aphasia (PWA). We tuned its hyperparameters to achieve the lowest word error rate (WER) on aphasic speech. WER was significantly higher in PWA compared to age-matched controls (10.3% vs 38.5%, p < 0.001). We demonstrated that worse WER was related to the more severe aphasia as measured by expressive (overt naming, and spontaneous speech production) and receptive (written and spoken comprehension) language assessments. Stroke lesion size did not affect the performance of Whisper. Linear mixed models accounting for demographic factors, therapy duration, and time since stroke, confirmed worse Whisper performance with left hemispheric frontal lesions. We discuss the implications of these findings for how future ASR can be improved in PWA.
Date Issued
2023-07-14
Date Acceptance
2023-07-01
Citation
Proceedings of the 5th Clinical Natural Language Processing Workshop, 2023, pp.182-190
URI
http://hdl.handle.net/10044/1/107750
DOI
https://www.dx.doi.org/10.18653/v1/2023.clinicalnlp-1.24
Publisher
Association for Computational Linguistics
Start Page
182
End Page
190
Journal / Book Title
Proceedings of the 5th Clinical Natural Language Processing Workshop
Copyright Statement
©2023 Association for Computational Linguistics.
Source
5th Clinical Natural Language Processing Workshop
Publication Status
Published
Start Date
2023-07-14
Coverage Spatial
Toronto, Canada
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback