Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. Faculty of Medicine
  4. Variational denoising autoencoders and least-squares policy iteration for statistical dialogue managers
 
  • Details
Variational denoising autoencoders and least-squares policy iteration for statistical dialogue managers
File(s)
SigProcLetters2020_Kotti.pdf (278.38 KB)
Accepted version
Author(s)
Kotti, Margarita
Diakoloukas, Vassilios
Lygerakis, Fotios
Lagoudakis, Michail
Type
Journal Article
Abstract
The use of Reinforcement Learning (RL) approaches for dialogue policy optimization has been the new trend for dialogue management systems. Several methods have been proposed, which are trained on dialogue data to provide optimal system response. However, most of these approaches exhibit performance degradation in the presence of noise, poor scalability to other domains, as well as performance instabilities. To overcome these problems, we propose a novel approach based on the incremental, sample-efficient Least-Squares Policy Iteration (LSPI) algorithm, which is trained on compact, fixed-size dialogue state encodings, obtained from deep Variational Denoising Autoencoders (VDAE). The proposed scheme exhibits stable and noise-robust performance, which significantly outperforms the current state-of-the-art, even in mismatched noise environments.
Date Issued
2020-05-28
Date Acceptance
2020-05-18
Citation
IEEE Signal Processing Letters, 2020, 27, pp.960-964
URI
http://hdl.handle.net/10044/1/80679
URL
https://ieeexplore.ieee.org/document/9103219
DOI
https://www.dx.doi.org/10.1109/LSP.2020.2998361
ISSN
1070-9908
Publisher
Institute of Electrical and Electronics Engineers
Start Page
960
End Page
964
Journal / Book Title
IEEE Signal Processing Letters
Volume
27
Copyright Statement
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Identifier
https://ieeexplore.ieee.org/document/9103219
Subjects
Science & Technology
Technology
Engineering, Electrical & Electronic
Engineering
Noise reduction
Signal processing algorithms
Encoding
Training
Optimization
Approximation algorithms
Degradation
Variational autoencoders
denoising
dialogue systems
sample-efficient statistical dialogue managers
least-squares policy iteration
0801 Artificial Intelligence and Image Processing
0906 Electrical and Electronic Engineering
1005 Communications Technologies
Networking & Telecommunications
Publication Status
Published
Date Publish Online
2020-05-28
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback