Variational denoising autoencoders and least-squares policy iteration for statistical dialogue managers
File(s)SigProcLetters2020_Kotti.pdf (278.38 KB)
Accepted version
Author(s)
Kotti, Margarita
Diakoloukas, Vassilios
Lygerakis, Fotios
Lagoudakis, Michail
Type
Journal Article
Abstract
The use of Reinforcement Learning (RL) approaches for dialogue policy optimization has been the new trend for dialogue management systems. Several methods have been proposed, which are trained on dialogue data to provide optimal system response. However, most of these approaches exhibit performance degradation in the presence of noise, poor scalability to other domains, as well as performance instabilities. To overcome these problems, we propose a novel approach based on the incremental, sample-efficient Least-Squares Policy Iteration (LSPI) algorithm, which is trained on compact, fixed-size dialogue state encodings, obtained from deep Variational Denoising Autoencoders (VDAE). The proposed scheme exhibits stable and noise-robust performance, which significantly outperforms the current state-of-the-art, even in mismatched noise environments.
Date Issued
2020-05-28
Date Acceptance
2020-05-18
Citation
IEEE Signal Processing Letters, 2020, 27, pp.960-964
ISSN
1070-9908
Publisher
Institute of Electrical and Electronics Engineers
Start Page
960
End Page
964
Journal / Book Title
IEEE Signal Processing Letters
Volume
27
Copyright Statement
© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Identifier
https://ieeexplore.ieee.org/document/9103219
Subjects
Science & Technology
Technology
Engineering, Electrical & Electronic
Engineering
Noise reduction
Signal processing algorithms
Encoding
Training
Optimization
Approximation algorithms
Degradation
Variational autoencoders
denoising
dialogue systems
sample-efficient statistical dialogue managers
least-squares policy iteration
0801 Artificial Intelligence and Image Processing
0906 Electrical and Electronic Engineering
1005 Communications Technologies
Networking & Telecommunications
Publication Status
Published
Date Publish Online
2020-05-28