A brief survey of deep reinforcement learning

File Description SizeFormat 
1708.05866v1.pdfAccepted version5.11 MBAdobe PDFDownload
Title: A brief survey of deep reinforcement learning
Author(s): Arulkumaran, K
Deisenroth, MP
Brundage, M
Bharath, AA
Item Type: Journal Article
Abstract: Deep reinforcement learning (DRL) is poised to revolutionize the field of artificial intelligence (AI) and represents a step toward building autonomous systems with a higherlevel understanding of the visual world. Currently, deep learning is enabling reinforcement learning (RL) to scale to problems that were previously intractable, such as learning to play video games directly from pixels. DRL algorithms are also applied to robotics, allowing control policies for robots to be learned directly from camera inputs in the real world. In this survey, we begin with an introduction to the general field of RL, then progress to the main streams of value-based and policy-based methods. Our survey will cover central algorithms in deep RL, including the deep Q-network (DQN), trust region policy optimization (TRPO), and asynchronous advantage actor critic. In parallel, we highlight the unique advantages of deep neural networks, focusing on visual understanding via RL. To conclude, we describe several current areas of research within the field.
Publication Date: 9-Nov-2017
Date of Acceptance: 1-Aug-2017
URI: http://hdl.handle.net/10044/1/53340
DOI: https://dx.doi.org/10.1109/MSP.2017.2743240
ISSN: 1053-5888
Publisher: Institute of Electrical and Electronics Engineers
Start Page: 26
End Page: 38
Journal / Book Title: IEEE Signal Processing Magazine
Volume: 34
Copyright Statement: © 2017 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Keywords: cs.LG
cs.LG
cs.AI
stat.ML
0906 Electrical And Electronic Engineering
0913 Mechanical Engineering
Networking & Telecommunications
Article Number: 6
Appears in Collections:Faculty of Engineering
Bioengineering
Computing



Items in Spiral are protected by copyright, with all rights reserved, unless otherwise indicated.

Creative Commons