20
IRUS TotalDownloads
Altmetric
Approximate FPGA-based LSTMs under computation time constraints
File | Description | Size | Format | |
---|---|---|---|---|
Approximate.pdf | Accepted version | 813.78 kB | Adobe PDF | View/Open |
Title: | Approximate FPGA-based LSTMs under computation time constraints |
Authors: | Rizakis, M Venieris, SI Kouris, A Bouganis, C-S |
Item Type: | Conference Paper |
Abstract: | Recurrent Neural Networks, with the prominence of Long Short-Term Memory (LSTM) networks, have demonstrated state-of-the- art accuracy in several emerging Artificial Intelligence tasks. Neverthe- less, the highest performing LSTM models are becoming increasingly demanding in terms of computational and memory load. At the same time, emerging latency-sensitive applications including mobile robots and autonomous vehicles often operate under stringent computation time constraints. In this paper, we address the challenge of deploying com- putationally demanding LSTMs at a constrained time budget by intro- ducing an approximate computing scheme that combines iterative low- rank compression and pruning, along with a novel FPGA-based LSTM architecture. Combined in an end-to-end framework, the approximation method parameters are optimised and the architecture is configured to address the problem of high-performance LSTM execution in time- constrained applications. Quantitative evaluation on a real-life image captioning application indicates that the proposed system required up to 6.5 × less time to achieve the same application-level accuracy compared to a baseline method, while achieving an average of 25 × higher accuracy under the same computation time constraints. |
Editors: | Voros, NS Hübner, M Keramidas, G Goehringer, D Antonopoulos, CP Diniz, PC |
Issue Date: | 2-May-2018 |
Date of Acceptance: | 2-May-2018 |
URI: | http://hdl.handle.net/10044/1/64139 |
DOI: | https://dx.doi.org/10.1007/978-3-319-78890-6 |
ISBN: | 9783319788890 |
ISSN: | 0302-9743 |
Publisher: | Springer |
Start Page: | 3 |
End Page: | 15 |
Journal / Book Title: | Applied Reconfigurable Computing. Architectures, Tools, and Applications |
Volume: | 10824 |
Copyright Statement: | © 2018 Springer International Publishing AG, part of Springer Nature. The final publication is available at Springer via https://dx.doi.org/10.1007/978-3-319-78890-6 |
Conference Name: | ARC 2018: 14th International Symposium on Applied Reconfigurable Computing |
Keywords: | 08 Information And Computing Sciences Artificial Intelligence & Image Processing |
Publication Status: | Published |
Start Date: | 2018-05-02 |
Finish Date: | 2018-05-04 |
Conference Place: | Santorini, Greece |
Appears in Collections: | Electrical and Electronic Engineering |