FPGA-based acceleration for Bayesian convolutional neural networks
File(s)tcad22_3dbayes_hf1_final.pdf (920.03 KB)
Accepted version
Author(s)
Type
Journal Article
Abstract
Neural networks (NNs) have demonstrated their
potential in a variety of domains ranging from computer
vision to natural language processing. Among various NNs,
two-dimensional (2D) and three-dimensional (3D) convolutional
neural networks (CNNs) have been widely adopted in a broad
spectrum of applications such as image classification and video
recognition, due to their excellent capabilities in extracting 2D
and 3D features. However, standard 2D and 3D CNNs are
not able to capture their model uncertainty which is crucial
for many safety-critical applications including healthcare and
autonomous driving. In contrast, Bayesian convolutional neural
networks (BayesCNNs), as a variant of CNNs, have demonstrated
their ability to express uncertainty in their prediction via a
mathematical grounding. Nevertheless, BayesCNNs have not
been widely used in industrial practice due to their compute
requirements stemming from sampling and subsequent forward
passes through the whole network multiple times. As a result,
these processes significantly increase the amount of computation
and memory consumption in comparison to standard CNNs. This
paper proposes a novel FPGA-based hardware architecture to
accelerate both 2D and 3D BayesCNNs inferred through Monte
Carlo Dropout. Compared with other state-of-the-art accelerators
for BayesCNNs, the proposed design can achieve up to 4 times
higher energy efficiency and 9 times better compute efficiency.
Considering partial Bayesian inference, an automatic framework
is proposed to explore the trade-off between hardware and
algorithmic performance. Extensive experiments are conducted
to demonstrate that our proposed framework can effectively find
the optimal points in the design space.
potential in a variety of domains ranging from computer
vision to natural language processing. Among various NNs,
two-dimensional (2D) and three-dimensional (3D) convolutional
neural networks (CNNs) have been widely adopted in a broad
spectrum of applications such as image classification and video
recognition, due to their excellent capabilities in extracting 2D
and 3D features. However, standard 2D and 3D CNNs are
not able to capture their model uncertainty which is crucial
for many safety-critical applications including healthcare and
autonomous driving. In contrast, Bayesian convolutional neural
networks (BayesCNNs), as a variant of CNNs, have demonstrated
their ability to express uncertainty in their prediction via a
mathematical grounding. Nevertheless, BayesCNNs have not
been widely used in industrial practice due to their compute
requirements stemming from sampling and subsequent forward
passes through the whole network multiple times. As a result,
these processes significantly increase the amount of computation
and memory consumption in comparison to standard CNNs. This
paper proposes a novel FPGA-based hardware architecture to
accelerate both 2D and 3D BayesCNNs inferred through Monte
Carlo Dropout. Compared with other state-of-the-art accelerators
for BayesCNNs, the proposed design can achieve up to 4 times
higher energy efficiency and 9 times better compute efficiency.
Considering partial Bayesian inference, an automatic framework
is proposed to explore the trade-off between hardware and
algorithmic performance. Extensive experiments are conducted
to demonstrate that our proposed framework can effectively find
the optimal points in the design space.
Date Issued
2022-03-28
Date Acceptance
2022-02-16
Citation
IEEE Transactions on Computer - Aided Design of Integrated Circuits and Systems, 2022, 41 (12), pp.5343-5356
ISSN
0278-0070
Publisher
Institute of Electrical and Electronics Engineers
Start Page
5343
End Page
5356
Journal / Book Title
IEEE Transactions on Computer - Aided Design of Integrated Circuits and Systems
Volume
41
Issue
12
Copyright Statement
Copyright © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Sponsor
Engineering and Physical Sciences Research Council
Engineering & Physical Science Research Council (EPSRC)
Engineering & Physical Science Research Council (EPSRC)
Identifier
https://ieeexplore.ieee.org/document/9743481
Grant Number
EP/L016796/1
EP/P010040/1
EP/S030069/1
Subjects
Computer Hardware & Architecture
0906 Electrical and Electronic Engineering
1006 Computer Hardware
Publication Status
Published
Date Publish Online
2022-03-28