Poly-NL: linear complexity non-local layers with 3rd order polynomials
Author(s)
Type
conference proceedings
Abstract
Spatial self-attention layers, in the form of Non-Local blocks, introduce long-range dependencies in Convolutional Neural Networks by computing pairwise similarities among all possible positions. Such pairwise functions underpin the effectiveness of non-local layers, but also determine a complexity that scales quadratically with respect to the input size both in space and time. This is a severely limiting factor that practically hinders the applicability of non-local blocks to even moderately sized inputs. Previous works focused on reducing the complexity by modifying the underlying matrix operations, however in this work we aim to retain full expressiveness of non-local layers while keeping complexity linear. We overcome the efficiency limitation of non-local blocks by framing them as special cases of 3rd order polynomial functions. This fact enables us to formulate novel fast Non-Local blocks, capable of reducing the complexity from quadratic to linear with no loss in performance, by replacing any direct computation of pairwise similarities with element-wise multiplications. The proposed method, which we dub as "Poly-NL", is competitive with state-of-the-art performance across image recognition, instance segmentation, and face detection tasks, while having considerably less computational overhead.
Date Issued
2022-02-28
Date Acceptance
2021-10-11
Citation
2021 IEEE/CVF International Conference on Computer Vision (ICCV), 2022, pp.10498-10508
Publisher
IEEE
Start Page
10498
End Page
10508
Journal / Book Title
2021 IEEE/CVF International Conference on Computer Vision (ICCV)
Copyright Statement
© 2021 IEEE. This ICCV copy is the Open Access version, provided by the Computer Vision Foundation. Except for the watermark, it is identical to the accepted version; the final published version of the proceedings is available on IEEE Xplore.
Identifier
https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000798743200049&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=a2bf6146997ec60c407a63945d4e92bb
Source
18th IEEE/CVF International Conference on Computer Vision (ICCV)
Subjects
Computer Science
Computer Science, Artificial Intelligence
Computer Science, Theory & Methods
Science & Technology
Technology
Publication Status
Published
Start Date
2021-10-11
Finish Date
2021-10-17
Coverage Spatial
Virtual