Regularization of polynomial networks for image recognition
Author(s)
Chrysos, Grigorios G
Wang, Bohan
Deng, Jiankang
Cevher, Volkan
Type
Conference Paper
Abstract
Deep Neural Networks (DNNs) have obtained impressive performance across tasks, however they still remain as black boxes, e.g., hard to theoretically analyze. At the same time, Polynomial Networks (PNs) have emerged as an alternative method with a promising performance and improved interpretability but have yet to reach the performance of the powerful DNN baselines. In this work, we aim to close this performance gap. We introduce a class of PNs, which are able to reach the performance of ResNet across a range of six benchmarks. We demonstrate that strong regularization is critical and conduct an extensive study of the exact regularization schemes required to match performance. To further motivate the regularization schemes, we introduce D-PolyNets that achieve a higher- degree of expansion than previously proposed polynomial networks. D-PolyNets are more parameter-efficient while achieving a similar performance as other polynomial networks. We expect that our new models can lead to an understanding of the role of elementwise activation functions (which are no longer required for training PNs). The source code is available at https://github.com/grigorisg9gr/regularized_polynomials.
Date Issued
2023-08-22
Date Acceptance
2023-06-17
Citation
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp.16123-16132
ISSN
1063-6919
Publisher
IEEE Computer Society
Start Page
16123
End Page
16132
Journal / Book Title
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Copyright Statement
© 2023 IEEE. This CVPR paper is the Open Access version, provided by the Computer Vision Foundation. Except for this watermark, it is identical to the accepted version; the final published version of the proceedings is available on IEEE Xplore.
Source
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Subjects
Computer Science
Computer Science, Artificial Intelligence
Science & Technology
Technology
Publication Status
Published
Start Date
2023-06-17
Finish Date
2023-06-24
Coverage Spatial
Vancouver, Canada