Anomaly detection through latent space restoration using vector-quantized variational autoencoders
File(s)2012.06765v1.pdf (426.66 KB)
Accepted version
Author(s)
Marimont, Sergio Naval
Tarroni, Giacomo
Type
Conference Paper
Abstract
We propose an out-of-distribution detection method that combines density and
restoration-based approaches using Vector-Quantized Variational Auto-Encoders
(VQ-VAEs). The VQ-VAE model learns to encode images in a categorical latent
space. The prior distribution of latent codes is then modelled using an
Auto-Regressive (AR) model. We found that the prior probability estimated by
the AR model can be useful for unsupervised anomaly detection and enables the
estimation of both sample and pixel-wise anomaly scores. The sample-wise score
is defined as the negative log-likelihood of the latent variables above a
threshold selecting highly unlikely codes. Additionally, out-of-distribution
images are restored into in-distribution images by replacing unlikely latent
codes with samples from the prior model and decoding to pixel space. The
average L1 distance between generated restorations and original image is used
as pixel-wise anomaly score. We tested our approach on the MOOD challenge
datasets, and report higher accuracies compared to a standard
reconstruction-based approach with VAEs.
restoration-based approaches using Vector-Quantized Variational Auto-Encoders
(VQ-VAEs). The VQ-VAE model learns to encode images in a categorical latent
space. The prior distribution of latent codes is then modelled using an
Auto-Regressive (AR) model. We found that the prior probability estimated by
the AR model can be useful for unsupervised anomaly detection and enables the
estimation of both sample and pixel-wise anomaly scores. The sample-wise score
is defined as the negative log-likelihood of the latent variables above a
threshold selecting highly unlikely codes. Additionally, out-of-distribution
images are restored into in-distribution images by replacing unlikely latent
codes with samples from the prior model and decoding to pixel space. The
average L1 distance between generated restorations and original image is used
as pixel-wise anomaly score. We tested our approach on the MOOD challenge
datasets, and report higher accuracies compared to a standard
reconstruction-based approach with VAEs.
Date Issued
2021-05-25
Date Acceptance
2021-01-08
Citation
IEEE Xplore, 2021, pp.1764-1767
Publisher
IEEE
Start Page
1764
End Page
1767
Journal / Book Title
IEEE Xplore
Copyright Statement
© 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Identifier
http://arxiv.org/abs/2012.06765v1
Source
2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI)
Subjects
cs.CV
cs.CV
cs.LG
eess.IV
Notes
4 Pages, 4 Figures. Submitted to ISBI 2021
Start Date
2021-04-13
Finish Date
2021-04-16
Coverage Spatial
Nice, France
Date Publish Online
2021-05-25