Common limitations of image processing metrics: a picture story
File(s)2104.05642v2.pdf (527.18 KB)
Working paper
Author(s)
Type
Working Paper
Abstract
While the importance of automatic image analysis is increasing at an enormous
pace, recent meta-research revealed major flaws with respect to algorithm
validation. Specifically, performance metrics are key for objective,
transparent and comparative performance assessment, but relatively little
attention has been given to the practical pitfalls when using specific metrics
for a given image analysis task. A common mission of several international
initiatives is therefore to provide researchers with guidelines and tools to
choose the performance metrics in a problem-aware manner. This dynamically
updated document has the purpose to illustrate important limitations of
performance metrics commonly applied in the field of image analysis. The
current version is based on a Delphi process on metrics conducted by an
international consortium of image analysis experts.
pace, recent meta-research revealed major flaws with respect to algorithm
validation. Specifically, performance metrics are key for objective,
transparent and comparative performance assessment, but relatively little
attention has been given to the practical pitfalls when using specific metrics
for a given image analysis task. A common mission of several international
initiatives is therefore to provide researchers with guidelines and tools to
choose the performance metrics in a problem-aware manner. This dynamically
updated document has the purpose to illustrate important limitations of
performance metrics commonly applied in the field of image analysis. The
current version is based on a Delphi process on metrics conducted by an
international consortium of image analysis experts.
Date Issued
2021-04-13
Citation
2021
Publisher
arXiv
Copyright Statement
© 2021 The Author(s)
Identifier
http://arxiv.org/abs/2104.05642v2
Subjects
cs.CV
cs.CV
eess.IV
Notes
This is a dynamic paper on limitations of commonly used metrics. The current version discusses segmentation metrics only, while future versions will also include metrics for classification and detection. For missing use cases, comments or questions, please contact a.reinke@dkfz.de or l.maier-hein@dkfz.de. Substantial contributions to this document will be acknowledged with a co-authorship
Publication Status
Published