A review of the metrics used to assess auto-contouring systems in radiotherapy
File(s)1-s2.0-S0936655523000213-main.pdf (1.07 MB)
Published version
Author(s)
Mackay, K
Bernstein, D
Glocker, B
Kamnitsas, K
Taylor, A
Type
Journal Article
Abstract
Auto-contouring could revolutionise future planning of radiotherapy treatment. The lack of consensus on how to assess and validate auto-contouring systems currently limits clinical use. This review formally quantifies the assessment metrics used in studies published during one calendar year and assesses the need for standardised practice. A PubMed literature search was undertaken for papers evaluating radiotherapy auto-contouring published during 2021. Papers were assessed for types of metric and the methodology used to generate ground-truth comparators. Our PubMed search identified 212 studies, of which 117 met the criteria for clinical review. Geometric assessment metrics were used in 116 of 117 studies (99.1%). This includes the Dice Similarity Coefficient used in 113 (96.6%) studies. Clinically relevant metrics, such as qualitative, dosimetric and time-saving metrics, were less frequently used in 22 (18.8%), 27 (23.1%) and 18 (15.4%) of 117 studies, respectively. There was heterogeneity within each category of metric. Over 90 different names for geometric measures were used. Methods for qualitative assessment were different in all but two papers. Variation existed in the methods used to generate radiotherapy plans for dosimetric assessment. Consideration of editing time was only given in 11 (9.4%) papers. A single manual contour as a ground-truth comparator was used in 65 (55.6%) studies. Only 31 (26.5%) studies compared auto-contours to usual inter- and/or intra-observer variation. In conclusion, significant variation exists in how research papers currently assess the accuracy of automatically generated contours. Geometric measures are the most popular, however their clinical utility is unknown. There is heterogeneity in the methods used to perform clinical assessment. Considering the different stages of system implementation may provide a framework to decide the most appropriate metrics. This analysis supports the need for a consensus on the clinical implementation of auto-contouring.
Date Issued
2023-06
Date Acceptance
2023-01-01
Citation
Clinical Oncology, 2023, 35 (6), pp.354-369
ISSN
0936-6555
Publisher
Springer
Start Page
354
End Page
369
Journal / Book Title
Clinical Oncology
Volume
35
Issue
6
Copyright Statement
© 2023 The Authors. Published by Elsevier Ltd on behalf of The Royal College of Radiologists. This is an open access article under the CC BY license
(http://creativecommons.org/licenses/by/4.0/).
(http://creativecommons.org/licenses/by/4.0/).
License URL
Identifier
https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:001007726800001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=a2bf6146997ec60c407a63945d4e92bb
Subjects
ADAPTIVE RADIATION-THERAPY
Artificial intelligence
Auto-contour
AUTOMATIC SEGMENTATION
Auto-segmentation
AUTOSEGMENTATION
CELL LUNG-CANCER
CLINICAL IMPLEMENTATION
Deep learning
DEFORMABLE IMAGE REGISTRATION
Evaluation
HEAD
Life Sciences & Biomedicine
Oncology
ORGANS-AT-RISK
PROSTATE
Radiotherapy
Science & Technology
TARGET VOLUMES
Publication Status
Published
Date Publish Online
2023-01-31