Altmetric

Development of artificial intelligence models to classify pulmonary nodules and improve lung cancer early diagnosis

File Description SizeFormat 
Hunter-B-2023-PhD-Thesis.pdfThesis15.9 MBAdobe PDFView/Open
Title: Development of artificial intelligence models to classify pulmonary nodules and improve lung cancer early diagnosis
Authors: Hunter, Benjamin
Item Type: Thesis or dissertation
Abstract: The aims of this thesis were to develop machine-learning models to identify lung nodules, predict the risk of cancer and provide clinical decision support. A structured-query language model was developed at The Royal Marsden Hospital to generate a database of 14,586 patients with lung nodules. Lung (39%), neuro-endocrine (38%) and skin (35%) cancers were most commonly associated with nodules. Nodules patients had more metastatic diagnoses (45% vs 23%, p < 0.001) and a higher mean scan number (6.56 vs 1.93, p < 0.001) at shorter intervals (4.1 vs 5.9 months, p < 0.001). The model was externally validated with high performance (Krippendorf’s Alpha > 0.98). Scans from the LUCADI and LIBRA studies were used to develop small (< 15mm) and large (> 15mm) nodule radiomics predictive vectors (SN and LN-RPV respectively). Features were extracted using TexLab 2.0, and models were developing using LASSO logistic regression. The SN-RPV had an AUC of 0.78 in the test (95% C.I. 0.70-0.86) and external test (95% C.I. 0.71-0.83) sets. For the two-feature LN-RPV, the test set AUC was 0.87 (95% C.I. 0.80-0.93), compared to 0.67 (95% CI 0.55–0.76, DeLong p = 0.002) for the Brock score and 0.83 (95% CI 0.75–0.90, DeLong p = 0.4) for the Herder score. The external test set AUC was 0.75 (95% CI 0.63–0.85). The developed decision-support tool identified 18/22 (82%) malignant nodules in the Herder 10-70% category, and may have led to earlier investigation. Finally, a model was developed to predict nodule spiculation in the LIBRA and NSCLC Radiogenomics studies. The test set AUC for the 7 feature model was 0.90 (95% CI: 0.82- Introduction 5 0.96), and spiculation was associated with worse overall survival (HR 2.0, 95% C.I. 1.00 - 4.01, p = 0.04), the differential expression of 11 genes and suppression of inflammation.
Content Version: Open Access
Issue Date: Feb-2023
Date Awarded: Aug-2023
URI: http://hdl.handle.net/10044/1/113989
DOI: https://doi.org/10.25560/113989
Copyright Statement: Creative Commons Attribution NonCommercial Licence
Supervisor: Aboagye, Eric
Lee, Richard
Blackledge, Matthew
Sponsor/Funder: Cancer Research UK
RM Partners
Royal Marsden Cancer Charity
National Institute for Health Research (Great Britain)
Funder's Grant Number: C309/A31316
Department: Department of Surgery & Cancer
Publisher: Imperial College London
Qualification Level: Doctoral
Qualification Name: Doctor of Philosophy (PhD)
Appears in Collections:Department of Surgery and Cancer PhD Theses



This item is licensed under a Creative Commons License Creative Commons