Predicting total knee replacement at 2 and 5 years in osteoarthritis patients using machine learning
File(s)e000141.full.pdf (931.49 KB)
Published version
Author(s)
Mahmoud, Khadija
Alagha, M Abdulhadi
Nowinka, Zuzanna
Jones, Gareth
Type
Journal Article
Abstract
Objectives Knee osteoarthritis is a major cause of physical disability and reduced quality of life, with end-stage disease often treated by total knee replacement (TKR). We set out to develop and externally validate a machine learning model capable of predicting the need for a TKR in 2 and 5 years time using routinely collected health data.
Design A prospective study using datasets Osteoarthritis Initiative (OAI) and the Multicentre Osteoarthritis Study (MOST). OAI data were used to train the models while MOST data formed the external test set. The data were preprocessed using feature selection to curate 45 candidate features including demographics, medical history, imaging assessments, history of intervention and outcome.
Setting The study was conducted using two multicentre USA-based datasets of participants with or at high risk of knee OA.
Participants The study excluded participants with at least one existing TKR. OAI dataset included participants aged 45–79 years of which 3234 were used for training and 809 for internal testing, while MOST involved participants aged 50–79 and 2248 were used for external testing.
Main outcome measures The primary outcome of this study was prediction of TKR onset at 2 and 5 years. Performance was evaluated using area under the curve (AUC) and F1-score and key predictors identified.
Results For the best performing model (gradient boosting machine), the AUC at 2 years was 0.913 (95% CI 0.876 to 0.951), and at 5 years 0.873 (95% CI 0.839 to 0.907). Radiographic-derived features, questionnaire-based assessments alongside the patient’s educational attainment were key predictors for these models.
Conclusions Our approach suggests that routinely collected patient data are sufficient to drive a predictive model with a clinically acceptable level of accuracy (AUC>0.7) and is the first such tool to be externally validated. This level of accuracy is higher than previously published models utilising MRI data, which is not routinely collected.
Design A prospective study using datasets Osteoarthritis Initiative (OAI) and the Multicentre Osteoarthritis Study (MOST). OAI data were used to train the models while MOST data formed the external test set. The data were preprocessed using feature selection to curate 45 candidate features including demographics, medical history, imaging assessments, history of intervention and outcome.
Setting The study was conducted using two multicentre USA-based datasets of participants with or at high risk of knee OA.
Participants The study excluded participants with at least one existing TKR. OAI dataset included participants aged 45–79 years of which 3234 were used for training and 809 for internal testing, while MOST involved participants aged 50–79 and 2248 were used for external testing.
Main outcome measures The primary outcome of this study was prediction of TKR onset at 2 and 5 years. Performance was evaluated using area under the curve (AUC) and F1-score and key predictors identified.
Results For the best performing model (gradient boosting machine), the AUC at 2 years was 0.913 (95% CI 0.876 to 0.951), and at 5 years 0.873 (95% CI 0.839 to 0.907). Radiographic-derived features, questionnaire-based assessments alongside the patient’s educational attainment were key predictors for these models.
Conclusions Our approach suggests that routinely collected patient data are sufficient to drive a predictive model with a clinically acceptable level of accuracy (AUC>0.7) and is the first such tool to be externally validated. This level of accuracy is higher than previously published models utilising MRI data, which is not routinely collected.
Date Issued
2023-02-15
Date Acceptance
2022-12-09
Citation
BMJ Surgery, Interventions, & Health Technologies, 2023, 5 (1), pp.1-9
ISSN
2631-4940
Publisher
BMJ Publishing Group
Start Page
1
End Page
9
Journal / Book Title
BMJ Surgery, Interventions, & Health Technologies
Volume
5
Issue
1
Copyright Statement
© Author(s) (or their employer(s)) 2023. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ. http://creativecommons.org/licenses/by-nc/4.0/
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Identifier
https://sit.bmj.com/content/5/1/e000141
Subjects
Health Care Quality, Access, and Evaluation
Learning Curve
Orthopedic Devices
Patient Outcome Assessment
Technology Assessment, Biomedical
Publication Status
Published
Date Publish Online
2023-02-15