13
IRUS TotalDownloads
Altmetric
Disease-specific variant pathogenicity prediction significantly improves variant interpretation in inherited cardiac conditions
File | Description | Size | Format | |
---|---|---|---|---|
Supplementary_material_accepted.pdf | Supplementary information | 949.97 kB | Adobe PDF | View/Open |
s41436-020-00972-3.pdf | Published version | 954.65 kB | Adobe PDF | View/Open |
Title: | Disease-specific variant pathogenicity prediction significantly improves variant interpretation in inherited cardiac conditions |
Authors: | Zhang, X Walsh, R Whiffin, N Buchan, R Midwinter, W Wilk, A Govind, R Li, N Ahmad, M Mazzarotto, F Roberts, A Theotokis, P Mazaika, E Allouba, M De Marvao, A Pua, CJ Day, SM Ashley, E Colan, SD Michels, M Pereira, AC Jacoby, D Ho, CY Olivotto, I Gunnarsson, GT Jefferies, J Semsarian, C Ingles, J O’Regan, DP Aguib, Y Yacoub, MH Cook, SA Barton, PJR Bottolo, L Ware, JS |
Item Type: | Journal Article |
Abstract: | Background: Accurate discrimination of benign and pathogenic rare variation remains a priority for clinical genome interpretation. State-of-the-art machine learning tools are useful for genome-wide variant prioritisation but remain imprecise. Since the relationship between molecular consequence and likelihood of pathogenicity varies between genes with distinct molecular mechanisms, we hypothesised that a disease-specific classifier may outperform existing genome-wide tools. Methods: We present a novel disease-specific variant classification tool, CardioBoost, that estimates the probability of pathogenicity for rare missense variants in inherited cardiomyopathies and arrhythmias, trained with variants of known clinical effect. To benchmark against state-of-the-art genome-wide pathogenicity classification tools, we assessed classification of hold-out test variants using both overall performance metrics, and metrics of high-confidence (>90%) classifications relevant to variant interpretation. We further evaluated the prioritisation of variants associated with disease and patient clinical outcomes, providing validations that are robust to potential mis-classification in gold-standard reference datasets. Results: CardioBoost has higher discriminating power than published genome-wide variant classification tools in distinguishing between pathogenic and benign variants based on overall classification performance measures with the highest area under the Precision-Recall Curve as 91% for cardiomyopathies and as 96% for inherited arrhythmias. When assessed at high-confidence (>90%) classification thresholds, prediction accuracy is improved by at least 120% over existing tools for both cardiomyopathies and arrhythmias, with significantly improved sensitivity and specificity. Finally, CardioBoost improves prioritisation of variants significantly associated with disease, and stratifies survival of patients with cardiomyopathies, confirming biologically relevant variant classification. Conclusions: We demonstrate that a disease-specific variant pathogenicity prediction tool outperforms state-of-the-art genome-wide tools for the classification of rare missense variants of uncertain significance for inherited cardiac conditions. To facilitate evaluation of CardioBoost, we provide pre-computed pathogenicity scores for all possible rare missense variants in genes associated with cardiomyopathies and arrhythmias (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://www.cardiodb.org/cardioboost/">https://www.cardiodb.org/cardioboost/</jats:ext-link>). Our results also highlight the need to develop and evaluate variant classification tools focused on specific diseases and clinical application contexts. Our proposed model for assessing variants in known disease genes, and the use of application-specific evaluations, is broadly applicable to improve variant interpretation across a wide range of Mendelian diseases. |
Issue Date: | 1-Jan-2021 |
Date of Acceptance: | 27-Aug-2020 |
URI: | http://hdl.handle.net/10044/1/83594 |
DOI: | 10.1038/s41436-020-00972-3 |
ISSN: | 1098-3600 |
Publisher: | American College of Medical Genetics and Genomics |
Start Page: | 69 |
End Page: | 79 |
Journal / Book Title: | Genetics in Medicine |
Volume: | 23 |
Copyright Statement: | © The Author(s) 2020. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in anymediumor format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
Sponsor/Funder: | Wellcome Trust Royal Brompton & Harefield NHS Foundation Trust Imperial College Healthcare NHS Trust- BRC Funding British Heart Foundation Imperial College Healthcare NHS Trust- BRC Funding British Heart Foundation British Heart Foundation |
Funder's Grant Number: | 107469/Z/15/Z N/A RDC04 NH/17/1/32725 RDB02 RE/18/4/34215 RG/19/6/34387 |
Keywords: | Science & Technology Life Sciences & Biomedicine Genetics & Heredity pathogenicity prediction missense variant interpretation cardiomyopathy long QT syndrome Brugada syndrome MUTATION GENOTYPE Brugada syndrome cardiomyopathy long QT syndrome missense variant interpretation pathogenicity prediction Genetics & Heredity 0604 Genetics 1103 Clinical Sciences |
Publication Status: | Published |
Online Publication Date: | 2020-10-13 |
Appears in Collections: | National Heart and Lung Institute Institute of Clinical Sciences Faculty of Medicine |
This item is licensed under a Creative Commons License