13
IRUS Total
Downloads
  Altmetric

Disease-specific variant pathogenicity prediction significantly improves variant interpretation in inherited cardiac conditions

File Description SizeFormat 
Supplementary_material_accepted.pdfSupplementary information949.97 kBAdobe PDFView/Open
s41436-020-00972-3.pdfPublished version954.65 kBAdobe PDFView/Open
Title: Disease-specific variant pathogenicity prediction significantly improves variant interpretation in inherited cardiac conditions
Authors: Zhang, X
Walsh, R
Whiffin, N
Buchan, R
Midwinter, W
Wilk, A
Govind, R
Li, N
Ahmad, M
Mazzarotto, F
Roberts, A
Theotokis, P
Mazaika, E
Allouba, M
De Marvao, A
Pua, CJ
Day, SM
Ashley, E
Colan, SD
Michels, M
Pereira, AC
Jacoby, D
Ho, CY
Olivotto, I
Gunnarsson, GT
Jefferies, J
Semsarian, C
Ingles, J
O’Regan, DP
Aguib, Y
Yacoub, MH
Cook, SA
Barton, PJR
Bottolo, L
Ware, JS
Item Type: Journal Article
Abstract: Background: Accurate discrimination of benign and pathogenic rare variation remains a priority for clinical genome interpretation. State-of-the-art machine learning tools are useful for genome-wide variant prioritisation but remain imprecise. Since the relationship between molecular consequence and likelihood of pathogenicity varies between genes with distinct molecular mechanisms, we hypothesised that a disease-specific classifier may outperform existing genome-wide tools. Methods: We present a novel disease-specific variant classification tool, CardioBoost, that estimates the probability of pathogenicity for rare missense variants in inherited cardiomyopathies and arrhythmias, trained with variants of known clinical effect. To benchmark against state-of-the-art genome-wide pathogenicity classification tools, we assessed classification of hold-out test variants using both overall performance metrics, and metrics of high-confidence (&gt;90%) classifications relevant to variant interpretation. We further evaluated the prioritisation of variants associated with disease and patient clinical outcomes, providing validations that are robust to potential mis-classification in gold-standard reference datasets. Results: CardioBoost has higher discriminating power than published genome-wide variant classification tools in distinguishing between pathogenic and benign variants based on overall classification performance measures with the highest area under the Precision-Recall Curve as 91% for cardiomyopathies and as 96% for inherited arrhythmias. When assessed at high-confidence (&gt;90%) classification thresholds, prediction accuracy is improved by at least 120% over existing tools for both cardiomyopathies and arrhythmias, with significantly improved sensitivity and specificity. Finally, CardioBoost improves prioritisation of variants significantly associated with disease, and stratifies survival of patients with cardiomyopathies, confirming biologically relevant variant classification. Conclusions: We demonstrate that a disease-specific variant pathogenicity prediction tool outperforms state-of-the-art genome-wide tools for the classification of rare missense variants of uncertain significance for inherited cardiac conditions. To facilitate evaluation of CardioBoost, we provide pre-computed pathogenicity scores for all possible rare missense variants in genes associated with cardiomyopathies and arrhythmias (<jats:ext-link xmlns:xlink="http://www.w3.org/1999/xlink" ext-link-type="uri" xlink:href="https://www.cardiodb.org/cardioboost/">https://www.cardiodb.org/cardioboost/</jats:ext-link>). Our results also highlight the need to develop and evaluate variant classification tools focused on specific diseases and clinical application contexts. Our proposed model for assessing variants in known disease genes, and the use of application-specific evaluations, is broadly applicable to improve variant interpretation across a wide range of Mendelian diseases.
Issue Date: 1-Jan-2021
Date of Acceptance: 27-Aug-2020
URI: http://hdl.handle.net/10044/1/83594
DOI: 10.1038/s41436-020-00972-3
ISSN: 1098-3600
Publisher: American College of Medical Genetics and Genomics
Start Page: 69
End Page: 79
Journal / Book Title: Genetics in Medicine
Volume: 23
Copyright Statement: © The Author(s) 2020. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in anymediumor format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Sponsor/Funder: Wellcome Trust
Royal Brompton & Harefield NHS Foundation Trust
Imperial College Healthcare NHS Trust- BRC Funding
British Heart Foundation
Imperial College Healthcare NHS Trust- BRC Funding
British Heart Foundation
British Heart Foundation
Funder's Grant Number: 107469/Z/15/Z
N/A
RDC04
NH/17/1/32725
RDB02
RE/18/4/34215
RG/19/6/34387
Keywords: Science & Technology
Life Sciences & Biomedicine
Genetics & Heredity
pathogenicity prediction
missense variant interpretation
cardiomyopathy
long QT syndrome
Brugada syndrome
MUTATION
GENOTYPE
Brugada syndrome
cardiomyopathy
long QT syndrome
missense variant interpretation
pathogenicity prediction
Genetics & Heredity
0604 Genetics
1103 Clinical Sciences
Publication Status: Published
Online Publication Date: 2020-10-13
Appears in Collections:National Heart and Lung Institute
Institute of Clinical Sciences
Faculty of Medicine



This item is licensed under a Creative Commons License Creative Commons