Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. Faculty of Medicine
  4. Machine learning for classification of hypertension subtypes using multi-omics: a multi-centre, retrospective, data-driven study
 
  • Details
Machine learning for classification of hypertension subtypes using multi-omics: a multi-centre, retrospective, data-driven study
File(s)
1-s2.0-S2352396422004583-main.pdf (3.42 MB)
Published version
Author(s)
Reel, Parminder S
Reel, Smarti
van Kralingen, Josie C
Langton, Katharina
Lang, Katharina
more
Type
Journal Article
Abstract
Background
Arterial hypertension is a major cardiovascular risk factor. Identification of secondary hypertension in its various forms is key to preventing and targeting treatment of cardiovascular complications. Simplified diagnostic tests are urgently required to distinguish primary and secondary hypertension to address the current underdiagnosis of the latter.
Methods
This study uses Machine Learning (ML) to classify subtypes of endocrine hypertension (EHT) in a large cohort of hypertensive patients using multidimensional omics analysis of plasma and urine samples. We measured 409 multi-omics (MOmics) features including plasma miRNAs (PmiRNA: 173), plasma catechol O-methylated metabolites (PMetas: 4), plasma steroids (PSteroids: 16), urinary steroid metabolites (USteroids: 27), and plasma small metabolites (PSmallMB: 189) in primary hypertension (PHT) patients, EHT patients with either primary aldosteronism (PA), pheochromocytoma/functional paraganglioma (PPGL) or Cushing syndrome (CS) and normotensive volunteers (NV). Biomarker discovery involved selection of disease combination, outlier handling, feature reduction, 8 ML classifiers, class balancing and consideration of different age- and sex-based scenarios. Classifications were evaluated using balanced accuracy, sensitivity, specificity, AUC, F1, and Kappa score.
Findings
Complete clinical and biological datasets were generated from 307 subjects (PA=113, PPGL=88, CS=41 and PHT=112). The random forest classifier provided ∼92% balanced accuracy (∼11% improvement on the best mono-omics classifier), with 96% specificity and 0.95 AUC to distinguish one of the four conditions in multi-class ALL-ALL comparisons (PPGL vs PA vs CS vs PHT) on an unseen test set, using 57 MOmics features. For discrimination of EHT (PA + PPGL + CS) vs PHT, the simple logistic classifier achieved 0.96 AUC with 90% sensitivity, and ∼86% specificity, using 37 MOmics features. One PmiRNA (hsa-miR-15a-5p) and two PSmallMB (C9 and PC ae C38:1) features were found to be most discriminating for all disease combinations. Overall, the MOmics-based classifiers were able to provide better classification performance in comparison to mono-omics classifiers.
Interpretation
We have developed a ML pipeline to distinguish different EHT subtypes from PHT using multi-omics data. This innovative approach to stratification is an advancement towards the development of a diagnostic tool for EHT patients, significantly increasing testing throughput and accelerating administration of appropriate treatment.
Funding
European Union's Horizon 2020 Research and Innovation Programme under Grant Agreement No. 633983, Clinical Research Priority Program of the University of Zurich for the CRPP HYRENE (to Z.E. and F.B.), and Deutsche Forschungsgemeinschaft (CRC/Transregio 205/1).
Date Issued
2022-10-01
Date Acceptance
2022-09-06
Citation
EBioMedicine, 2022, 84
URI
https://hdl.handle.net/10044/1/126093
URL
https://www.sciencedirect.com/science/article/pii/S2352396422004583?via%3Dihub
DOI
https://www.dx.doi.org/10.1016/j.ebiom.2022.104276
ISSN
2352-3964
Publisher
Elsevier
Journal / Book Title
EBioMedicine
Volume
84
Copyright Statement
© 2022 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)
License URL
http://creativecommons.org/licenses/by/4.0/
Identifier
https://www.ncbi.nlm.nih.gov/pubmed/36179553
PII: S2352-3964(22)00458-3
Subjects
Biomarkers
BLOOD-PRESSURE
Cushing syndrome
DIAGNOSIS
ENDOCRINE
General & Internal Medicine
Hypertension
Life Sciences & Biomedicine
Machine learning
Medicine, General & Internal
Medicine, Research & Experimental
METABOLOMICS
Multi-omics
paragan-glioma
Pheochromocytoma
PREVALENCE
Primary aldosteronism
PRIMARY ALDOSTERONISM
Research & Experimental Medicine
Science & Technology
SOCIETY
Publication Status
Published
Coverage Spatial
Netherlands
Article Number
104276
Date Publish Online
2022-09-27
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback