Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. National Heart and Lung Institute
  4. National Heart and Lung Institute
  5. Elucidating B4GALNT1 as potential biomarker in hepatocellular carcinoma using machine learning models and mutational dynamics explored through MD simulation
 
  • Details
Elucidating B4GALNT1 as potential biomarker in hepatocellular carcinoma using machine learning models and mutational dynamics explored through MD simulation
File(s)
1-s2.0-S2352914824000704-main.pdf (21.85 MB)
Published version
Author(s)
Verma, Rohit Kumar
Lokhande, Kiran Bharat
Srivastava, Prashant Kumar
Singh, Ashutosh
Type
Journal Article
Abstract
Liver hepatocellular carcinoma (LIHC) is considered one of the primary contributors to cancer-related mortality on a global scale. The identification of new biomarkers is of utmost importance due to the fact that patients with LIHC are frequently detected at advanced stages, leading to an increased mortality rate. The study utilized TCGA-LIHC gene expression datasets to identify biomarkers and to address the complexity of datasets. A combination of feature selection (FS) techniques was used, and the performance of this strategy was assessed using ten machine learning classifiers. The findings were integrated, revealing biomarkers identified through at least five FS techniques. Through our proposed approach, we identified 55 potential biomarkers for LIHC. The Gaussian Naive Bayes Classifier (AUC = 0.99) was found to be the most effective classifier, achieving 98.67% accuracy when utilizing the 55 identified biomarkers in the test dataset. Additionally, we conducted differential gene expression, survival analysis, and enrichment analysis for all the identified biomarkers. Subsequently, Lasso-penalized Cox regression further refined the identified biomarkers to thirteen. Out of thirteen genes, we singled out B4GALNT1 because of its statistical significance in differential expression analysis and increasing importance across various cancer types, including LIHC. We carried out comprehensive bioinformatics and molecular dynamics simulation studies along with other structural analysis of B4GALNT1 in LIHC. In LIHC, six mutations (P64Q, S131F, A311S, R340Q, D478H, and P507Q) have been predicted to be probably damaging by evaluating in-silico prediction algorithms. In comparison to the wild type, the B4GALNT1 variations, specifically P64Q and S131F, demonstrate increased stability. However, these mutations lead to decreased atomic fluctuations, indicating a rigid protein structure. Again, mutations like A311S and P507Q induce increased flexibility, highlighting their structural impact on B4GALNT1. The study demonstrated the combination of various feature selection methods effectively reveals new biomarkers, thereby directly impacting their biological significance. Furthermore, our findings indicate a link between increased B4GALNT1 expression in individuals with liver cancer and a poorer prognosis, highlighting its potential as a promising therapeutic target.
Date Issued
2024
Date Acceptance
2024-05-02
Citation
Informatics in Medicine Unlocked, 2024, 48
URI
http://hdl.handle.net/10044/1/112336
URL
http://dx.doi.org/10.1016/j.imu.2024.101514
DOI
https://www.dx.doi.org/10.1016/j.imu.2024.101514
ISSN
2352-9148
Publisher
Elsevier
Journal / Book Title
Informatics in Medicine Unlocked
Volume
48
Copyright Statement
© 2024 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/).
License URL
http://creativecommons.org/licenses/by-nc/4.0/
Identifier
http://dx.doi.org/10.1016/j.imu.2024.101514
Publication Status
Published
Article Number
101514
Date Publish Online
2024-05-07
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback