Repository logo
  • Log In
    Log in via Symplectic to deposit your publication(s).
Repository logo
  • Communities & Collections
  • Research Outputs
  • Statistics
  • Log In
    Log in via Symplectic to deposit your publication(s).
  1. Home
  2. Faculty of Medicine
  3. National Heart and Lung Institute
  4. National Heart and Lung Institute
  5. ClinVar data parsing.
 
  • Details
ClinVar data parsing.
File(s)
29c66e61-bb51-49de-93e3-e9f0402b43c5_11640_-_xiaolei_zhang.pdf (557.37 KB)
Published version
OA Location
https://wellcomeopenresearch.org/articles/2-33/v1
Author(s)
Zhang, X
Minikel, EV
O'Donnell-Luria, AH
MacArthur, DG
Ware, JS
more
Type
Journal Article
Abstract
This software repository provides a pipeline for converting raw ClinVar data files into analysis-friendly tab-delimited tables, and also provides these tables for the most recent ClinVar release. Separate tables are generated for genome builds GRCh37 and GRCh38 as well as for mono-allelic variants and complex multi-allelic variants. Additionally, the tables are augmented with allele frequencies from the ExAC and gnomAD datasets as these are often consulted when analyzing ClinVar variants. Overall, this work provides ClinVar data in a format that is easier to work with and can be directly loaded into a variety of popular analysis tools such as R, python pandas, and SQL databases.
Date Issued
2017-05-23
Date Acceptance
2017-05-22
Citation
Wellcome Open Research, 2017, 2
URI
http://hdl.handle.net/10044/1/52936
DOI
https://www.dx.doi.org/10.12688/wellcomeopenres.11640.1
ISSN
2398-502X
Publisher
F1000Research
Journal / Book Title
Wellcome Open Research
Volume
2
Copyright Statement
Copyright: © 2017 Zhang X et al. This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The author(s) is/are employees of the US Government and therefore domestic copyright protection in USA does not apply to this work. The work may be protected under the copyright laws of other jurisdictions when used in those jurisdictions.
License URL
http://creativecommons.org/licenses/by/4.0/
Sponsor
Wellcome Trust
Grant Number
107469/Z/15/Z
Subjects
ClinVar
Mendelian disease
XML parsing
pathogenic variants
variant interpretation
Publication Status
Published online
Article Number
33
About
Spiral Depositing with Spiral Publishing with Spiral Symplectic
Contact us
Open access team Report an issue
Other Services
Scholarly Communications Library Services
logo

Imperial College London

South Kensington Campus

London SW7 2AZ, UK

tel: +44 (0)20 7589 5111

Accessibility Modern slavery statement Cookie Policy

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement
  • Send Feedback