84
IRUS Total
Downloads
  Altmetric

A structural variation reference for medical and population genetics.

File Description SizeFormat 
s41586-020-2287-8.pdfPublished version4.96 MBAdobe PDFView/Open
Title: A structural variation reference for medical and population genetics.
Authors: Collins, RL
Brand, H
Karczewski, KJ
Zhao, X
Alföldi, J
Francioli, LC
Khera, AV
Lowther, C
Gauthier, LD
Wang, H
Watts, NA
Solomonson, M
O'Donnell-Luria, A
Baumann, A
Munshi, R
Walker, M
Whelan, CW
Huang, Y
Brookings, T
Sharpe, T
Stone, MR
Valkanas, E
Fu, J
Tiao, G
Laricchia, KM
Ruano-Rubio, V
Stevens, C
Gupta, N
Cusick, C
Margolin, L
Genome Aggregation Database Production Team
Genome Aggregation Database Consortium
Taylor, KD
Lin, HJ
Rich, SS
Post, WS
Chen, Y-DI
Rotter, JI
Nusbaum, C
Philippakis, A
Lander, E
Gabriel, S
Neale, BM
Kathiresan, S
Daly, MJ
Banks, E
MacArthur, DG
Talkowski, ME
Item Type: Journal Article
Abstract: Structural variants (SVs) rearrange large segments of DNA1 and can have profound consequences in evolution and human disease2,3. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD)4 have become integral in the interpretation of single-nucleotide variants (SNVs)5. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage6. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings7. This SV resource is freely distributed via the gnomAD browser8 and will have broad utility in population genetics, disease-association studies, and diagnostic screening.
Issue Date: May-2020
Date of Acceptance: 31-Mar-2020
URI: http://hdl.handle.net/10044/1/80020
DOI: 10.1038/s41586-020-2287-8
ISSN: 0028-0836
Publisher: Nature Research
Start Page: 444
End Page: 451
Journal / Book Title: Nature
Volume: 581
Issue: 7809
Copyright Statement: © The Author(s) 2020. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Keywords: Genome Aggregation Database Production Team
Genome Aggregation Database Consortium
General Science & Technology
Publication Status: Published
Conference Place: England
Open Access location: https://www.nature.com/articles/s41586-020-2287-8
Online Publication Date: 2020-05-27
Appears in Collections:National Heart and Lung Institute
Institute of Clinical Sciences