Copy number variation arising from gene conversion on the human Y chromosome
File(s)2018 shi massaia louzada gc cnvs.pdf (4.63 MB)
Published version
Author(s)
Type
Journal Article
Abstract
We describe the variation in copy number of a ~ 10 kb region overlapping the long intergenic noncoding RNA (lincRNA)
gene, TTTY22, within the IR3 inverted repeat on the short arm of the human Y chromosome, leading to individuals with
0–3 copies of this region in the general population. Variation of this CNV is common, with 266 individuals having 0 copies,
943 (including the reference sequence) having 1, 23 having 2 copies, and two having 3 copies, and was validated by breakpoint
PCR, fbre-FISH, and 10× Genomics Chromium linked-read sequencing in subsets of 1234 individuals from the 1000
Genomes Project. Mapping the changes in copy number to the phylogeny of these Y chromosomes previously established
by the Project identifed at least 20 mutational events, and investigation of fanking paralogous sequence variants showed
that the mutations involved fanking sequences in 18 of these, and could extend over > 30 kb of DNA. While either gene
conversion or double crossover between misaligned sister chromatids could formally explain the 0–2 copy events, gene
conversion is the more likely mechanism, and these events include the longest non-allelic gene conversion reported thus far.
Chromosomes with three copies of this CNV have arisen just once in our data set via another mechanism: duplication of
420 kb that places the third copy 230 kb proximal to the existing proximal copy. Our results establish gene conversion as a
previously under-appreciated mechanism of generating copy number changes in humans and reveal the exceptionally large
size of the conversion events that can occur.
gene, TTTY22, within the IR3 inverted repeat on the short arm of the human Y chromosome, leading to individuals with
0–3 copies of this region in the general population. Variation of this CNV is common, with 266 individuals having 0 copies,
943 (including the reference sequence) having 1, 23 having 2 copies, and two having 3 copies, and was validated by breakpoint
PCR, fbre-FISH, and 10× Genomics Chromium linked-read sequencing in subsets of 1234 individuals from the 1000
Genomes Project. Mapping the changes in copy number to the phylogeny of these Y chromosomes previously established
by the Project identifed at least 20 mutational events, and investigation of fanking paralogous sequence variants showed
that the mutations involved fanking sequences in 18 of these, and could extend over > 30 kb of DNA. While either gene
conversion or double crossover between misaligned sister chromatids could formally explain the 0–2 copy events, gene
conversion is the more likely mechanism, and these events include the longest non-allelic gene conversion reported thus far.
Chromosomes with three copies of this CNV have arisen just once in our data set via another mechanism: duplication of
420 kb that places the third copy 230 kb proximal to the existing proximal copy. Our results establish gene conversion as a
previously under-appreciated mechanism of generating copy number changes in humans and reveal the exceptionally large
size of the conversion events that can occur.
Date Issued
2017-12-05
Date Acceptance
2017-11-28
Citation
Human Genetics, 2017, 137 (1), pp.73-83
ISSN
0340-6717
Publisher
Springer Verlag
Start Page
73
End Page
83
Journal / Book Title
Human Genetics
Volume
137
Issue
1
Copyright Statement
© The Author(s) 2017. This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
License URL
Identifier
http://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000423568900008&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=1ba7043ffcc86c417c072aa74d649202
Subjects
Science & Technology
Life Sciences & Biomedicine
Genetics & Heredity
MUTATION-RATES
SEQUENCING ERA
MECHANISMS
RECOMBINATION
PALINDROMES
GENOMES
Publication Status
Published