55
IRUS Total
Downloads
  Altmetric

Efficient genotype compression and analysis of large genetic-variation data sets

File Description SizeFormat 
Efficient genotype compression and analysis of large genetic-variation data sets.pdfAccepted version560.97 kBAdobe PDFView/Open
Title: Efficient genotype compression and analysis of large genetic-variation data sets
Authors: Layer, RM
Kindlon, N
Karczewski, KJ
Quinlan, AR
Item Type: Journal Article
Abstract: Genotype Query Tools (GQT) is an indexing strategy that expedites analyses of genome-variation data sets in Variant Call Format based on sample genotypes, phenotypes and relationships. GQT's compressed genotype index minimizes decompression for analysis, and its performance relative to that of existing methods improves with cohort size. We show substantial (up to 443-fold) gains in performance over existing methods and demonstrate GQT's utility for exploring massive data sets involving thousands to millions of genomes. GQT can be accessed at https://github.com/ryanlayer/gqt.
Issue Date: 9-Nov-2015
Date of Acceptance: 7-Oct-2015
URI: http://hdl.handle.net/10044/1/53390
DOI: https://dx.doi.org/10.1038/NMETH.3654
ISSN: 1548-7091
Publisher: Nature Publishing Group
Start Page: 63
End Page: 65
Journal / Book Title: Nature Methods
Volume: 13
Issue: 1
Copyright Statement: Copyright © 2015, Rights Managed by Nature Publishing Group
Keywords: Science & Technology
Life Sciences & Biomedicine
Biochemical Research Methods
Biochemistry & Molecular Biology
FEATURES
Datasets as Topic
Genetic Variation
Genotype
Exome Aggregation Consortium
06 Biological Sciences
10 Technology
11 Medical And Health Sciences
Developmental Biology
Publication Status: Published
Appears in Collections:Institute of Clinical Sciences