55
IRUS TotalDownloads
Altmetric
Efficient genotype compression and analysis of large genetic-variation data sets
File | Description | Size | Format | |
---|---|---|---|---|
Efficient genotype compression and analysis of large genetic-variation data sets.pdf | Accepted version | 560.97 kB | Adobe PDF | View/Open |
Title: | Efficient genotype compression and analysis of large genetic-variation data sets |
Authors: | Layer, RM Kindlon, N Karczewski, KJ Quinlan, AR |
Item Type: | Journal Article |
Abstract: | Genotype Query Tools (GQT) is an indexing strategy that expedites analyses of genome-variation data sets in Variant Call Format based on sample genotypes, phenotypes and relationships. GQT's compressed genotype index minimizes decompression for analysis, and its performance relative to that of existing methods improves with cohort size. We show substantial (up to 443-fold) gains in performance over existing methods and demonstrate GQT's utility for exploring massive data sets involving thousands to millions of genomes. GQT can be accessed at https://github.com/ryanlayer/gqt. |
Issue Date: | 9-Nov-2015 |
Date of Acceptance: | 7-Oct-2015 |
URI: | http://hdl.handle.net/10044/1/53390 |
DOI: | https://dx.doi.org/10.1038/NMETH.3654 |
ISSN: | 1548-7091 |
Publisher: | Nature Publishing Group |
Start Page: | 63 |
End Page: | 65 |
Journal / Book Title: | Nature Methods |
Volume: | 13 |
Issue: | 1 |
Copyright Statement: | Copyright © 2015, Rights Managed by Nature Publishing Group |
Keywords: | Science & Technology Life Sciences & Biomedicine Biochemical Research Methods Biochemistry & Molecular Biology FEATURES Datasets as Topic Genetic Variation Genotype Exome Aggregation Consortium 06 Biological Sciences 10 Technology 11 Medical And Health Sciences Developmental Biology |
Publication Status: | Published |
Appears in Collections: | Institute of Clinical Sciences |