A phylogenetic method to perform genome-wide association studies in microbes that accounts for population structure and recombination
File(s)
journal.pcbi.1005958.pdf (3.38 MB)
In Press version
Author(s)
Collins, Caitlin
Didelot, X
Type
Journal Article
Abstract
Genome-Wide Association Studies (GWAS) in microbial organisms have the potential to vastly improve the way we understand, manage, and treat infectious diseases. Yet, microbial GWAS methods established thus far remain insufficiently able to capitalise on the growing wealth of bacterial and viral genetic sequence data. Facing clonal population structure and homologous recombination, existing GWAS methods struggle to achieve both the precision necessary to reject spurious findings and the power required to detect associations in microbes. In this paper, we introduce a novel phylogenetic approach that has been tailor-made for microbial GWAS, which is applicable to organisms ranging from purely clonal to frequently recombining, and to both binary and continuous phenotypes. Our approach is robust to the confounding effects of both population structure and recombination, while maintaining high statistical power to detect associations. Thorough testing via application to simulated data provides strong support for the power and specificity of our approach and demonstrates the advantages offered over alternative cluster-based and dimension-reduction methods. Two applications to Neisseria meningitidis illustrate the versatility and potential of our method, confirming previously-identified penicillin resistance loci and resulting in the identification of both well-characterised and novel drivers of invasive disease. Our method is implemented as an open-source R package called treeWAS which is freely available at https://github.com/caitiecollins/treeWAS.
Date Issued
2018-02-05
Date Acceptance
2018-01-01
Citation
PLoS Computational Biology, 2018, 14 (2)
ISSN
1553-734X
Publisher
Public Library of Science (PLoS)
Journal / Book Title
PLoS Computational Biology
Volume
14
Issue
2
Sponsor
Medical Research Council (MRC)
National Institute for Health Research
Biotechnology and Biological Sciences Research Council (BBSRC)
Medical Research Council (MRC)
Grant Number
MR/K010174/1B
HPRU-2012-10080
BB/L023458/1
N/A
Subjects
06 Biological Sciences
08 Information And Computing Sciences
01 Mathematical Sciences
Bioinformatics
Publication Status
Published
Article Number
e1005958