Phylogenetic Detection of Recombination with a Bayesian Prior on the Distance between Trees
Author(s)
Martins, LDEO
Leal, E
Kishino, H
Type
Journal Article
Abstract
Genomic regions participating in recombination events may support distinct topologies, and phylogenetic analyses should incorporate this heterogeneity. Existing phylogenetic methods for recombination detection are challenged by the enormous number of possible topologies, even for a moderate number of taxa. If, however, the detection analysis is conducted independently between each putative recombinant sequence and a set of reference parentals, potential recombinations between the recombinants are neglected. In this context, a recombination hotspot can be inferred in phylogenetic analyses if we observe several consecutive breakpoints. We developed a distance measure between unrooted topologies that closely resembles the number of recombinations. By introducing a prior distribution on these recombination distances, a Bayesian hierarchical model was devised to detect phylogenetic inconsistencies occurring due to recombinations. This model relaxes the assumption of known parental sequences, still common in HIV analysis, allowing the entire dataset to be analyzed at once. On simulated datasets with up to 16 taxa, our method correctly detected recombination breakpoints and the number of recombination events for each breakpoint. The procedure is robust to rate and transitionratiotransversion heterogeneities for simulations with and without recombination. This recombination distance is related to recombination hotspots. Applying this procedure to a genomic HIV-1 dataset, we found evidence for hotspots and de novo recombination.
Date Issued
2008-07-09
Date Acceptance
2008-06-07
Citation
PLOS One, 2008, 3 (7)
ISSN
1932-6203
Publisher
Public Library of Science
Journal / Book Title
PLOS One
Volume
3
Issue
7
Copyright Statement
© 2008 Martins et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
License URL
Subjects
Bayes Theorem
Genetic Heterogeneity
HIV-1
Phylogeny
Recombination, Genetic
General Science & Technology
MD Multidisciplinary
Publication Status
Published
Article Number
e2651