32
IRUS TotalDownloads
Altmetric
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction
File | Description | Size | Format | |
---|---|---|---|---|
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction.pdf | Published version | 2.26 MB | Adobe PDF | View/Open |
Title: | A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction |
Authors: | De Oliveira Martins, L Mallo, D Posada, D |
Item Type: | Journal Article |
Abstract: | Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models. |
Issue Date: | 3-Oct-2014 |
Date of Acceptance: | 30-Sep-2014 |
URI: | http://hdl.handle.net/10044/1/41404 |
DOI: | http://dx.doi.org/10.1093/sysbio/syu082 |
ISSN: | 1076-836X |
Publisher: | Oxford University Press |
Start Page: | 397 |
End Page: | 416 |
Journal / Book Title: | Systematic Biology |
Volume: | 65 |
Issue: | 3 |
Copyright Statement: | This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
Keywords: | hierarchical Bayesian model phylogenomics reconciliation supertree tree distance Evolutionary Biology 0603 Evolutionary Biology 0604 Genetics |
Publication Status: | Published |
Appears in Collections: | Faculty of Engineering |