Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled
File(s)
Author(s)
Brbic, M
Warnecke, T
Krisko, A
Supek, F
Type
Journal Article
Abstract
The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental
niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and
mayconferadvantagesattheDNA/RNAlevel.Theseinfluencesoverlapinprotein-codingsequences,makingitdifficulttogaugetheir
relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide
composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C
content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts.
However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little
cross-speciesvariabilityremainsunexplained(91%ofvarianceaccountedfor).Qualitativelysimilarresultswereobtainedfor49fungal
genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon
factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological
preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environmentrelated
AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in
overallAACappeartooccuralmostexclusivelythroughfactorsshapingtheglobaloligonucleotidecontentofthegenome.Wediscuss
these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that
adaptation takes place at the protein level.
niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and
mayconferadvantagesattheDNA/RNAlevel.Theseinfluencesoverlapinprotein-codingsequences,makingitdifficulttogaugetheir
relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide
composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C
content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts.
However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little
cross-speciesvariabilityremainsunexplained(91%ofvarianceaccountedfor).Qualitativelysimilarresultswereobtainedfor49fungal
genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon
factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological
preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environmentrelated
AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in
overallAACappeartooccuralmostexclusivelythroughfactorsshapingtheglobaloligonucleotidecontentofthegenome.Wediscuss
these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that
adaptation takes place at the protein level.
Date Issued
2015-06-01
Date Acceptance
2015-05-09
Citation
Genome Biology and Evolution, 2015, 7 (6), pp.1519-1532
ISSN
1759-6653
Publisher
Oxford University Press (OUP)
Start Page
1519
End Page
1532
Journal / Book Title
Genome Biology and Evolution
Volume
7
Issue
6
Copyright Statement
© The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits
non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits
non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
Subjects
Science & Technology
Life Sciences & Biomedicine
Evolutionary Biology
Genetics & Heredity
amino acid composition
oligonucleotide composition
intergenic DNA
ecological preferences
prokaryotic genome
fungal genome
support vector regression
AMINO-ACID-COMPOSITION
CYTOSINE CONTENT
GENE-FUNCTION
SIGNATURES
PROKARYOTES
PROTEINS
EVOLUTION
BACTERIA
HETEROGENEITY
TEMPERATURE
Publication Status
Published