k-SLAM: Accurate and ultra-fast taxonomic classification and gene identification for large metagenomic datasets
File(s)gkw1248.pdf (886.65 KB)
Published version
Author(s)
Ainsworth, D
Sternberg, MJE
Raczy, C
Butcher, SA
Type
Journal Article
Abstract
k-SLAM is a highly e cient algorithm for the characterisa-
tion of metagenomic data. Unlike other ultra-fast metage-
nomic classi ers, full sequence alignment is performed allow-
ing for gene identi cation and variant calling in addition to
accurate taxonomic classi cation. A
k
-mer based method
provides greater taxonomic accuracy than other classi ers
and a three orders of magnitude speed increase over align-
ment based approaches. The use of alignments to nd vari-
ants and genes along with their taxonomic origins enables
novel strains to be characterised. k-SLAM's speed allows a
full taxonomic classi cation and gene identi cation to be
tractable on modern large datasets. A pseudo-assembly
method is used to increase classi cation accuracy by up to
40% for species which have high sequence homology within
their genus.
tion of metagenomic data. Unlike other ultra-fast metage-
nomic classi ers, full sequence alignment is performed allow-
ing for gene identi cation and variant calling in addition to
accurate taxonomic classi cation. A
k
-mer based method
provides greater taxonomic accuracy than other classi ers
and a three orders of magnitude speed increase over align-
ment based approaches. The use of alignments to nd vari-
ants and genes along with their taxonomic origins enables
novel strains to be characterised. k-SLAM's speed allows a
full taxonomic classi cation and gene identi cation to be
tractable on modern large datasets. A pseudo-assembly
method is used to increase classi cation accuracy by up to
40% for species which have high sequence homology within
their genus.
Date Issued
2016-12-13
Date Acceptance
2016-11-28
Citation
Nucleic Acids Research, 2016, 45 (4), pp.1649-1656
ISSN
1362-4962
Publisher
Oxford University Press (OUP)
Start Page
1649
End Page
1656
Journal / Book Title
Nucleic Acids Research
Volume
45
Issue
4
Copyright Statement
© The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which
permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which
permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Sponsor
Medical Research Council (MRC)
Grant Number
MR/L01632X/1
Subjects
Science & Technology
Life Sciences & Biomedicine
Biochemistry & Molecular Biology
HUMAN MICROBIOME
GUT MICROBIOME
TOOL
Developmental Biology
05 Environmental Sciences
06 Biological Sciences
08 Information And Computing Sciences
Publication Status
Published