TY - JOUR
T1 - Platanus-B
T2 - An accurate de novo assembler for bacterial genomes using an iterative error-removal process
AU - Kajitani, Rei
AU - Yoshimura, Dai
AU - Ogura, Yoshitoshi
AU - Gotoh, Yasuhiro
AU - Hayashi, Tetsuya
AU - Itoh, Takehiko
N1 - Publisher Copyright:
© 2020 The Author(s) 2020. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
PY - 2020/6/1
Y1 - 2020/6/1
N2 - De novo assembly of short DNA reads remains an essential technology, especially for large-scale projects and high-resolution variant analyses in epidemiology. However, the existing tools often lack sufficient accuracy required to compare closely related strains. To facilitate such studies on bacterial genomes, we developed Platanus-B, a de novo assembler that employs iterations of multiple error-removal algorithms. The benchmarks demonstrated the superior accuracy and high contiguity of Platanus-B, in addition to its ability to enhance the hybrid assembly of both short and nanopore long reads. Although the hybrid strategies for short and long reads were effective in achieving near full-length genomes, we found that short-read-only assemblies generated with Platanus-B were sufficient to obtain ≥90% of exact coding sequences in most cases. In addition, while nanopore long-read-only assemblies lacked fine-scale accuracies, inclusion of short reads was effective in improving the accuracies. Platanus-B can, therefore, be used for comprehensive genomic surveillances of bacterial pathogens and high-resolution phylogenomic analyses of a wide range of bacteria.
AB - De novo assembly of short DNA reads remains an essential technology, especially for large-scale projects and high-resolution variant analyses in epidemiology. However, the existing tools often lack sufficient accuracy required to compare closely related strains. To facilitate such studies on bacterial genomes, we developed Platanus-B, a de novo assembler that employs iterations of multiple error-removal algorithms. The benchmarks demonstrated the superior accuracy and high contiguity of Platanus-B, in addition to its ability to enhance the hybrid assembly of both short and nanopore long reads. Although the hybrid strategies for short and long reads were effective in achieving near full-length genomes, we found that short-read-only assemblies generated with Platanus-B were sufficient to obtain ≥90% of exact coding sequences in most cases. In addition, while nanopore long-read-only assemblies lacked fine-scale accuracies, inclusion of short reads was effective in improving the accuracies. Platanus-B can, therefore, be used for comprehensive genomic surveillances of bacterial pathogens and high-resolution phylogenomic analyses of a wide range of bacteria.
UR - http://www.scopus.com/inward/record.url?scp=85089712832&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089712832&partnerID=8YFLogxK
U2 - 10.1093/dnares/dsaa014
DO - 10.1093/dnares/dsaa014
M3 - Article
C2 - 32658266
AN - SCOPUS:85089712832
SN - 1340-2838
VL - 27
JO - DNA Research
JF - DNA Research
IS - 3
M1 - dsaa014
ER -