Comparison of the sequencing bias of currently available library preparation kits for Illumina sequencing of bacterial genomes and metagenomes

Mitsuhiko P. Sato, Yoshitoshi Ogura, Keiji Nakamura, Ruriko Nishida, Yasuhiro Gotoh, Masahiro Hayashi, Junzo Hisatsune, Motoyuki Sugai, Takehiko Itoh, Tetsuya Hayashi

Research output: Contribution to journal › Article

Abstract

In bacterial genome and metagenome sequencing, Illumina sequencers are most frequently used due to their high throughput capacity, and multiple library preparation kits have been developed for Illumina platforms. Here, we systematically analysed and compared the sequencing bias generated by currently available library preparation kits for Illumina sequencing. Our analyses revealed that a strong sequencing bias is introduced in low-GC regions by the Nextera XT kit. The level of bias introduced is dependent on the level of GC content; stronger bias is generated as the GC content decreases. Other analysed kits did not introduce this strong sequencing bias. The GC content-associated sequencing bias introduced by Nextera XT was more remarkable in metagenome sequencing of a mock bacterial community and seriously affected estimation of the relative abundance of low-GC species. The results of our analyses highlight the importance of selecting proper library preparation kits according to the purposes and targets of sequencing, particularly in metagenome sequencing, where a wide range of microbial species with various degrees of GC content is present. Our data also indicate that special attention should be paid to which library preparation kit was used when analysing and interpreting publicly available metagenomic data.

Original language: English
Pages (from-to): 391-398
Number of pages: 8
Journal: DNA research : an international journal for rapid publication of reports on genes and genomes
Volume: 26
Issue number: 5
DOI: 10.1093/dnares/dsz017
Publication status: Published - Oct 1 2019

Fingerprint

Metagenome
Bacterial Genomes
Base Composition
Libraries
Metagenomics

All Science Journal Classification (ASJC) codes

  • Molecular Biology
  • Genetics

Cite this

Comparison of the sequencing bias of currently available library preparation kits for Illumina sequencing of bacterial genomes and metagenomes. / Sato, Mitsuhiko P.; Ogura, Yoshitoshi; Nakamura, Keiji; Nishida, Ruriko; Gotoh, Yasuhiro; Hayashi, Masahiro; Hisatsune, Junzo; Sugai, Motoyuki; Itoh, Takehiko; Hayashi, Tetsuya.

In: DNA research : an international journal for rapid publication of reports on genes and genomes, Vol. 26, No. 5, 01.10.2019, p. 391-398.


@article{19a3bdcda8af47268da8d233e7d93f99,
title = "Comparison of the sequencing bias of currently available library preparation kits for Illumina sequencing of bacterial genomes and metagenomes",
abstract = "In bacterial genome and metagenome sequencing, Illumina sequencers are most frequently used due to their high throughput capacity, and multiple library preparation kits have been developed for Illumina platforms. Here, we systematically analysed and compared the sequencing bias generated by currently available library preparation kits for Illumina sequencing. Our analyses revealed that a strong sequencing bias is introduced in low-GC regions by the Nextera XT kit. The level of bias introduced is dependent on the level of GC content; stronger bias is generated as the GC content decreases. Other analysed kits did not introduce this strong sequencing bias. The GC content-associated sequencing bias introduced by Nextera XT was more remarkable in metagenome sequencing of a mock bacterial community and seriously affected estimation of the relative abundance of low-GC species. The results of our analyses highlight the importance of selecting proper library preparation kits according to the purposes and targets of sequencing, particularly in metagenome sequencing, where a wide range of microbial species with various degrees of GC content is present. Our data also indicate that special attention should be paid to which library preparation kit was used when analysing and interpreting publicly available metagenomic data.",
author = "Sato, {Mitsuhiko P.} and Yoshitoshi Ogura and Keiji Nakamura and Ruriko Nishida and Yasuhiro Gotoh and Masahiro Hayashi and Junzo Hisatsune and Motoyuki Sugai and Takehiko Itoh and Tetsuya Hayashi",
year = "2019",
month = "10",
day = "1",
doi = "10.1093/dnares/dsz017",
language = "English",
volume = "26",
pages = "391--398",
journal = "DNA Research",
issn = "1340-2838",
publisher = "Oxford University Press",
number = "5",

}

TY - JOUR

T1 - Comparison of the sequencing bias of currently available library preparation kits for Illumina sequencing of bacterial genomes and metagenomes

AU - Sato, Mitsuhiko P.

AU - Ogura, Yoshitoshi

AU - Nakamura, Keiji

AU - Nishida, Ruriko

AU - Gotoh, Yasuhiro

AU - Hayashi, Masahiro

AU - Hisatsune, Junzo

AU - Sugai, Motoyuki

AU - Itoh, Takehiko

AU - Hayashi, Tetsuya

PY - 2019/10/1

Y1 - 2019/10/1

N2 - In bacterial genome and metagenome sequencing, Illumina sequencers are most frequently used due to their high throughput capacity, and multiple library preparation kits have been developed for Illumina platforms. Here, we systematically analysed and compared the sequencing bias generated by currently available library preparation kits for Illumina sequencing. Our analyses revealed that a strong sequencing bias is introduced in low-GC regions by the Nextera XT kit. The level of bias introduced is dependent on the level of GC content; stronger bias is generated as the GC content decreases. Other analysed kits did not introduce this strong sequencing bias. The GC content-associated sequencing bias introduced by Nextera XT was more remarkable in metagenome sequencing of a mock bacterial community and seriously affected estimation of the relative abundance of low-GC species. The results of our analyses highlight the importance of selecting proper library preparation kits according to the purposes and targets of sequencing, particularly in metagenome sequencing, where a wide range of microbial species with various degrees of GC content is present. Our data also indicate that special attention should be paid to which library preparation kit was used when analysing and interpreting publicly available metagenomic data.

UR - http://www.scopus.com/inward/record.url?scp=85073578548&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073578548&partnerID=8YFLogxK

U2 - 10.1093/dnares/dsz017

DO - 10.1093/dnares/dsz017

M3 - Article

C2 - 31364694

AN - SCOPUS:85073578548

VL - 26

SP - 391

EP - 398

JO - DNA Research

JF - DNA Research

SN - 1340-2838

IS - 5

ER -