TY - JOUR
T1 - Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample
AU - Caporaso, J. Gregory
AU - Lauber, Christian L.
AU - Walters, William A.
AU - Berg-Lyons, Donna
AU - Lozupone, Catherine A.
AU - Turnbaugh, Peter J.
AU - Fierer, Noah
AU - Knight, Rob
PY - 2011/3/15
Y1 - 2011/3/15
N2 - The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known "mock communities" at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of meta-analysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.
AB - The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known "mock communities" at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of meta-analysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.
KW - Human microbiome
KW - Microbial community analysis
KW - Microbial ecology
KW - Next-generation sequencing
UR - http://www.scopus.com/inward/record.url?scp=79952005915&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79952005915&partnerID=8YFLogxK
U2 - 10.1073/pnas.1000080107
DO - 10.1073/pnas.1000080107
M3 - Article
C2 - 20534432
AN - SCOPUS:79952005915
SN - 0027-8424
VL - 108
SP - 4516
EP - 4522
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - SUPPL. 1
ER -