Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data

Research output: Contribution to journalArticlepeer-review

Abstract

We describe an approach for genotyping bacterial strains from low coverage genome datasets, including metagenomic data from complex samples. Sequence reads from unknown samples are aligned to a reference genome where the allele states of known SNPs are determined. The Whole Genome Focused Array SNP Typing (WG-FAST) pipeline can identify unknown strains with much less read data than is needed for genome assembly. To test WG-FAST, we resampled SNPs from real samples to understand the relationship between low coverage metagenomic data and accurate phylogenetic placement. WG-FAST can be downloaded from https://github.com/jasonsahl/wgfast.

Original languageEnglish (US)
Article number52
JournalGenome Medicine
Volume7
Issue number1
DOIs
StatePublished - Jun 9 2015

ASJC Scopus subject areas

  • Molecular Medicine
  • Molecular Biology
  • Genetics
  • Genetics(clinical)

Fingerprint

Dive into the research topics of 'Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data'. Together they form a unique fingerprint.

Cite this