Similar genomic patterns of clinical infective endocarditis and oral isolates of Streptococcus sanguinis and Streptococcus gordonii
Open Access
- 17 February 2020
- journal article
- research article
- Published by Springer Science and Business Media LLC in Scientific Reports
- Vol. 10 (1), 1-11
- https://doi.org/10.1038/s41598-020-59549-4
Abstract
Streptococcus gordonii and Streptococcus sanguinis belong to the Mitis group streptococci, which mostly are commensals in the human oral cavity. Though they are oral commensals, they can escape their niche and cause infective endocarditis, a severe infection with high mortality. Several virulence factors important for the development of infective endocarditis have been described in these two species. However, the background for how the commensal bacteria, in some cases, become pathogenic is still not known. To gain a greater understanding of the mechanisms of the pathogenic potential, we performed a comparative analysis of 38 blood culture strains, S. sanguinis (n = 20) and S. gordonii (n = 18) from patients with verified infective endocarditis, along with 21 publicly available oral isolates from healthy individuals, S. sanguinis (n = 12) and S. gordonii (n = 9). Using whole genome sequencing data of the 59 streptococci genomes, functional profiles were constructed, using protein domain predictions based on the translated genes. These functional profiles were used for clustering, phylogenetics and machine learning. A clear separation could be made between the two species. No clear differences between oral isolates and clinical infective endocarditis isolates were found in any of the 675 translated core-genes. Additionally, random forest-based machine learning and clustering of the pan-genome data as well as amino acid variations in the core-genome could not separate the clinical and oral isolates. A total of 151 different virulence genes was identified in the 59 genomes. Among these homologs of genes important for adhesion and evasion of the immune system were found in all of the strains. Based on the functional profiles and virulence gene content of the genomes, we believe that all analysed strains had the ability to become pathogenic.Funding Information
- Hjerteforeningen (15-R99-A6040-22951)
- Novo Nordisk Fonden (NNF14CC0001, NNF14CC0001)
This publication has 84 references indexed in Scilit:
- CD-HIT: accelerated for clustering the next-generation sequencing dataBioinformatics, 2012
- SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell SequencingJournal of Computational Biology, 2012
- VFDB 2012 update: toward the genetic diversity and molecular evolution of bacterial virulence factorsNucleic Acids Research, 2011
- Assemblathon 1: A competitive assessment of de novo short read assembly methodsGenome Research, 2011
- New Algorithms and Methods to Estimate Maximum-Likelihood Phylogenies: Assessing the Performance of PhyML 3.0Systematic Biology, 2010
- FIGfams: yet another set of protein familiesNucleic Acids Research, 2009
- Clinical Presentation, Etiology, and Outcome of Infective Endocarditis in the 21st CenturyJAMA Internal Medicine, 2009
- SUPERFAMILY—sophisticated comparative genomics, data mining, visualization and phylogenyNucleic Acids Research, 2008
- VFDB 2008 release: an enhanced web-based resource for comparative pathogenomicsNucleic Acids Research, 2007
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004