Abundance, composition, diversity and novelty of soil Proteobacteria

Abstract
Small subunit (16S) rRNA gene surveys generating near full-length 16S rRNA clones offer a unique opportunity for in-depth phylogenetic analysis to highlight the breadth of diversity within various major bacterial phyla encountered in soil. This study offers a detailed phylogenetic analysis of the Proteobacteria-affiliated clones identified from 13 001 nearly full-length 16S rRNA gene clones derived from Oklahoma tall-grass prairie soil. Proteobacteria was the most abundant phylum in the community, and comprised 25% of the total clones. The most abundant and diverse class within the Proteobacteria was Alphaproteobacteria, followed by the Delta-, Beta- and Gammaproteobacteria. Members of the Epsilon- and Zetaproteobacteria were not detected in the dataset. Our analysis identified 15 novel order-level and 48 novel family-level Proteobacteria lineages. In addition, we show that the majority of Proteobacteria clones in the dataset belong to orders and families containing no described cultivated representatives (50% and 65%, respectively). An examination of the ecological distribution of the six most abundant Proteobacteria lineages in this dataset with no characterized pure culture representatives provided important information regarding their global distribution and environmental preferences. This level of novel phylogenetic diversity indicates that our understanding of the functions of soil microorganisms, even those belonging to phyla with numerous and diverse well-characterized cultured representatives such as the Proteobacteria, remains far from adequate.