Structure of the Expression Site Reveals Global Diversity in MSP2 (P44) Variants in Anaplasma phagocytophilum

Abstract
Anaplasma phagocytophilum , a recently reclassified bacteria in the order Rickettsiales , infects many different animal species and causes an emerging tick-borne disease of humans. The genome contains a large number of related genes and gene fragments encoding partial or apparently full-length outer membrane protein MSP2 (P44). Previous data using strains isolated from humans in the United States suggest that antigenic diversity results from RecF-mediated conversion of a single MSP2 (P44) expression site by partially homologous donor sequences. However, whether similar mechanisms operate in naturally infected animal species and the extent of global diversity in MSP2 (P44) are unknown. We analyzed the structure and diversity of the MSP2 (P44) expression site in strains derived from the United States and Europe and from infections of different animal species, including wildlife reservoirs. The results show that a syntenic expression site is present in all strains of A. phagocytophilum investigated. This genomic locus contained diverse MSP2 (P44) variants in all infected animals sampled, and variants also differed at different time points during infection. Although similar variants were found among different populations of U.S. origin, there was little sequence identity between U.S. strain variants (including genomic copies from a completely sequenced U.S. strain) and expression site variants infecting sheep and dogs in Norway and Sweden. Finally, the possibility that combinatorial mechanisms can generate additional diversity beyond the basic donor sequence repertoire is supported by the observation of shared sequence blocks throughout the MSP2 (P44) hypervariable region in reservoir hosts. These data suggest similar genetic mechanisms for A. phagocytophilum variation in all hosts but worldwide diversity of the MSP2 (P44) outer membrane protein.