Role of APOBEC3 in Genetic Diversity among Endogenous Murine Leukemia Viruses

Abstract
The ability of human and murine APOBECs (specifically, APOBEC3) to inhibit infecting retroviruses and retrotransposition of some mobile elements is becoming established. Less clear is the effect that they have had on the establishment of the endogenous proviruses resident in the human and mouse genomes. We used the mouse genome sequence to study diversity and genetic traits of nonecotropic murine leukemia viruses (polytropic [Pmv], modified polytropic [Mpmv], and xenotropic [Xmv] subgroups), the best-characterized large set of recently integrated proviruses. We identified 49 proviruses. In phylogenetic analyses, Pmvs and Mpmvs were monophyletic, whereas Xmvs were divided into several clades, implying a greater number of replication cycles between the integration events. Four distinct primer binding site types (Pro, Gln1, Gln2 and Thr) were dispersed within the phylogeny, indicating frequent mispriming. We analyzed the frequency and context of G-to-A mutations for the role of mA3 in formation of these proviruses. In the Pmv and Mpmv (but not Xmv) groups, mutations attributable to mA3 constituted a large fraction of the total. A significant number of nonsense mutations suggests the absence of purifying selection following mutation. A strong bias of G-to-A relative to C-to-T changes was seen, implying a strand specificity that can only have occurred prior to integration. The optimal sequence context of G-to-A mutations, TTC, was consistent with mA3. At least in the Pmv group, a significant 5′ to 3′ gradient of G-to-A mutations was consistent with mA3 editing. Altogether, our results for the first time suggest mA3 editing immediately preceding the integration event that led to retroviral endogenization, contributing to inactivation of infectivity. Vertebrate genomes are littered with remnants from earlier retroviral infections, in the form of endogenous retroviruses (ERVs). Cellular host defenses against retroviruses, including the APOBEC3 family of cytidine deaminases, have been described previously. APOBEC3 proteins have been shown to edit some retroviruses and other retrotransposing elements during their replication by deamination of C to U during negative-strand synthesis, resulting in G-to-A mutations in the sense strand. Here, we studied the possible effects that the APOBEC-protein family might have had in the establishing ERVs. We identified 49 endogenous (nonecotropic) murine leukemia viruses, divided into three groups; polytropic, modified polytropic, and xenotropic, in the sequenced C57BL/6J mouse genome. We analyzed genetic variation within and among subgroups and found mutation patterns consistent with APOBEC3 editing of Pmv and Mpmv, but not Xmv proviruses. Evidence such as (i) significantly higher G-to-A mutation frequencies compared to controls and large fractions leading to inactivating stop mutations, (ii) optimal sequence contexts surrounding the mutation positions, and (iii) editing gradient following the time course of retroviral replication, implicate APOBEC3 as a factor contributing to inactivation of these ERVs in the mouse genome.