Adaptation to Different Human Populations by HIV-1 Revealed by Codon-Based Analyses

Abstract
Several codon-based methods are available for detecting adaptive evolution in protein-coding sequences, but to date none specifically identify sites that are selected differentially in two populations, although such comparisons between populations have been historically useful in identifying the action of natural selection. We have developed two fixed effects maximum likelihood methods: one for identifying codon positions showing selection patterns that persist in a population and another for detecting whether selection is operating differentially on individual codons of a gene sampled from two different populations. Applying these methods to two HIV populations infecting genetically distinct human hosts, we have found that few of the positively selected amino acid sites persist in the population; the other changes are detected only at the tips of the phylogenetic tree and appear deleterious in the long term. Additionally, we have identified seven amino acid sites in protease and reverse transcriptase that are selected differentially in the two samples, demonstrating specific population-level adaptation of HIV to human populations. Despite the efforts devoted to surveying HIV genetic diversity and the development of an effective vaccine, there is still no consensus on the extent to which the former prejudices the latter. Experimental studies show that escape from cell-mediated immunity is selected for in HIV and SIV, and sometimes very quickly. Conversely, escape mutants may be selected against at transmission, so how much does this selection within individuals influence the genotype of the circulating HIV population overall? Kosakovsky Pond, Leigh Brown, and colleagues have developed a new statistical approach to address this question. Using sequences from the globally most abundant HIV subtype (subtype C), the authors directly compared virus of the same subtype infecting genetically different human populations. They show at least half of the amino acid sites selected within individuals are not selected at a population level, and they identify six amino acid sites in the RT gene that are selected differentially between populations. We can now partition molecular adaptation between individual and population components for whatever genes may be included in candidate vaccines, in the target populations themselves. The methods are also important beyond the HIV world, where analogous issues arise in the more general question of the evolution of virulence in pathogens.