Molecular Evolution of Hepatitis B Virus over 25 Years

Abstract
Determining the longitudinal molecular evolution of hepatitis B virus (HBV) is difficult due to HBV's genomic complexity and the need to study paired samples collected over long periods of time. In this study, serial samples were collected from eight hepatitis B virus e antigen-negative asymptomatic carriers of HBV genotype B in 1979 and 2004, thus providing a 25-year period to document the long-term molecular evolution of HBV. The rate, nature, and distribution of mutations that emerged over 25 years were determined by phylogenetic and linear regression analysis of full-length HBV genome sequences. Nucleotide hypervariability was observed within the polymerase and pre-S/S overlap region and within the core gene. The calculated mean number of nucleotide substitutions/site/year (7.9 × 10 −5 ) was slightly higher than published estimates (1.5 × 10 −5 to 5 × 10 −5 ). Nucleotide changes in the quasispecies population did not significantly alter the molecular evolutionary rate, based on linear regression analysis of evolutionary distances among serial clone pre-S region sequences. Therefore, the directly amplified or dominant sequence was sufficient to estimate the putative molecular evolutionary rate for these long-term serial samples. On average, the ratio of synonymous ( d S ) to nonsynonymous ( d N ) substitutions was highest for the polymerase-coding region and lowest for the core-coding region. The low d S / d N ratios observed within the core suggest that selection occurs within this gene region, possibly as an immune evasion strategy. The results of this study suggest that HBV sequence divergence may occur more rapidly than previously estimated, in a host immune phase-dependent manner.