Whole-Genome Sequence Analysis Reveals the Enterovirus D68 Isolates during the United States 2014 Outbreak Mainly Belong to a Novel Clade

Abstract
In the late summer and the fall of 2014, the United States experienced an unprecedented outbreak of enterovirus D68 (EV-D68) infections. During the outbreak, we collected nasopharyngeal swab specimens from patients in the Lower Hudson Valley of New York. Here, we conduct a retrospective study on the genomic diversity of EV-D68 strains. We first employ a metagenomic shotgun sequencing protocol on a total of 93 clinical samples, including 21 negative controls, the results of which allow assembly of 20 EV-D68 genomes, six complete and 14 near-complete. We then investigate their genetic relationships, along with additional 20 EV-D68 strains having whole-genome sequences publicly available. Our comparative genomic analysis uncovers that the majority (26/29) of EV-D68 strains circulating in the 2014 outbreak differ significantly from prior ones, have a main feature of three variables, C1817T, C3277A and A4020G and belong to a new clade. C3277A causes amino acid substitution T860N in the protease 2Apro cleavage site between VP1 and 2A, whereas A4020G causes S1108G in a 3Cpro cleavage site between 2B and 2C. The two functional mutations may alter the proteases’ cleavage efficiency, leading to increased rate of viral replication and transmission. These provide insights into the evolution of epidemic EV-D68 strains.