Genomic Epidemiology of SARS-CoV-2 in Madrid, Spain, during the First Wave of the Pandemic: Fast Spread and Early Dominance by D614G Variants

Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) was first detected in Madrid, Spain, on 25 February 2020. It increased in frequency very fast and by the end of May more than 70,000 cases had been confirmed by reverse transcription-polymerase chain reaction (RT-PCR). To study the lineages and the diversity of the viral population during this first epidemic wave in Madrid we sequenced 224 SARS-CoV-2 viral genomes collected from three hospitals from February to May 2020. All the known major lineages were found in this set of samples, though B.1 and B.1.5 were the most frequent ones, accounting for more than 60% of the sequences. In parallel with the B lineages and sublineages, the D614G mutation in the Spike protein sequence was detected soon after the detection of the first coronavirus disease 19 (COVID-19) case in Madrid and in two weeks became dominant, being found in 80% of the samples and remaining at this level during all the study periods. The lineage composition of the viral population found in Madrid was more similar to the European population than to the publicly available Spanish data, underlining the role of Madrid as a national and international transport hub. In agreement with this, phylodynamic analysis suggested multiple independent entries before the national lockdown and air transportation restrictions.