The Neighborhood of the Spike Gene Is a Hotspot for Modular Intertypic Homologous and
Nonhomologous Recombination in Coronavirus Genomes
Coronaviruses (CoVs) have very large RNA viral genomes with a distinct genomic architecture
of core and accessory open reading frames (ORFs). It is of utmost importance to understand
their patterns and limits of homologous and nonhomologous recombination, because such events
may affect the emergence of novel CoV strains, alter their host range, infection rate,
tissue tropism pathogenicity, and their ability to escape vaccination programs. Intratypic
recombination among closely related CoVs of the same subgenus has often been reported;
however, the patterns and limits of genomic exchange between more distantly related CoV
lineages (intertypic recombination) need further investigation. Here, we report
computational/evolutionary analyses that clearly demonstrate a substantial ability for CoVs
of different subgenera to recombine. Furthermore, we show that CoVs can obtain—through
nonhomologous recombination—accessory ORFs from core ORFs, exchange accessory ORFs with
different CoV genera, with other viruses (i.e., toroviruses, influenza C/D, reoviruses,
rotaviruses, astroviruses) and even with hosts. Intriguingly, most of these radical events
result from double crossovers surrounding the Spike ORF, thus highlighting both the
instability and mobile nature of this genomic region. Although many such events have often
occurred during the evolution of various CoVs, the genomic architecture of the relatively
young SARS-CoV/SARS-CoV-2 lineage so far appears to be stable.
Major findings:
Core ORFs undergo homologous recombination at the species, subgenus and genus levels.
CoVs can obtain AOFs through non
homologous recombination, even from other viruses or
hosts.
Recombination events are mostly localized at the Spike neighborhood.
Comparative Analysis of SARS-CoV-2 Variants of Concern, Including Omicron, Highlights Their
Common and Distinctive Amino Acid Substitution Patterns, Especially at the Spike ORF
In order to gain a deeper understanding of the recently emerged and highly divergent Omicron
variant of concern (VoC), a study of amino acid substitution (AAS) patterns was performed
and compared with those of the other four successful variants of concern (Alpha, Beta,
Gamma, Delta) and one closely related variant of interest (VoI—Lambda). The Spike ORF
consistently emerges as an AAS hotspot in all six lineages, but in Omicron this enrichment
is significantly higher. The progenitors of each of these VoC/VoI lineages underwent
positive selection in the Spike ORF. However, once they were established, their Spike ORFs
have been undergoing purifying selection, despite the application of global vaccination
schemes from 2021 onwards. Our analyses reject the hypothesis that the heavily mutated
receptor binding domain (RBD) of the Omicron Spike was introduced via recombination from
another closely related Sarbecovirus. Thus, successive point mutations appear as the most
parsimonious scenario. Intriguingly, in each of the six lineages, we observed a significant
number of AAS wherein the new residue is not present at any homologous site among the other
known Sarbecoviruses. Such AAS should be further investigated as potential adaptations to
the human host. By studying the phylogenetic distribution of AAS shared between the six
lineages, we observed that the Omicron (BA.1) lineage had the highest number (8/10) of
recurrent mutations.
Major findings:
The Spike ORF consistently emerges as an AAS hotspot in all six lineages, but in Omicron
this enrichment is significantly higher.
The VoC/VoI lineage ancestors undergo positive selection, followed by purifying
selection after variant emergence.
Vaccination does not accelerate the accumulation of non-synonymous mutations at Spike.
Omicron recurrent mutations may be a result of inter-lineage recombination
(Recombination with other Sarbecovirus is rejected via CONSEL).
The Remarkable Evolutionary Plasticity of Coronaviruses by Mutation and Recombination:
Insights for the COVID-19 Pandemic and the Future Evolutionary Paths of SARS-CoV-2
Coronaviruses (CoVs) constitute a large and diverse subfamily of positive-sense
single-stranded RNA viruses. They are found in many mammals and birds and have great
importance for the health of humans and farm animals. The current SARS-CoV-2 pandemic, as
well as many previous epidemics in humans that were of zoonotic origin, highlights the
importance of studying the evolution of the entire CoV subfamily in order to understand how
novel strains emerge and which molecular processes affect their adaptation,
transmissibility, host/tissue tropism, and patho non-homologous genicity. In this review, we
focus on studies over the last two years that reveal the impact of point mutations,
insertions/deletions, and intratypic/intertypic homologous and non-homologous recombination
events on the evolution of CoVs. We discuss whether the next generations of CoV vaccines
should be directed against other CoV proteins in addition to or instead of spike. Based on
the observed patterns of molecular evolution for the entire subfamily, we discuss five
scenarios for the future evolutionary path of SARS-CoV-2 and the COVID-19 pandemic. Finally,
within this evolutionary context, we discuss the recently emerged Omicron (B.1.1.529) VoC.