Introduction
Cryo-electron microscopy (EM) is undergoing a revolution, enabling the study of viral pathogens in unprecedented detail. The asymmetric EM reconstruction of bacteriophage MS2 at medium resolution (8.7 Å) by Koning et al.1, and the subsequent reconstruction at even higher resolution (3.6 Å) by Dai et al.2 revealed the structures of both the protein shell and the asymmetric genomic RNA and the unique maturation protein (A). It is the start of a wave of such structural data for viruses, and calls for the development of new analytical tools to describe the results. One approach is Hamiltonian path analysis (HPA) that we introduced to describe repeated, sequence-specific contacts between the MS2 genome and its protein shell3. Here, we describe how HPA is consistent with the new structures and, in turn, how it extends our understanding beyond the structural data alone.
Koning et al.’s and Dai et al.’s reconstructions of MS2 reveal multiple contacts between genomic RNA and the viral capsid. These mimick the contacts seen by crystallography of virus-like particles carrying multiple copies of the high-affinity RNA packaging signal in this virus, the translational repressor (TR) that functions also as assembly initiation signal4. TR occurs only once in the MS2 genome, but insights into the roles played by RNA-coat protein (CP) contacts during assembly5 together with normal mode analysis6 and structural studies suggest that many different stem loops (SLs) in the viral genome should be able to bind CP and thus promote virion formation, i.e., act as RNA packaging signals (PSs). HPA has enabled us to identify such sites within the MS2 genome7, which is important, because it underlies the development of a new paradigm for ssRNA virus assembly based on multiple PSs, that seems to occur very widely in nature8,9,10,11.
HPA is a mathematical abstraction of virus assembly pathways, simultaneously encoding the order in which capsomers are recruited to the growing capsid shell along different assembly pathways. It captures geometric constraints on PS positions in the linear genomic sequence that arise from the relative positions of the RNA-CP-binding sites in the inner capsid surface (Fig. 1). SLs in close proximity in the linear genomic sequence acting as PSs must occupy proximal CP-binding sites on the inner capsid surface. SLs distal in the genomic sequence can also potentially be neighbours on the capsid surface, provided that the RNA segment between them occupies the capsid interior so as to bring them into proximity. Such constraint sets are akin to those of a large Sudoku puzzle, and HPA collectively tests them against experimental data. In particular, in HPA a polyhedron is used to represent all possible connections between neighbouring CP-PS contacts, with vertices at the binding sites and edges representing these (possible) connections. Each CP-PS contact is unique, i.e., can only occur once. We therefore represent the order in which these contacts are formed pictorially as an inscribed self-avoiding path on the polyhedron (see Fig. 1 bottom for an example of such a path). We stress that, in mathematics terms, this path is only 'topologically equivalent' to the more complicated 3D path taken by the RNA, meaning that the connectivity between binding sites with reference to the linear genomic sequence is the same in both cases (see Fig. 1 top, illustrating how a genomic fragment containing three PSs could map into 3D biology; PDB ID: 1ZDH12). We explicitly assume that the RNA genome is highly branched, since we predict these contacts to be with SLs located within the MS2 genome7, as seen in the new reconstruction. The Hamiltonian path concept is thus an abstraction of discrete 3D contact sites into a linear path that should be understood in the same spirit as the simple lines between atoms in molecular structures are shorthand for the much more complex electronic arrangements of covalent bonds. We note that HPA does not require all possible RNA-CP sites to be occupied. Indeed, testing all possible SLs in the ensemble of putative PS candidates against all complete Hamiltonian paths would be a complex task, and indeed is not always possible as different interaction patterns can occur in the vicinity of asymmetric features such as the A-protein13.
An application of HPA to MS27 has revealed that binding sites are differentially constrained across the capsid surface, implying that some (highly constrained) ones are likely to be present in almost every particle, while others (mostly low affinity ones) are more likely to vary across different particles, thus predicting that the RNA conformation in contact with the protein shell will exhibit some similar structural characteristics in every viral particle. This astonishing conclusion is consistent with both Koning et al.’s and Dai et al.’s reconstructions of MS2, identifying a core set of contacts that are present in every particle. Dai et al. explicitly identified 15 RNA SLs in contact with CP and one in contact with the A-protein. Our HPA in Dykeman et al.7 has predicted all of these 15 RNA-CP contacts as shown in Fig. 2. The additional PSs that were also identified by HPA are predominantly of lower affinity to CP, and are therefore not expected to be present in every particle. The results of HPA are also in excellent agreement with the RNA-CP-binding sites identified via cross-linking immunoprecipitation (CLIP) experiments14
The HPA has been key in identifying the nature of the RNA-CP contacts in MS2, and thus is fundamental to our understanding of the virion structure. It has played a central role in establishing the packaging signal hypothesis3,9,10,11,15,16,17,18, and in understanding how PSs cooperatively promote efficient virus assembly16. This suggests that HPA should be useful also for the study of the many other viruses, whose complete structures are likely to emerge in the near future from modern EM studies. Such structures will also highlight those viruses that exploit the multiple RNA packaging signal-mediated assembly mechanism.
References
Koning, R. I. et al. Asymmetric cryo-EM reconstruction of phage MS2 reveals genome structure in situ. Nat. Commun. 7, 12524 (2016).
Dai, X. et al. In situ structures of the genome and genome-delivery apparatus in a single-stranded RNA virus. Nature 541, 112–116 (2017).
Dykeman, E. C. et al. Simple rules for efficient assembly predict the layout of a packaged viral RNA. J. Mol. Biol. 408, 399–407 (2011).
Valegård, K., Murray, J. B., Stockley, P. G., Stonehouse, N. J. & Liljas, L. Crystal structure of an RNA bacteriophage coat protein-operator complex. Nature 371, 623–626 (1994).
Stockley, P. G. et al. A simple, RNA-mediated allosteric switch controls the pathway to formation of a T = 3 viral capsid. J. Mol. Biol. 369, 541–552 (2007).
Dykeman, E. C., Stockley, P. G. & Twarock, R. Dynamic allostery controls coat protein conformer switching during MS2 phage assembly. J. Mol. Biol. 395, 916–923 (2010).
Dykeman, E. C., Stockley, P. G. & Twarock, R. Packaging signals in two single-stranded RNA viruses imply a conserved assembly mechanism and geometry of the packaged genome. J. Mol. Biol. 425, 3235–3249 (2013).
Prevelige, P. Follow the yellow brick road: a paradigm shift in virus assembly. J. Mol. Biol. 428, 416–418 (2016).
Shakeel, S. et al. Genomic RNA folding mediates assembly of human parechovirus. Nat. Commun. 8, 5 (2017).
Patel, N. et al. HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly. Nat. Microbiol. 2, 17098 (2017).
Stewart, H. et al. Identification of novel RNA secondary structures within the hepatits C virus genome reveals a cooperative involvement in genome packaging. Sci. Rep. 6, 22952 (2016).
Valegård, K. et al. The three-dimensional structures of two complexes between recombinant MS2 capsids and RNA operator fragments reveal sequence-specific protein-RNA interactions. J. Mol. Biol. 270, 724–738 (1997).
Geraets, J. A. et al. Asymmetric genome organization in an RNA virus revealed via graph-theoretical analysis of tomographic data. PLoS Comp. Biol. 11, e1004146 (2015).
Rolfsson, O. et al. Direct evidence for packaging signal-mediated assembly of bacteriophage MS2. J. Mol. Biol. 428, 431–448 (2016).
Borodavka, A., Tuma, R. & Stockley, P. G. Evidence that viral RNAs have evolved for efficient, two-stage packaging. PNAS 109, 15769–15774 (2012).
Dykeman, E. C., Stockley, P. G. & Twarock, R. Solving a levinthal’s paradox for virus assembly suggests a novel anti-viral therapy. PNAS 111, 5361–5366 (2014).
Patel, N. et al. Revealing the density of encoded functions in a viral RNA. PNAS 112, 2227–2232 (2015).
Patel, N. et al. Rewriting nature’s assembly manual for a ssRNA virus. PNAS 114, 12255–12260 (2017).
Acknowledgements
We thank Dr. Richard Bingham for help with this figure.
Author information
Authors and Affiliations
Contributions
R.T., G.L. and P.G.S. jointly wrote the correspondence.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Twarock, R., Leonov, G. & Stockley, P.G. Hamiltonian path analysis of viral genomes. Nat Commun 9, 2021 (2018). https://doi.org/10.1038/s41467-018-03713-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-018-03713-y
This article is cited by
-
Programmable polymorphism of a virus-like particle
Communications Materials (2022)
-
Physics of viral dynamics
Nature Reviews Physics (2021)
-
Individual subunits of a rhinovirus causing common cold exhibit largely different protein-RNA contact site conformations
Communications Biology (2020)
-
Weighted Fundamental Group
Bulletin of the Malaysian Mathematical Sciences Society (2020)
-
Structural puzzles in virology solved with an overarching icosahedral design principle
Nature Communications (2019)