Hamiltonian path analysis of viral genomes

Twarock, Reidun; Leonov, German; Stockley, Peter G.

doi:10.1038/s41467-018-03713-y

Download PDF

Correspondence
Open access
Published: 22 May 2018

Hamiltonian path analysis of viral genomes

Nature Communications volume 9, Article number: 2021 (2018) Cite this article

3399 Accesses
25 Citations
2 Altmetric
Metrics details

Subjects

Introduction

Cryo-electron microscopy (EM) is undergoing a revolution, enabling the study of viral pathogens in unprecedented detail. The asymmetric EM reconstruction of bacteriophage MS2 at medium resolution (8.7 Å) by Koning et al.¹, and the subsequent reconstruction at even higher resolution (3.6 Å) by Dai et al.² revealed the structures of both the protein shell and the asymmetric genomic RNA and the unique maturation protein (A). It is the start of a wave of such structural data for viruses, and calls for the development of new analytical tools to describe the results. One approach is Hamiltonian path analysis (HPA) that we introduced to describe repeated, sequence-specific contacts between the MS2 genome and its protein shell³. Here, we describe how HPA is consistent with the new structures and, in turn, how it extends our understanding beyond the structural data alone.

Koning et al.’s and Dai et al.’s reconstructions of MS2 reveal multiple contacts between genomic RNA and the viral capsid. These mimick the contacts seen by crystallography of virus-like particles carrying multiple copies of the high-affinity RNA packaging signal in this virus, the translational repressor (TR) that functions also as assembly initiation signal⁴. TR occurs only once in the MS2 genome, but insights into the roles played by RNA-coat protein (CP) contacts during assembly⁵ together with normal mode analysis⁶ and structural studies suggest that many different stem loops (SLs) in the viral genome should be able to bind CP and thus promote virion formation, i.e., act as RNA packaging signals (PSs). HPA has enabled us to identify such sites within the MS2 genome⁷, which is important, because it underlies the development of a new paradigm for ssRNA virus assembly based on multiple PSs, that seems to occur very widely in nature^8,9,10,11.

HPA is a mathematical abstraction of virus assembly pathways, simultaneously encoding the order in which capsomers are recruited to the growing capsid shell along different assembly pathways. It captures geometric constraints on PS positions in the linear genomic sequence that arise from the relative positions of the RNA-CP-binding sites in the inner capsid surface (Fig. 1). SLs in close proximity in the linear genomic sequence acting as PSs must occupy proximal CP-binding sites on the inner capsid surface. SLs distal in the genomic sequence can also potentially be neighbours on the capsid surface, provided that the RNA segment between them occupies the capsid interior so as to bring them into proximity. Such constraint sets are akin to those of a large Sudoku puzzle, and HPA collectively tests them against experimental data. In particular, in HPA a polyhedron is used to represent all possible connections between neighbouring CP-PS contacts, with vertices at the binding sites and edges representing these (possible) connections. Each CP-PS contact is unique, i.e., can only occur once. We therefore represent the order in which these contacts are formed pictorially as an inscribed self-avoiding path on the polyhedron (see Fig. 1 bottom for an example of such a path). We stress that, in mathematics terms, this path is only 'topologically equivalent' to the more complicated 3D path taken by the RNA, meaning that the connectivity between binding sites with reference to the linear genomic sequence is the same in both cases (see Fig. 1 top, illustrating how a genomic fragment containing three PSs could map into 3D biology; PDB ID: 1ZDH¹²). We explicitly assume that the RNA genome is highly branched, since we predict these contacts to be with SLs located within the MS2 genome⁷, as seen in the new reconstruction. The Hamiltonian path concept is thus an abstraction of discrete 3D contact sites into a linear path that should be understood in the same spirit as the simple lines between atoms in molecular structures are shorthand for the much more complex electronic arrangements of covalent bonds. We note that HPA does not require all possible RNA-CP sites to be occupied. Indeed, testing all possible SLs in the ensemble of putative PS candidates against all complete Hamiltonian paths would be a complex task, and indeed is not always possible as different interaction patterns can occur in the vicinity of asymmetric features such as the A-protein¹³.

An application of HPA to MS2⁷ has revealed that binding sites are differentially constrained across the capsid surface, implying that some (highly constrained) ones are likely to be present in almost every particle, while others (mostly low affinity ones) are more likely to vary across different particles, thus predicting that the RNA conformation in contact with the protein shell will exhibit some similar structural characteristics in every viral particle. This astonishing conclusion is consistent with both Koning et al.’s and Dai et al.’s reconstructions of MS2, identifying a core set of contacts that are present in every particle. Dai et al. explicitly identified 15 RNA SLs in contact with CP and one in contact with the A-protein. Our HPA in Dykeman et al.⁷ has predicted all of these 15 RNA-CP contacts as shown in Fig. 2. The additional PSs that were also identified by HPA are predominantly of lower affinity to CP, and are therefore not expected to be present in every particle. The results of HPA are also in excellent agreement with the RNA-CP-binding sites identified via cross-linking immunoprecipitation (CLIP) experiments¹⁴

The HPA has been key in identifying the nature of the RNA-CP contacts in MS2, and thus is fundamental to our understanding of the virion structure. It has played a central role in establishing the packaging signal hypothesis^{3,9,10,11,15,16,17,18}, and in understanding how PSs cooperatively promote efficient virus assembly¹⁶. This suggests that HPA should be useful also for the study of the many other viruses, whose complete structures are likely to emerge in the near future from modern EM studies. Such structures will also highlight those viruses that exploit the multiple RNA packaging signal-mediated assembly mechanism.

References

Koning, R. I. et al. Asymmetric cryo-EM reconstruction of phage MS2 reveals genome structure in situ. Nat. Commun. 7, 12524 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Dai, X. et al. In situ structures of the genome and genome-delivery apparatus in a single-stranded RNA virus. Nature 541, 112–116 (2017).
Article ADS PubMed CAS Google Scholar
Dykeman, E. C. et al. Simple rules for efficient assembly predict the layout of a packaged viral RNA. J. Mol. Biol. 408, 399–407 (2011).
Article PubMed CAS Google Scholar
Valegård, K., Murray, J. B., Stockley, P. G., Stonehouse, N. J. & Liljas, L. Crystal structure of an RNA bacteriophage coat protein-operator complex. Nature 371, 623–626 (1994).
Article ADS PubMed Google Scholar
Stockley, P. G. et al. A simple, RNA-mediated allosteric switch controls the pathway to formation of a T = 3 viral capsid. J. Mol. Biol. 369, 541–552 (2007).
Article PubMed CAS Google Scholar
Dykeman, E. C., Stockley, P. G. & Twarock, R. Dynamic allostery controls coat protein conformer switching during MS2 phage assembly. J. Mol. Biol. 395, 916–923 (2010).
Article PubMed CAS Google Scholar
Dykeman, E. C., Stockley, P. G. & Twarock, R. Packaging signals in two single-stranded RNA viruses imply a conserved assembly mechanism and geometry of the packaged genome. J. Mol. Biol. 425, 3235–3249 (2013).
Article PubMed CAS Google Scholar
Prevelige, P. Follow the yellow brick road: a paradigm shift in virus assembly. J. Mol. Biol. 428, 416–418 (2016).
Article PubMed CAS Google Scholar
Shakeel, S. et al. Genomic RNA folding mediates assembly of human parechovirus. Nat. Commun. 8, 5 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Patel, N. et al. HBV RNA pre-genome encodes specific motifs that mediate interactions with the viral core protein that promote nucleocapsid assembly. Nat. Microbiol. 2, 17098 (2017).
Article PubMed PubMed Central CAS Google Scholar
Stewart, H. et al. Identification of novel RNA secondary structures within the hepatits C virus genome reveals a cooperative involvement in genome packaging. Sci. Rep. 6, 22952 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Valegård, K. et al. The three-dimensional structures of two complexes between recombinant MS2 capsids and RNA operator fragments reveal sequence-specific protein-RNA interactions. J. Mol. Biol. 270, 724–738 (1997).
Article PubMed Google Scholar
Geraets, J. A. et al. Asymmetric genome organization in an RNA virus revealed via graph-theoretical analysis of tomographic data. PLoS Comp. Biol. 11, e1004146 (2015).
Article CAS Google Scholar
Rolfsson, O. et al. Direct evidence for packaging signal-mediated assembly of bacteriophage MS2. J. Mol. Biol. 428, 431–448 (2016).
Article PubMed PubMed Central CAS Google Scholar
Borodavka, A., Tuma, R. & Stockley, P. G. Evidence that viral RNAs have evolved for efficient, two-stage packaging. PNAS 109, 15769–15774 (2012).
Article ADS PubMed PubMed Central Google Scholar
Dykeman, E. C., Stockley, P. G. & Twarock, R. Solving a levinthal’s paradox for virus assembly suggests a novel anti-viral therapy. PNAS 111, 5361–5366 (2014).
Article ADS PubMed PubMed Central CAS Google Scholar
Patel, N. et al. Revealing the density of encoded functions in a viral RNA. PNAS 112, 2227–2232 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Patel, N. et al. Rewriting nature’s assembly manual for a ssRNA virus. PNAS 114, 12255–12260 (2017).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

We thank Dr. Richard Bingham for help with this figure.

Author information

Authors and Affiliations

York Cross-disciplinary Centre for Systems Analysis, University of York, York, YO10 5DD, UK
Reidun Twarock & German Leonov
Departments of Mathematics and Biology, University of York, York, YO10 5DD, UK
Reidun Twarock & German Leonov
Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds, LS2 9JT, UK
Peter G. Stockley

Authors

Reidun Twarock
View author publications
You can also search for this author in PubMed Google Scholar
German Leonov
View author publications
You can also search for this author in PubMed Google Scholar
Peter G. Stockley
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

R.T., G.L. and P.G.S. jointly wrote the correspondence.

Corresponding author

Correspondence to Reidun Twarock.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Twarock, R., Leonov, G. & Stockley, P.G. Hamiltonian path analysis of viral genomes. Nat Commun 9, 2021 (2018). https://doi.org/10.1038/s41467-018-03713-y

Download citation

Received: 30 September 2016
Accepted: 02 March 2018
Published: 22 May 2018
DOI: https://doi.org/10.1038/s41467-018-03713-y

This article is cited by

Programmable polymorphism of a virus-like particle
- Artur P. Biela
- Antonina Naskalska
- Jonathan G. Heddle
Communications Materials (2022)
Physics of viral dynamics
- Robijn F. Bruinsma
- Gijs J. L. Wuite
- Wouter H. Roos
Nature Reviews Physics (2021)
Individual subunits of a rhinovirus causing common cold exhibit largely different protein-RNA contact site conformations
- Dieter Blaas
Communications Biology (2020)
Weighted Fundamental Group
- Chengyuan Wu
- Shiquan Ren
- Kelin Xia
Bulletin of the Malaysian Mathematical Sciences Society (2020)
Structural puzzles in virology solved with an overarching icosahedral design principle
- Reidun Twarock
- Antoni Luque
Nature Communications (2019)