Structure prediction for orphan proteins

Singh, Arunima

doi:10.1038/s41592-023-01795-1

Research Highlight
Published: 10 February 2023

Computational biology

Structure prediction for orphan proteins

Arunima Singh¹

Nature Methods volume 20, page 176 (2023)Cite this article

2155 Accesses
1 Citations
3 Altmetric
Metrics details

Subjects

Access through your institution

Buy or subscribe

AlphaFold2 and other deep learning-based approaches have provided a breakthrough advance in the field of protein structure prediction. Inspired by this, scientists all over the world have been extending the idea to even more challenging areas in structure prediction, such as multimeric protein complexes and RNA structures. One such structure prediction challenge is that of orphan proteins, or proteins that have no close homologs. An important step within AlphaFold2 is the use of coevolution signals derived from a multiple sequence alignment. This alignment identifies homologous — similar but not identical — sequences for the protein of interest. This step, however, is not feasible for orphan proteins, and current approaches fail to predict accurate structures for such protein and their complexes.

Researchers from Nankai University and Shandong University in China, led by Jianyi Yang, have developed trRosettaX-Single, a single-sequence protein structure prediction method that shows better performance on orphan proteins than AlphaFold2 and RoseTTAFold. “A pretrained language model based on supervised learning (s-ESM-1b) is first employed in trRosettaX-Single to encode the sequence as an embedding vector. This vector is then fed into a multiscale residual network to predict inter-residue 2D geometry, including distance and orientations. Finally, energy minimization is adopted to generate 3D structure models from the predicted 2D geometry,” explains Yang. A few other new training strategies, including a multiscale residual network, sequence mask prediction and knowledge distillation, also contribute to success of trRosettaX-Single, he adds.

This is a preview of subscription content, access via your institution

Access options

Access through your institution

Buy this article

Purchase on Springer Link
Instant access to full article PDF

Buy now

Prices may be subject to local taxes which are calculated during checkout

Author information

Authors and Affiliations

Nature Methods https://www.nature.com/nmeth/
Arunima Singh

Authors

Arunima Singh
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Arunima Singh.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Singh, A. Structure prediction for orphan proteins. Nat Methods 20, 176 (2023). https://doi.org/10.1038/s41592-023-01795-1

Download citation

Published: 10 February 2023
Issue Date: February 2023
DOI: https://doi.org/10.1038/s41592-023-01795-1

Structure prediction for orphan proteins

Subjects

Access options

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Search

Quick links

Subjects

Access options

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Search

Quick links