Skip to main content

Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

  • Comment
  • Published:

Enhancing the Polygenic Score Catalog with tools for score calculation and ancestry normalization

Polygenic scores (PGSs) have transformed human genetic research and have numerous potential clinical applications. Here we present a series of recent enhancements to the PGS Catalog and highlight the PGS Catalog Calculator, an open-source, scalable and portable pipeline for reproducibly calculating PGSs that democratizes equitable PGS applications.

This is a preview of subscription content, access via your institution

Access options

Buy this article

Prices may be subject to local taxes which are calculated during checkout

Fig. 1: Data growth and improvement of the PGS Catalog.
Fig. 2: The PGS Catalog Calculator for reproducible PGS calculation, with estimation and adjustment for genetic ancestry.

Data availability

Data in the PGS Catalog are accessible via our website (http://www.PGSCatalog.org), FTP server (https://ftp.ebi.ac.uk/pub/databases/spot/pgs/) and REST API (http://www.PGSCatalog.org/rest). PGS in the Catalog are distributed according to the European Bioinformatic Institute’s terms of use (https://www.ebi.ac.uk/about/terms-of-use) or specifically marked with open-access licenses supplied by the authors. The processed reference panels 1kGP and 1kGP+HGDP are available from our FTP server (https://ftp.ebi.ac.uk/pub/databases/spot/pgs/resources/). UK Biobank data can be accessed via application (https://www.ukbiobank.ac.uk/enable-your-research/apply-for-access); the data for this publication were accessed as part of projects 49978 and 78537.

Code availability

All codes for the PGS Catalog are hosted on GitHub under the PGS Catalog organization (https://github.com/PGScatalog/; see https://github.com/PGScatalog/pgsc_calc for calculator) and are released under an Apache v2 license.

References

  1. Abdellaoui, A., Yengo, L., Verweij, K. J. H. & Visscher, P. M. Am. J. Hum. Genet. 110, 179–194 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Xiang, R. et al. Genome Med 16, 33 (2024).

    Article  PubMed  PubMed Central  Google Scholar 

  3. Suzuki, K. et al. Nature 627, 347–357 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Lambert, S. A. et al. Nat. Genet. 53, 420–425 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Wand, H. et al. Nature 591, 211–219 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Truong, B. et al. Cell Genom 4, 100523 (2024).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Hao, L. et al. Nat. Med. 28, 1006–1013 (2022).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Allot, A., Lee, K., Chen, Q., Luo, L. & Lu, Z. Nucleic Acids Res. 49, W352–W358 (2021).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Martin, A. R. et al. Nat. Genet. 51, 584–591 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Ewels, P. A. et al. Nat. Biotechnol. 38, 276–278 (2020).

    Article  CAS  PubMed  Google Scholar 

  11. Martin, A. R. et al. Am. J. Hum. Genet. 100, 635–649 (2017).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Khera, A. V. et al. Circulation 139, 1593–1602 (2019).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Linder, J. E. et al. Genet. Med. 25, 100006 (2023).

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Committee on the Use of Race, Ethnicity, and Ancestry as Population Descriptors in Genomics Research et al. Using Population Descriptors in Genetics and Genomics Research (National Academies Press, 2023).

  15. Koenig, Z. et al. Genome Res. https://doi.org/10.1101/gr.278378.123 (2023).

Download references

Acknowledgements

We thank all the authors of publications in the PGS Catalog for making their data available and indexable in our database, and all those who responded to our inquiries and requests for data. We also thank the users of the PGS Catalog Calculator, especially those that have contributed to bug reports, feature requests, or discussions. This work was supported by core funding from the British Heart Foundation (BHF) (grants RG/18/13/33946 and RG/F/23/110103); the National Human Genome Research Institute of the National Institutes of Health (grant 1U24HG012542-01 to H.P. and M.I.); the UK National Institute for Health and Care Research (NIHR) Cambridge Biomedical Research Centre (grants BRC-1215–20014 and NIHR203312); the Cambridge BHF Centre of Research Excellence (grants RE/18/1/34212 and RE/18/1/34212]); a BHF Chair Award (CH/12/2/29428); the European Union’s Horizon 2020 research and innovation program (under grant agreement 101016775 INTERVENE to H.P., M.I. and B.W.); and by Health Data Research UK, which is funded by the UK Medical Research Council, Engineering and Physical Sciences Research Council, Economic and Social Research Council, Department of Health and Social Care (England), Chief Scientist Office of the Scottish Government Health and Social Care Directorates, Health and Social Care Research and Development Division (Welsh Government), Public Health Agency (Northern Ireland), British Heart Foundation and Wellcome. The views expressed are those of the authors and not necessarily those of the National Health Service (NHS), the NIHR or the Department of Health and Social Care. This work was carried out using computing resources provided by the Cambridge Service for Data Driven Discovery (CSD3) operated by the University of Cambridge Research Computing Service (www.csd3.cam.ac.uk), provided by Dell EMC and Intel using Tier-2 funding from the Engineering and Physical Sciences Research Council (capital grant EP/T022159/1), and DiRAC funding from the Science and Technology Facilities Council (www.dirac.ac.uk). The authors wish to acknowledge the CSC–IT Center for Science, Finland, for providing computing resources. M.I. is supported by the Munz Chair of Cardiovascular Prediction and Prevention and the UK Economic and Social Research 878 Council (grant ES/T013192/1). H.P. is supported by European Molecular Biology Laboratory Core Funds.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Samuel A. Lambert or Michael Inouye.

Ethics declarations

Competing interests

M.I. is a trustee of the Public Health Genomics Foundation; is a member of the Scientific Advisory Board of Open Targets; and has research collaborations with AstraZeneca, Nightingale Health and Pfizer that are unrelated to this study. All other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Niall Lennon and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Supplementary information

Supplementary Information

Supplementary Notes 1–4, Supplementary Figures 1–6, and Supplementary Tables 1–2.

Reporting Summary

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lambert, S.A., Wingfield, B., Gibson, J.T. et al. Enhancing the Polygenic Score Catalog with tools for score calculation and ancestry normalization. Nat Genet (2024). https://doi.org/10.1038/s41588-024-01937-x

Download citation

  • Published:

  • DOI: https://doi.org/10.1038/s41588-024-01937-x

Search

Quick links

Nature Briefing: Translational Research

Sign up for the Nature Briefing: Translational Research newsletter — top stories in biotechnology, drug discovery and pharma.

Get what matters in translational research, free to your inbox weekly. Sign up for Nature Briefing: Translational Research