Getting polygenic risk scores into the clinic

Credit: Spencer Phillips

Getting polygenic risk scores into the clinic

10 Mar 2021 - 16:00


  • The PGS Catalog is a new open database for polygenic risk scores
  • Polygenic risk scores represent a new approach for assessing a person's inherited risk for certain diseases such as Type 2 diabetes or coronary heart disease
  • A set of new guidelines for reporting polygenic risk scores in a consistent way could make the data more useful in clinical care

10 March 2021, Cambridge – Scientists and healthcare providers are beginning to use a new approach for assessing a person's inherited risk for complex diseases such as Type 2 diabetes, coronary heart disease and breast cancer, which involves calculating a polygenic risk score (PGS). The score provides an estimate of an individual’s risk for the disease, based on their DNA changes related to those diseases.

Despite the rise in studies reporting PGS, researchers have observed inconsistencies in how such scores are calculated and reported, and lack of adherence to standards has hindered the translation of this important tool into clinical and public health

PGS Catalog

To support the adoption of PGS in the clinical setting, EMBL’s European Bioinformatics Institute (EMBL-EBI) teamed up with the University of Cambridge to create the PGS Catalog, an open database of published polygenic scores.

Each PGS in the database is consistently annotated with relevant metadata, including how the PGS was developed and applied, and evaluations of its predictive performance.

What is a polygenic score?

A polygenic score (PGS) aggregates the effects of many genetic variants into a single number, which predicts genetic predisposition for a trait or phenotype.

PGS are typically composed of millions of genetic variants (usually single-nucleotide polymorphism also called SNPs), which are combined using a weighted sum of allele dosages, multiplied by their corresponding effect sizes.

PGS are also sometimes called genetic scores or genomic scores, polygenic risk scores (PRS) or genomic risk scores (GRS) if they predict a discrete phenotype, such as a disease.

To calculate a person’s polygenic risk score, researchers survey DNA variants in over 6 billion locations in the human genome.

“The PGS Catalog was born out of an urgent need to collect published polygenic risk scores in one place, but without consistent standards, the data are not as useful as they could be,” explains Helen Parkinson, Head of Molecular Archival Resources at EMBL-EBI. “The next step was to create and test a set of guidelines outlining which information scientists should include in their PGS studies.”

Guidelines for publication

The PGS Catalog team collaborated with NHGRI’s Clinical Genome Resource's (ClinGen) Complex Disease Working Group to create a minimal information framework for polygenic risk scores, published in the journal Nature, which will help promote the validity, transparency and reproducibility of the data.

"A real challenge is that the research community has not adopted any universal best practices for reporting polygenic risk scores," said Erin Ramos, Program Director for ClinGen, Deputy Director of the NHGRI Division of Genomic Medicine and co-author of the paper. "With the field growing as fast as it is, we need standards in place so we can meaningfully evaluate these scores and determine which ones are ready to be used in clinical care."

The new framework suggests that scientists should explain the statistical methods they used to develop and validate the polygenic risk scores. Without a consistent way of reporting PGS, it is nearly impossible to compare the utility of the scores for assessing disease risk in people. According to the new guidelines, researchers should also consider potential limitations of these scores and how clinicians should use the scores in patient care.

"If researchers can follow these guidelines, it will be more straightforward to evaluate published polygenic risk scores and decide which ones are a good fit for the clinical setting," said Michael Inouye, Director of the Cambridge Baker Systems Genomics Initiative. "For diseases such as breast cancer and many others, we will be able to responsibly place patients in different risk categories and provide beneficial screening strategies and treatments. Ideally, in the future we will detect risk early enough to combat the disease more effectively."

Find out more

Read the full NHGRI press release.

Source articles

WAND, H., et al. (2021). Improving reporting standards for polygenic scores in risk prediction studies. Nature. Published online 10 04; DOI: 10.1038/s41586-021-03243-6

LAMBERT, S. A. (2020). The Polygenic Score Catalog: an open database for reproducibility and systematic evaluation. medRxiv. Published online 23 05; DOI: 10.1101/2020.05.20.20108217


ClinGen is funded by NHGRI. Stanford University, an awardee site of the ClinGen consortium, led this effort. The PGS Catalog is funded by the EMBL European Bioinformatics Institute, University of Cambridge, Baker Heart and Diabetes Institute and Health Data Research, U.K.

Contact the news team

Vicky Hatch | Communications Officer

Oana Stroe | Senior Communications Officer

Subscribe to the email newsletter

Subscribe to our publications.

Sign up Or stay updated with the RSS feed (EMBL-EBI only).