Federated analysis for polygenic risk score calculations
Polygenic risk scores (PRS) provide an estimate of an individual’s disposition to a trait or complex disease which are calculated as the sum of the risk alleles weighted by the effect size estimate of the genome-wide association study data on the phenotype. PRS scores help in understanding the shared aetiology of certain traits and also in risk prediction and prevention of certain diseases.
As a part of the CINECA project we have been working on the development of a demonstrator of federated genetic analyses utilising a computational pipeline for PRS analysis. We have implemented this workflow using Nextflow, a modular and reproducible workflow manager that can be deployed and autoscaled to many different working environments including Slurm-based clusters and Kubernetes deployed using AWS amongst other computing environments. In this webinar, we will provide an overview of this PRS pipeline utilising the CINECA UK1 synthetic dataset, derived from the 1000 genomes project, as a demonstrator. As part of the work we will further extend the demonstrator to other datasets and GWAS workflows, and integrate it to be run in a federated setting utilising the GA4GH guidelines on using Beacons for data discovery along with AAI passports for data access authorisation and deployments.
The CINECA (Common Infrastructure for National Cohorts in Europe, Canada, and Africa) project aims to develop a federated cloud enabled infrastructure to make population scale genomic and biomolecular data accessible across international borders, to accelerate research, and improve the health of individuals across continents.
About the speakers
Will Rayner is the head of the Data and Analytics group at the Institute of Translational Genomics in the Computational Health Department at Helmholtz Munich. He is interested in all aspects of data management and data privacy and has been leading the CINECA WP4 PRS use case.
Anshika Chowdhary is a Data informatician, at the Institute of translational genomics in Helmholtz Zentrum Munich. She has been working on the development of the workflows in WP4 of the PRS use case, and analysis of the eQTL catalog on the datasets at HMGU.
Who is this course for?
Computational biologists or similar interested in federated analysis and polygenic risk scores.
Explain federated data analyses for polygenic risk scores (PRS).
Describe the computational pipeline created for PRS analysis.
This webinar, hosted on Zoom Webinar, is free to attend, but you need to register for a place beforehand, using the 'Register' button. Once registered, you will receive a confirmation email with a link to join the webinar.