Virtual course
Summer school in bioinformatics (Virtual)
This course provides an introduction to the use of bioinformatics in biological research, giving participants guidance for using bioinformatics in their work whilst also providing hands-on training in tools and resources appropriate to their research.
Participants will initially be introduced to bioinformatics theory and practice, including best practices for undertaking bioinformatics analysis, data management and reproducibility. To enable specific exploration of resources in their particular field of interest, participants will be divided into focused groups to work on a small project set by EMBL-EBI resource and research staff, ending in a presentation from each group on the final day of the course to bring together learnings from all participants.
The course includes training and mentoring by experts from EMBL-EBI and external institutes.
Group projects
A major element of this course is a group project, where participants will be placed in small groups to work together on a challenge set by trainers from EMBL-EBI. This allows people to explore the bioinformatics tools and resources available in their area of interest and apply them to a set problem, providing participants with hands-on experience relevant to their own research. The group work will culminate in a presentation session involving all participants on the final day of the course, giving an opportunity for wider discussion on the benefits and challenges of working with biological data.
Groups are mentored and supported by the trainers who set the initial challenge, but the groups will be responsible for driving their projects forward, with all members expected to take an active role. Groups are pre-organised before the course, and all group members will be sent some short “homework” in preparation for their project work prior to the start of the course.
Basic outlines of the projects on offer this year are given below. In your application you must indicate your first and second choice of project, based on which you think would benefit your research most. Not all projects may be offered, and final decisions on which projects will be run during the course will be made based on the number of applicants per project.
This year’s projects are as follows:
Networks and pathways
This project will make use of gene expression data (RNA-seq) to build protein-protein interaction networks, which can be used to explore functional relationships between the (potentially) expressed protein products. You will use Cytoscape to visualise protein networks, identify key regulators of biological pathways and explore biological function through network analysis, integration and co-visualisation of additional data, and ontology/functional enrichment analysis - helping to build a better view of the wider biological context.
Modelling cell signalling pathways
Curating models of biological processes is an effective training in computational systems biology, where the curators gain an integrative knowledge of biological systems, modelling and bioinformatics. You will learn to encode computational models of signalling pathways from a recent publication using COPASI and how to reproduce the simulation results. Furthermore, you will learn how to annotate models and re-use pre-existing models from open repositories such as BioModels.
Genome variation across human populations
Natural variation between individuals or between different human populations is a result of genome mutations throughout evolutionary history. Some mutations may become fixed because of their beneficial effect while most drift among individuals. During this project, you will investigate genomic variation between two separate human populations of European and Asian descent. Using sequence data from a number of individuals from each population, you will use a range of bioinformatics tools to discover variants that exist between them. In the second section of the project, you will attempt to analyse the functional consequences of the variants you have identified, linking them to phenotypes.
Metabolic network engineering using a systems model-based approach
Metabolic pathway analysis helps to identify the structure and dynamics of a metabolic network and thereby also allow us to have an insight into cell physiology which is the foundation of metabolic engineering. You will work with a curated model related to a metabolism network chosen from BioModels, and learn how to carry out computational analyses to find common patterns in the networks. These might include computing feasible pathways through the network and minimal reactions to knock out specific metabolic functions, along with visualisation and exploration to gain a further insight into the results.
Finding and interpreting publicly available structural data
This project will introduce you to the wealth of publicly available structural data and give you the opportunity to investigate how this data can be used to analyse macromolecular structures. You will firstly explore the search and entry pages at PDBe to identify the type of data available for analysis. Then using this knowledge, you will discover how to access this data programmatically and analyse a subset of your results to interpret biological relevance.
Functional annotation of proteins
Functional annotation refers to discovering the functions of proteins. Participants will annotate a protein of interest which has not been annotated previously, either manually or by any UniProt automatic system. Participants will use different ways to discover the function of their protein of interest from alignment to machine learning. You will learn the different annotation types in UniProtKB and how to use the UniProt resources for their investigation. You will also learn how to build machine learning models, from dataset construction to prediction.
Who is this course for?
This course is aimed at individuals working across life sciences who have little or no experience in bioinformatics. Applicants are expected to be at an early stage of using bioinformatics in their research with the need to develop their knowledge and skills further. No previous knowledge of programming is required for this course; group projects may give you the opportunity to learn basic programming, but participants will be supported in this by their mentors. Depending on your chosen project, an introductory programming tutorial may be given as homework prior to attending the course.
What will I learn?
Learning outcomes
After this course you should be able to:
-
Discuss applications of bioinformatics in biological research
-
Browse, search, and retrieve biological data from public repositories
-
Use appropriate bioinformatics tools to explore biological data
-
Comprehend ways that biological data can be stored, organised and integrated
Course content
During this course you will learn about:
-
Bioinformatics as a science
-
Designing bioinformatics studies
-
Data management and reproducibility
-
Basic tools and resources for bioinformatics
The exact range of resources and tools covered will vary depending on the group project undertaken; there will be no opportunity for you to analyse your own data during this course.
Trainers
Alexandra Holinski
EMBL-EBI, UK Anna Swan
EMBL-EBI, UK Alex Bateman
EMBL-EBI, UK Sarah Morgan
EMBL-EBI, UK Lee Larcombe
Amphimatic Mohamed Alibi
EMBL-EBI, UK Peter McQuilton
Oxford e-Research Centre Fabio Madeira
EMBL-EBI, UK Nandana Madhusoodanan
EMBL-EBI, UK Nikiforos Karamanis
EMBL-EBI, UK Christine Orengo
University College London, UK
Programme
Day 1 – Monday 16th October 2020
10:00 - 10:45
Welcome and introduction
Alex Holinski, Anna Swan
10:45 - 11:15
Icebreaker
Alex Holinski, Anna Swan
11:15 - 11:30
Break
11:30 - 12:30
The science of bioinformatics
Alex Bateman
12:30 - 13:30
Lunch
13:30 - 14:30
Introduction to the command line
Mohammed Alibi
14:30 - 15:00
Break
15:00 - 16:00
Good data management: Making your data FAIR
16:00 - 17:00
Social: tell us about your work!
Alex Holinski, Anna Swan
17:00 - 17:30
Optional get together
Day 2 – Tuesday 17th October 2020
09:30 - 10:00
Introduction and mini challenge
Alex Holinski, Anna Swan
10:00 - 11:15
An introduction to EMBL-EBI data resources
Sarah Morgan
11:15 - 11:30
Break
11:30 - 12:30
Data visualisation 101: A practical introdcution to designing scientific figures
Niki Karamanis
12:30 - 13:30
Lunch
13:30 - 14:00
Q&A for pre-recorded keynote lecture
Isidro Cortes
14:00 - 14:30
Break
14:30 - 16:00
Introduction to group projects and meet your mentors
Sarah Morgan and all mentors
Day 3 – Wednesday 18th December 2020
09:45 - 10:00
Day introduction
10:00 - 10:30
Group check-in with mentors
10:30 - 12:00
Group project work
12:00 - 12:30
Group check-in with mentors
12:30 - 13:30
Lunch
13:30 - 14:00
Mini tutorial Q&A: Interpreting integrated data
Lee Larcombe
14:00 - 15:30
Group prject work
15:30 - 16:00
Group check-in with mentors
Day 4 – Thursday 1st October 2020
09:45 - 10:00
Day introduction
10:00 - 10:30
Group check-in with mentors
10:30 - 12:00
Group project work
12:00 - 12:30
Mini tutorial Q&A: Introduction to data visualisation in R
Anna Swan
12:30 - 13:30
Lunch
From 1 pm: Optional chat with EBI Industry Team, Matt Hall and Effie Mutasa-Gottgens
13:30 - 14:00
Group check-in with mentors
14:00 - 15:30
Group project work
15:30 - 16:00
Q&A for pre-recorded keynote lecture
Christine Orengo
Day 4 – Friday 2nd October 2020
09:45 - 10:00
Day introduction
10:00 - 10:30
Group check-in with mentors
10:30 - 12:30
Group project work and presentation preparation
12:30 - 13:30
Lunch
13:30 - 14:45
Group presentation with mentors
14:45 - 15:15
Course feedback and wrap-up
Registration is handled through the Wellcome Genome Campus, Scientific Conferences and Advanced Courses website.
EMBL-EBI, UK
EMBL-EBI, UK
EMBL-EBI, UK
EMBL-EBI, UK
Amphimatic
EMBL-EBI, UK
Oxford e-Research Centre
EMBL-EBI, UK
EMBL-EBI, UK
EMBL-EBI, UK
University College London, UK
Programme
Day 1 – Monday 16th October 2020 |
||
|---|---|---|
| 10:00 - 10:45 | Welcome and introduction | Alex Holinski, Anna Swan |
| 10:45 - 11:15 | Icebreaker | Alex Holinski, Anna Swan |
| 11:15 - 11:30 | Break | |
|
11:30 - 12:30 |
The science of bioinformatics |
Alex Bateman |
| 12:30 - 13:30 |
Lunch |
|
| 13:30 - 14:30 | Introduction to the command line | Mohammed Alibi |
| 14:30 - 15:00 | Break | |
| 15:00 - 16:00 | Good data management: Making your data FAIR | |
| 16:00 - 17:00 | Social: tell us about your work! | Alex Holinski, Anna Swan |
| 17:00 - 17:30 | Optional get together |
Day 2 – Tuesday 17th October 2020 |
||
|---|---|---|
| 09:30 - 10:00 | Introduction and mini challenge | Alex Holinski, Anna Swan |
| 10:00 - 11:15 | An introduction to EMBL-EBI data resources | Sarah Morgan |
| 11:15 - 11:30 | Break | |
| 11:30 - 12:30 | Data visualisation 101: A practical introdcution to designing scientific figures | Niki Karamanis |
| 12:30 - 13:30 | Lunch | |
| 13:30 - 14:00 | Q&A for pre-recorded keynote lecture | Isidro Cortes |
| 14:00 - 14:30 | Break | |
| 14:30 - 16:00 | Introduction to group projects and meet your mentors | Sarah Morgan and all mentors |
Day 3 – Wednesday 18th December 2020 |
||
|---|---|---|
| 09:45 - 10:00 | Day introduction | |
| 10:00 - 10:30 | Group check-in with mentors | |
| 10:30 - 12:00 | Group project work | |
| 12:00 - 12:30 | Group check-in with mentors | |
| 12:30 - 13:30 | Lunch | |
| 13:30 - 14:00 | Mini tutorial Q&A: Interpreting integrated data | Lee Larcombe |
| 14:00 - 15:30 | Group prject work | |
| 15:30 - 16:00 | Group check-in with mentors |
Day 4 – Thursday 1st October 2020 |
||
|---|---|---|
| 09:45 - 10:00 | Day introduction | |
| 10:00 - 10:30 | Group check-in with mentors | |
| 10:30 - 12:00 | Group project work | |
| 12:00 - 12:30 | Mini tutorial Q&A: Introduction to data visualisation in R | Anna Swan |
| 12:30 - 13:30 |
Lunch From 1 pm: Optional chat with EBI Industry Team, Matt Hall and Effie Mutasa-Gottgens |
|
| 13:30 - 14:00 | Group check-in with mentors | |
| 14:00 - 15:30 | Group project work | |
| 15:30 - 16:00 | Q&A for pre-recorded keynote lecture | Christine Orengo |
Day 4 – Friday 2nd October 2020 |
||
|---|---|---|
| 09:45 - 10:00 | Day introduction | |
| 10:00 - 10:30 | Group check-in with mentors | |
| 10:30 - 12:30 | Group project work and presentation preparation | |
| 12:30 - 13:30 | Lunch | |
| 13:30 - 14:45 | Group presentation with mentors | |
| 14:45 - 15:15 | Course feedback and wrap-up |
Registration is handled through the Wellcome Genome Campus, Scientific Conferences and Advanced Courses website.