Course at EMBL-EBI
Summer school in bioinformatics
Thank you to those of you who have applied, we have now selected our participants and wait list. There are no more spaces available. Please register your interest for the 2025 course here.
Details of the 2024 course can be found below.
This course provides an introduction to the use of bioinformatics in biological research, giving you guidance for using bioinformatics in your work whilst also providing hands-on training in tools and resources appropriate to your research.
You will initially be introduced to bioinformatics theory and practice, including best practices for undertaking bioinformatics analysis, data management, and reproducibility.
You will be required to review some pre-recorded material for their group project prior to the start of the course.
Group projects
A major element of this course is a group project, where you'll be placed in small groups to work together on a challenge set by trainers from EMBL-EBI and external institutes. This allows you to explore the bioinformatics tools and resources available in your area of interest and apply them to a set problem, providing you with hands-on experience relevant to your own research. The group work will culminate in a presentation session involving everyone on the final day of the course, giving an opportunity for wider discussion on the benefits and challenges of working with biological data.
Groups are mentored and supported by the trainers who set the initial challenge, but the groups will be responsible for driving their projects forward, with all members expected to take an active role. Groups are pre-organised before the course, and all group members will be sent some short “homework” in preparation for your project work prior to the start of the course.
Basic outlines of the projects on offer this year are given below. In your application, you must indicate your first and second choice of project, based on which you think would benefit your research most. Not all projects may be offered, and final decisions on which projects will be run during the course will be made based on the number of applicants per project.
Most of the projects cover mammalian data sets, however, in many cases, the methods and approaches taught are transferable to data from various species.
Group project one: Genome variation across human populations
Natural variation between individuals or between different human populations is a result of genome mutations throughout evolutionary history. Some mutations may become fixed because of their beneficial effect while most drift among individuals. During this project, you will investigate genomic variation between two separate human populations of European and Asian descent. Using sequence data from a number of individuals from each population, you will use a range of bioinformatics tools to discover variants that exist between them. In the second section of the project, you will attempt to analyse the functional consequences of the variants you have identified, attempting to find clinical association and linking them to phenotypes.
Project mentor: Anu Shivalikanjli | EMBL-EBI
Group project two: Interpreting functional information from large scale protein structure data
This project will introduce you to the wealth of publicly available data in the Protein Data Bank (PDB) and give you the opportunity to investigate how large subsets of structure data can be used to analyse protein features and determine function. In the project you will learn how to identify relevant protein structures, collate and interpret functional information, and implement this process programmatically.
Project mentor: Marcus Bage | EMBL-EBI and Joseph Ellaway | EMBL-EBI
Group project three: Modelling cell signalling pathways
Curating models of biological processes is an effective training in computational systems biology, where the curators gain an integrative knowledge of biological systems, modelling, and bioinformatics. You will learn to encode and simulate ordinary differential equation models of signalling pathways from a recent publication using user-friendly software such as COPASI even without extensive mathematical background. You will learn to perform in-silico experiments, new predictions, and develop hypotheses. Furthermore, you will learn how to annotate models and re-use pre-existing models from open repositories such as BioModels.
Project mentors: Rahuman Sheriff | EMBL-EBI and Krishna Tiwari | EMBL-EBI
Group project four: Improving AI-based bioimage analysis
Artificial Intelligence (AI) algorithms outperform classical image analysis methods, however, the performance of these models is highly dependent on the quality of the annotated image datasets used to train them. In this project, you will explore the application of AI for biological imaging and the relationship between model training data and model performance. You will use models stored in the BioImage Model Zoo and data in the BioImage Archive to fine-tune and aggregate AI outputs. The aim of this project will be to test, evaluate, and improve model performance on a diverse set of microscopy images and annotations within the BioImage Archive. You will learn how to apply, train, tune, and employ the most performant state-of-the-art computer vision models. This project serves as a valuable demonstration of how FAIR (Findable, Accessible, Interoperable, Reusable) data plays an essential role in the training and enhancement of AI models.
Project mentors: Aybuke Kupcu Yoldas | EMBL-EBI and Craig Russel | EMBL-EBI
Group project five: Single-cell RNA-sequencing analysis with Python
In this project, you will learn how to perform single-cell RNA-sequencing data analysis to investigate cell type heterogeneity and expression differences across conditions. The analysis will be based on the SCANPY framework in Python. You will start by collecting the raw count matrix and relevant metadata from the Single-cell Expression Atlas. After constructing the AnnData objects, you will perform quality control, preprocessing, dimensionality reduction, cell type annotation, and differential expression analysis. We will also explore the batch effect and its correction.
Project mentors: Yuyao Song | EMBL-EBI and Anna Vathrakokoili-Pournara | EMBL-EBI
Group project six: Networks and pathways
This project will cover typical bioinformatics analysis steps needed to put differentially expressed genes into a wider biological context. You will start with gene expression data (RNA-seq) to build an initial interaction network. Next, you will learn to combine public network datasets, identify key regulators of biological pathways, and explore biological function through network analysis. You will get first-hand experience in integration and co-visualising with additional data and functional enrichment analysis. All this helps to put the initial results into a previously known context and provide hypotheses for potential follow up experiments. We will use Cytoscape, Expression Atlas, g:Profiler, StringDb, among other tools. We may also give a few R packages a try.
Project mentor: Priit Adler | University of Tartu
Who is this course for?
Applicants are expected to be at an early stage of using bioinformatics in their research with the need to develop their knowledge and skills further. No previous knowledge of programming is required for this course; group projects may give you the opportunity to learn basic programming, but participants will be supported in this by their mentors. Depending on your chosen project, an introductory programming tutorial may be given as homework prior to attending the course.
Though programming skills are not a prerequisite for attending the course, we will ask applicants to specify their current level of programming skills in the applications. This will allow the mentors to target the group projects better to the skills and needs of the final course participants.
During the course, there is a session titled ‘Computational and data visualisation skills using R’. In the application form, you will be asked if you would like to attend the introductory or intermediate groups. The introductory group is for anyone new to the programming language R or programming in general; the intermediate group is for anyone who can create simple visualisations in R. There will be an opportunity to switch to the other group at the start of the course if it is more suitable at that time.
What will I learn?
Learning outcomes
After this course you should be able to:
- Discuss applications of bioinformatics in biological research
- Browse, search, and retrieve biological data from public repositories
- Use appropriate bioinformatics tools to explore biological data
- Describe ways that biological data can be stored, organised, and integrated
Course content
During this course you will learn about:
- Bioinformatics as a science
- Designing bioinformatics studies
- Data management and reproducibility
- Basic tools and resources for bioinformatics
The exact range of resources and tools covered will vary depending on the group project undertaken; there will be no opportunity for you to analyse your own data during this course.
Trainers
Anna Vathrakokoili-Pournara
EMBL-EBI
Programme
All times in the programme are listed in BST.
Time
Topic
Trainer
Day one – Monday 17 June 2024
10:00 – 10:30
Registration and coffee
10:30 – 11:30
Welcome and introduction
Anna Swan
11:30 – 12:30
An introduction to EMBL-EBI data resources
Anna Swan
12:30 – 13:30
Lunch
13:30 – 14:30
The science of bioinformatics
Alex Bateman
14:30 – 15:30
Coffee break and poster session
15:30 – 17:00
Good data management: making your data FAIR
Dayane Araújo
17:00 – 18:00
Accommodation check-in at Hinxton Hall Conference Centre and free time
18:00 – 19:00
Pre-dinner talk, networking, and drinks
19:00
Evening meal at Hinxton Hall Conference Centre
Day two – Tuesday 18 June 2024
08:45 – 09:00
Arrivals and registration
09:00 – 09:30
Day introduction and mini-challenge
Anna Swan
09:30 – 11:00
Computational and data visualisation skills using R – introductory or intermediate
Priit Adler and Anna Swan
11:00 – 11:30
Coffee break
11:30 – 13:00
Computational and data visualisation skills using R – introductory or intermediate
Priit Adler and Anna Swan
13:00 – 14:00
Lunch featuring EMBL-EBI biocurators and poster session
14:00 – 17:30
Introduction to group projects and meet your mentors
All mentors
17:30
End of day two
17:30 – 18:30
Free time
18:30
Evening meal at Hinxton Hall Conference Centre
Day three – Wednesday 19 June 2024
08:45 – 09:00
Arrival and registrations
09:00 – 09:15
Day introduction and project update
Anna Swan
09:15 – 10:30
Group work
All
10:30 – 11:00
Coffee break
11:00 – 12:00
Group work
All
12:00 – 13:00
Keynote
Loïc Lannelongue
13:00 – 14:00
Lunch
14:00 – 15:30
Group work
All
15:30 – 16:00
Coffee break
16:00 – 17:30
Group work
All
17:30
End of day three
17:30 – 18:30
Free time
18:30
Evening meal at Hinxton Hall Conference Centre
Day four – Thursday 20 June 2024
08:45 – 09:00
Arrival and registrations
09:00 – 09:30
Day introduction and project two-minute updates
Anna Swan
09:30 – 10:30
Group work
All
10:30 – 11:00
Coffee break
11:00 – 13:00
Group work
All
13:00 – 14:00
Lunch
14:00 – 15:00
Group work
All
15:00 – 16:00
Keynote
Amanda M. Saravia-Butler
16:00 – 16:30
Coffee break
16:30 – 18:00
Group work
All
18:00 – 18:30
Free time
18:30 – 19:30
Pre-dinner drinks at The Red Lion, Hinxton
19:30
Dinner at The Red Lion, Hinxton
Day five – Friday 21 June 2024
08:45 – 09:00
Arrival and registrations
09:00 – 09:15
Day introduction
Anna Swan
09:15 – 10:30
Preparation of group project presentations
All
10:30 – 11:00
Coffee break
11:00 – 12:00
Group presentations
All
12:00 – 13:00
Lunch
13:00 – 14:00
Group presentations and award ceremony
All
14:00 – 14:30
Course feedback and wrap up
All
14:30
End of course
14:45
Coach to Cambridge Train Station
Thank you to those of you who have applied, we have now selected our participants and wait list. There are no more spaces available. Please register your interest for the 2025 course here. Details about the 2024 application process can be found below.
Please read our application support page before starting your application. In order to be considered for a place on this course, you must do the following:
- Complete the online application form.
- Ensure you add relevant information to the ‘submission details’ section where you are asked to provide information on your:
- pre-requisite skills and knowledge
- current work and course expectations
- data availability
- Upload one letter of support from your supervisor or a senior colleague detailing reasons why you should be selected for the course.
Please submit all documents during the application process by 23:59 (GMT) on 25 February 2024. Items marked * in the application are mandatory. Incomplete registrations will not be processed.
All applicants will be informed of the status of their application (successful, waiting list, unsuccessful) by 18 March 2024. If you have any questions regarding the application process please contact Sophie Spencer.
The registration fee of £850.00 includes:
- Catering as detailed on the course programme
- Accommodation for four nights (17, 18, 19, and 20 June)
- Bespoke course handbook with links to all course materials
- Use of a computer in the EMBL-EBI training suite throughout the course
- Shuttle bus on the final course day to Cambridge train station
EMBL-EBI
Programme
All times in the programme are listed in BST.
Time | Topic | Trainer |
Day one – Monday 17 June 2024 | ||
10:00 – 10:30 | Registration and coffee | |
10:30 – 11:30 | Welcome and introduction | Anna Swan |
11:30 – 12:30 | An introduction to EMBL-EBI data resources | Anna Swan |
12:30 – 13:30 | Lunch | |
13:30 – 14:30 | The science of bioinformatics | Alex Bateman |
14:30 – 15:30 | Coffee break and poster session | |
15:30 – 17:00 | Good data management: making your data FAIR | Dayane Araújo |
17:00 – 18:00 | Accommodation check-in at Hinxton Hall Conference Centre and free time | |
18:00 – 19:00 | Pre-dinner talk, networking, and drinks | |
19:00 | Evening meal at Hinxton Hall Conference Centre | |
Day two – Tuesday 18 June 2024 | ||
08:45 – 09:00 | Arrivals and registration | |
09:00 – 09:30 | Day introduction and mini-challenge | Anna Swan |
09:30 – 11:00 | Computational and data visualisation skills using R – introductory or intermediate | Priit Adler and Anna Swan |
11:00 – 11:30 | Coffee break | |
11:30 – 13:00 | Computational and data visualisation skills using R – introductory or intermediate | Priit Adler and Anna Swan |
13:00 – 14:00 | Lunch featuring EMBL-EBI biocurators and poster session | |
14:00 – 17:30 | Introduction to group projects and meet your mentors | All mentors |
17:30 | End of day two | |
17:30 – 18:30 | Free time | |
18:30 | Evening meal at Hinxton Hall Conference Centre | |
Day three – Wednesday 19 June 2024 | ||
08:45 – 09:00 | Arrival and registrations | |
09:00 – 09:15 | Day introduction and project update | Anna Swan |
09:15 – 10:30 | Group work | All |
10:30 – 11:00 | Coffee break | |
11:00 – 12:00 | Group work | All |
12:00 – 13:00 | Keynote | Loïc Lannelongue |
13:00 – 14:00 | Lunch | |
14:00 – 15:30 | Group work | All |
15:30 – 16:00 | Coffee break | |
16:00 – 17:30 | Group work | All |
17:30 | End of day three | |
17:30 – 18:30 | Free time | |
18:30 | Evening meal at Hinxton Hall Conference Centre | |
Day four – Thursday 20 June 2024 | ||
08:45 – 09:00 | Arrival and registrations | |
09:00 – 09:30 | Day introduction and project two-minute updates | Anna Swan |
09:30 – 10:30 | Group work | All |
10:30 – 11:00 | Coffee break | |
11:00 – 13:00 | Group work | All |
13:00 – 14:00 | Lunch | |
14:00 – 15:00 | Group work | All |
15:00 – 16:00 | Keynote | Amanda M. Saravia-Butler |
16:00 – 16:30 | Coffee break | |
16:30 – 18:00 | Group work | All |
18:00 – 18:30 | Free time | |
18:30 – 19:30 | Pre-dinner drinks at The Red Lion, Hinxton | |
19:30 | Dinner at The Red Lion, Hinxton | |
Day five – Friday 21 June 2024 | ||
08:45 – 09:00 | Arrival and registrations | |
09:00 – 09:15 | Day introduction | Anna Swan |
09:15 – 10:30 | Preparation of group project presentations | All |
10:30 – 11:00 | Coffee break | |
11:00 – 12:00 | Group presentations | All |
12:00 – 13:00 | Lunch | |
13:00 – 14:00 | Group presentations and award ceremony | All |
14:00 – 14:30 | Course feedback and wrap up | All |
14:30 | End of course | |
14:45 | Coach to Cambridge Train Station |
Thank you to those of you who have applied, we have now selected our participants and wait list. There are no more spaces available. Please register your interest for the 2025 course here. Details about the 2024 application process can be found below.
Please read our application support page before starting your application. In order to be considered for a place on this course, you must do the following:
- Complete the online application form.
- Ensure you add relevant information to the ‘submission details’ section where you are asked to provide information on your:
- pre-requisite skills and knowledge
- current work and course expectations
- data availability
- Upload one letter of support from your supervisor or a senior colleague detailing reasons why you should be selected for the course.
Please submit all documents during the application process by 23:59 (GMT) on 25 February 2024. Items marked * in the application are mandatory. Incomplete registrations will not be processed.
All applicants will be informed of the status of their application (successful, waiting list, unsuccessful) by 18 March 2024. If you have any questions regarding the application process please contact Sophie Spencer.
The registration fee of £850.00 includes:
- Catering as detailed on the course programme
- Accommodation for four nights (17, 18, 19, and 20 June)
- Bespoke course handbook with links to all course materials
- Use of a computer in the EMBL-EBI training suite throughout the course
- Shuttle bus on the final course day to Cambridge train station