Parallel processing and Pipeline building
All data and files for this section can be found in the EBI training FTP.
Scaling things up: Genome bioinformatics on clusters & parallel computing – lecture and practical
Trainers: Sean Laidlaw
Overview: This lecture provides an overview of processing multiple biological datasets through a variety of methods, such as sequential and parallel computing. The practical provides training on parallelize processes for genomic data processing and analysis.
Learning outcomes:
By the end of this session you will be able to:
- List the steps required to conduct sequential and parallel computing.
- Know how to apply sequential and parallel computing for the processing and analysis of genomic data.
Materials:
- ‘Scaling things up’ slides
- Material for practical exercises:
Building a pipeline
Trainers: Victor Flores Lopez
Overview: In this session participants will learn about the reasons and advantages to build a workflow that integrates different tasks, and will have an introduction to building their own pipeline.
Materials: