Parallel processing and pipeline building

All data and files for this section can be found in the EBI training FTP.

Scaling things up: Genome bioinformatics on clusters & parallel computing – lecture and practical

Trainers: Sean Laidlaw

Overview: This lecture gives an overview of how multiple biological datasets can be processed using different approaches, such as sequential and parallel computing. The practical provides training in parallelizing processes for genomic data processing and analysis.

Learning outcomes:

By the end of this session you will be able to:

  • List the steps required to conduct sequential and parallel computing.
  • Apply sequential and parallel computing to the processing and analysis of genomic data.
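
As a rough illustration of the difference between the two approaches, the sketch below processes several datasets first one after another and then through a pool of workers. It is not taken from the course materials: the sample names, the sequences, and the `gc_content`, `run_sequential` and `run_parallel` functions are all hypothetical. It assumes Python's standard `concurrent.futures` module and uses a thread pool to keep the example simple.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy data (sample name -> DNA sequence); not from the course files.
SAMPLES = {
    "sample_a": "ATGCGC",
    "sample_b": "ATATAT",
    "sample_c": "GGCC",
}


def gc_content(sequence):
    """Return the fraction of G and C bases in a sequence (0.0 if empty)."""
    if not sequence:
        return 0.0
    gc = sum(1 for base in sequence.upper() if base in "GC")
    return gc / len(sequence)


def run_sequential(samples):
    """Process the datasets one after another."""
    return {name: gc_content(seq) for name, seq in samples.items()}


def run_parallel(samples, workers=3):
    """Hand every dataset to a pool of workers; map() keeps the input order."""
    names = list(samples)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        values = list(pool.map(gc_content, (samples[name] for name in names)))
    return dict(zip(names, values))


if __name__ == "__main__":
    print(run_sequential(SAMPLES))
    print(run_parallel(SAMPLES))
```

Both functions return the same results; only the scheduling differs. For CPU-heavy work in Python, `ProcessPoolExecutor` offers the same `map` interface and is the usual substitute for the thread pool, and on a cluster the same idea is typically expressed as one scheduler job per dataset.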

Materials:


Building a pipeline

Trainers: Victor Flores Lopez

Overview: In this session participants will learn about the reasons for, and advantages of, building a workflow that integrates different tasks, and will be introduced to building their own pipeline.
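
The idea of a workflow that integrates different tasks can be sketched as a list of small steps, each taking the output of the previous one. This is a minimal illustration rather than the pipeline built in the session: the step functions `clean`, `drop_ambiguous` and `summarise`, and the `run_pipeline` helper, are hypothetical names chosen for the example.

```python
def clean(sequence):
    """Step 1: strip surrounding whitespace and normalise to upper case."""
    return sequence.strip().upper()


def drop_ambiguous(sequence):
    """Step 2: keep only unambiguous bases (A, C, G, T)."""
    return "".join(base for base in sequence if base in "ACGT")


def summarise(sequence):
    """Step 3: report the sequence length and the number of G/C bases."""
    return {
        "length": len(sequence),
        "gc": sum(1 for base in sequence if base in "GC"),
    }


# The pipeline is just the ordered list of steps.
PIPELINE = [clean, drop_ambiguous, summarise]


def run_pipeline(data, steps=PIPELINE):
    """Feed the data through each step in order and return the final result."""
    for step in steps:
        data = step(data)
    return data


if __name__ == "__main__":
    print(run_pipeline("  acgtnnGG\n"))
```

Keeping each task as a separate step makes it easy to reorder, replace or test steps individually, which is one of the main reasons for building a pipeline instead of running each task by hand.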

Materials: