Course content

The ‘Data science for life scientists’ course provides an introduction to a range of topics in data science and is aimed at life science researchers and students who are new to the field. For full details about the live course, please refer to the course page and programme.

The table of contents below lists the sessions included in the course, along with the format of the materials (slides/practicals) and each session’s trainer. You can work through the materials at your own pace using the arrows at the end of each page. If you are only interested in a specific topic, you can jump straight to it from the table of contents below or from the side menu.

Please note that if you are new to Python but interested in the “Introduction to statistics” or “Machine learning” sections, it is essential to first go through the “Introduction to Python” section.

Legend

Presentation slides – View or download the slides

Practicals – View or download the slides, exercises, datasets, analysis scripts and results

Table of contents

Subject – Trainer
Introduction to Python
Getting started with Python – Andrian Yang, Iris Diana Yu
Data handling in Python – Andrian Yang, Iris Diana Yu
Data visualisation in Python – Andrian Yang, Iris Diana Yu
Introduction to statistics – Sarah Kaspar
EMBL-EBI data resources
Introduction to EMBL-EBI data resources – Flaminia Zane
Programmatic access to data resources – Nandana Madhusoodanan, Fabio Madeira
Introduction to machine learning
Machine learning: introductory lecture – Melissa Adasme, Jiawei Wang
Machine learning practical – Melissa Adasme, Jiawei Wang
Application of machine learning in biosciences – Jiawei Wang
Introduction to network analysis and Cytoscape – Kalpana Panneerselvam, Juan Jose Medina
Data science: a broader view
Good data management: making your data FAIR – Ajay Mishra
Graphic design principles for data visualisation – Holly Joynes
Environmental impact of computing and bioinformatics – Loïc Lannelongue
A journey from data ethics to AI governance – Nathanael Sheehan
Group projects (short research challenges)
Gene expression (single-cell RNA-seq) – Iris Diana Yu, Daianna Gonzalez-Padilla
Protein structure (with Protein Data Bank in Europe) – Cristian Escobar Bravo, Paulyna Magaña