ML in the drug discovery pipeline

Figure 2 explains how machine learning can be used in each step of the drug development pipeline. This tutorial will focus mainly on drug target identification. Drug targets are molecules (e.g., proteins) that have a causal relationship with a disease and can be modulated by a drug to produce a therapeutic effect. Target identification is the first step in the drug discovery pipeline, and successful target identification can increase the success rate of drug development.

The next section of this tutorial will provide an introduction to WEKA and how it can be used to develop ML models for target identification. Later in the tutorial, we will introduce a number of ML/NLP tools that can help in target identification.

Target identification and validation, compound screening and lead discovery, pre-clinical development, clinical development.
Figure 2 Examples of applications of machine learning in various stages of the drug discovery pipeline. Image courtesy of Jessica Vamathevan, EMBL-EBI. Further details about these stages are discussed in Vamathevan et al, 2019 [11].