Alex Bateman
Senior Team Leader – Protein Sequence Resources
agb [at] ebi.ac.uk
ORCID: 0000-0002-6982-4660
EditProviding protein and RNA family resources
Senior Team Leader – Protein Sequence Resources
agb [at] ebi.ac.uk
ORCID: 0000-0002-6982-4660
EditAntiFam is a collection of HMMs to help identify protein sequences in the databases that are likely to be false predictions.
A web-based engine for small RNA sequence analysis, quantitation and variant calling.
InterPro is used to classify proteins into families and to predict the presence of domains and functionally important site. The project integrates signatures from 13 major protein signature databases: CATH-Gene3D, CDD, HAMAP, PANTHER, Pfam, PIRSF, PRINTS, PROSITE (patterns and profiles), SFLD,…
InterProScan is a tool that provides automated functional analysis of protein and nucleic acid sequences, the latter via a full six-frame translation. It offers the ability to identify both structural and functional regions of interest, based upon methods and models that have been generated by a la…
The MEROPS database comprises proteolytic enzymes (also termed proteases, proteinases and peptidases), their substrates and inhibitors. MEROPS uses a hierarchical, structure-based classification of proteolytic enzymes and protein inhibitors. Each peptidase or inhibitor is assigned to a Family on th…
Pfam is a database of protein sequence families. Each Pfam family is represented by a statistical model, known as a profile-hidden Markov model, which is trained using a curated alignment of representative sequences. These models can be searched against all protein sequences in order to find occurre…
Search a sequence against the Pfam HMM library
Rfam is a curated database of non-coding RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models. Our families may be divided into non-coding RNA genes, structured cis-regulatory elements and self-splicing RNAs. Rfam families are created f…
Search of Rfam’s covariance model collection
RNAcentral is a database of non-coding RNA sequences that provides a single entry point for accessing the data from an international consortium of RNA resources. RNAcentral provides a unified view of non-coding RNA sequence data and aims to represent all non-coding RNA types from all organisms
Do you want to be part of EMBL’s newly created AI Hub, using AI to solve complex interdisciplinary challenges? We’re seeking a visionary scientist to establish and lead a new AI Engineering & Automation Team at the AI Hub Heidelberg as part of EMBL AI, a major institutional initiative to embed AI ac…
Closes on 12th June. Posted 29th May 2026
EditWe are looking for a highly motivated Computational Biologist / Bioinformatician to join the Chemical Biology Resources team at the European Bioinformatics Institute (EMBL-EBI), located on the Wellcome Genome Campus near Cambridge, as part of the Wellcome Trust funded LIGMAP project, a multi-discipl…
Closes on 18th June. Posted 28th May 2026
Edit