Pfam is a database of protein sequence families. Each Pfam family is represented by a statistical model, known as a profile-hidden Markov model, which is trained using a curated alignment of representative sequences. These models can be searched against all protein sequences in order to find occurrences of Pfam families, thereby aiding the identification of evolutionarily-related (or homologous) sequences. As homologous proteins are more likely to share structural and functional features, Pfam families can aid in the annotation of uncharacterised sequences and guide experimental work.

Team members

Rob Finn
Simon Potter
Jaina Mistry
Alex Bateman