Pfam is a database of protein sequence families. Each Pfam family is represented by a statistical model, known as a profile hidden Markov model, which is trained using a curated alignment of representative sequences. These models can be searched against all protein sequences in order to find occurrences of Pfam families, thereby aiding the identification of evolutionarily-related (or homologous) sequences. As homologous proteins are more likely to share structural and functional features, Pfam families can aid in the annotation of uncharacterised sequences and guide experimental work.

Team members

Rob Finn
Jaina Mistry
Alex Bateman
Ruth Eberhardt