0%

Search by sequence in CATH

The search by sequence uses FunFHMMer function prediction server, which takes a protein sequence in FASTA format or a UniProt/GenBank sequence identifier as input. The server identifies the functional family (FunFam) for a query sequence by scanning input against a library of FunFam HMMs in CATH. The output of the web server provides matched FunFam assignments within the query sequence and also highlights the multi-domain architecture of the sequence, where appropriate. Watch the demonstration about how to use the CATH sequence search tool (Video 1).

Video 1 CATH sequence search tool.

Let’s do it together: locate the position of structural domains on a protein sequence

  1. Download the FASTA sequence of Alanine Racemase from E. coli, using UniProt ID: P0A6B4 (359 amino acids) from UniProt database.
  2. Go to http://www.cathdb.info/search/by_sequence and paste the sequence in the text area and click on ‘Search’ button (Figure 6).
  3. The search can take some time to be processed as shown in the ‘Progress’ tab (Figure 6).
  4. Click on ‘Matching Domains’ or ‘Matching FunFams’ to view the search results (explained on the next page).
Figure 6 Search by sequence page in CATH.