Patent nucleotides are the sequences in the patent data class (PAT) of EMBL-Bank. EMBL-Bank is divided into data classes, and PAT is the class that holds all patent nucleotide sequences. The data are made available as part of the EMBL-Bank release, which is updated quarterly.

Data Search and Query

To search sequences against the patent nucleotides, the user can use the similarity & homology search tools. Please remember to choose the database you require from the database list before running the search. For text queries, the user can use SRS and EB-eye.

Downloads

EMBL format and FASTA format files for the released and daily-updated patent nucleotides are available at the FTP sites listed in the table below.

PAT class | Release / Daily update | File format | FTP site | File names
PAT class in EMBL Release | Release | EMBL | ftp://ftp.ebi.ac.uk/pub/databases/embl/patent/ | rel_pat_*.dat.gz
PAT class in EMBL Release | Release | FASTA | ftp://ftp.ebi.ac.uk/pub/databases/fastafiles/emblrelease/ | em_rel_pat*.gz
PAT class in EMBL NEW | Daily update | EMBL | ftp://ftp.ebi.ac.uk/pub/databases/embl/new/ | cum_pat_*.dat.gz
PAT class in EMBL NEW | Daily update | FASTA | ftp://ftp.ebi.ac.uk/pub/databases/fastafiles/emblnew/ | em_cum_pat_*.gz
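
The file-name patterns in the table can be applied to a directory listing to pick out the PAT files. The following is a minimal Python sketch, not an official EBI tool. The directory URLs and patterns are copied from the table, while the function name `select_pat_files`, the dictionary keys, and the example listing are hypothetical and used only for illustration.

```python
from fnmatch import fnmatchcase

# Directory URLs and file-name patterns copied from the table above.
# The ("release"/"daily", "embl"/"fasta") keys are labels invented for this sketch.
PAT_SOURCES = {
    ("release", "embl"): ("ftp://ftp.ebi.ac.uk/pub/databases/embl/patent/", "rel_pat_*.dat.gz"),
    ("release", "fasta"): ("ftp://ftp.ebi.ac.uk/pub/databases/fastafiles/emblrelease/", "em_rel_pat*.gz"),
    ("daily", "embl"): ("ftp://ftp.ebi.ac.uk/pub/databases/embl/new/", "cum_pat_*.dat.gz"),
    ("daily", "fasta"): ("ftp://ftp.ebi.ac.uk/pub/databases/fastafiles/emblnew/", "em_cum_pat_*.gz"),
}


def select_pat_files(listing, update, file_format):
    """Return full URLs for the names in `listing` that match the PAT pattern."""
    base_url, pattern = PAT_SOURCES[(update, file_format)]
    # fnmatchcase keeps the match case-sensitive on every platform.
    return [base_url + name for name in sorted(listing) if fnmatchcase(name, pattern)]


# Hypothetical directory listing; the real file names on the server may differ.
example_listing = ["rel_pat_01.dat.gz", "rel_std_hum_01.dat.gz", "README"]
for url in select_pat_files(example_listing, "release", "embl"):
    print(url)
```

In practice the listing would come from the FTP directory itself (for example via Python's `ftplib`); it is hard-coded here so the sketch runs without network access.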

Documentation

For documentation about the format of these patent sequence files, please refer to the EMBL format documentation.
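
As a rough illustration of working with a downloaded EMBL format file, the sketch below splits a flat file into entries and reads each entry's identifier. It assumes only two features of the EMBL flat-file layout: each entry ends with a line containing `//`, and the identifier is the first token on the `ID` line. The helper names and the sample entries are made up for this example; the format documentation remains the authoritative reference.

```python
import gzip
import io


def iter_embl_entries(handle):
    """Yield each entry as a list of lines, using '//' as the entry terminator."""
    entry = []
    for line in handle:
        line = line.rstrip("\n")
        if line == "//":
            if entry:
                yield entry
            entry = []
        else:
            entry.append(line)
    if entry:  # tolerate a final entry that lacks the terminator line
        yield entry


def entry_id(entry):
    """Return the first token of the ID line, or None if there is no ID line."""
    for line in entry:
        if line.startswith("ID "):
            return line[2:].split()[0].rstrip(";")
    return None


def read_ids(path):
    """Read identifiers from a gzip-compressed EMBL flat file at `path`."""
    with gzip.open(path, "rt", encoding="ascii", errors="replace") as handle:
        return [entry_id(entry) for entry in iter_embl_entries(handle)]


# Made-up entries for demonstration only; not real database records.
sample = io.StringIO(
    "ID   EXAMPLE1; SV 1; linear; DNA; PAT; UNC; 4 BP.\n"
    "SQ   Sequence 4 BP;\n"
    "     acgt\n"
    "//\n"
    "ID   EXAMPLE2; SV 1; linear; DNA; PAT; UNC; 4 BP.\n"
    "//\n"
)
print([entry_id(entry) for entry in iter_embl_entries(sample)])
```

For real files, `read_ids` can be pointed at a local copy of one of the `.dat.gz` downloads. Because entries are yielded one at a time, the whole file never has to be held in memory.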