The patent protein databases cover sequences of EPO proteins, JPO proteins, KIPO proteins and USPTO proteins. Protein sequences are extracted from patent applications submitted to different patent offices ( EPOJPOKIPO and  USPTO). Updated EPO protein data is made available at each EMBL-Bank release. JPO and USPTO data are updated quarterly.

Patent proteins Description
EPO proteins Protein sequences extracted from patent applications submitted to the  European Patent Office (EPO). The EPO's policy is to release data to the public 18 months after the patent application date, independent of whether a patent has been granted or not.
JPO proteins Protein sequences extracted from patent applications to the  Japan Patent Office (JPO).
KIPO proteins Protein sequences extracted from patent applications to the  Korean Intellectual Property Office (KIPO).
USPTO proteins Protein sequences extracted from patent applications to the  United States Patent and Trademark Office (USPTO).

Data Search and Query

To search sequences against the patent proteins, the user can use  similarity & homology search tools. Please remember to choose the database you wish from the database list before running the search. For text query, the user can use  EBI Search

Downloads

Patent proteins FASTA format EMBL format
EPO proteins EPOP epo_prt.dat
JPO proteins JPOP jpo_prt.dat
KIPO proteins KPOP kipo_prt.dat
USPTO proteins USPTO uspto_prt.dat

For documentation about the format of these patent sequence files, please refer to the  EMBL format documentation.