The patent protein databases contain protein sequences extracted from patent applications submitted to four patent offices: EPO, JPO, KIPO and USPTO. Updated EPO protein data is made available at each EMBL-Bank release. JPO and USPTO data are updated quarterly.
| Patent proteins | Description |
|---|---|
| EPO proteins | Protein sequences extracted from patent applications submitted to the European Patent Office (EPO). The EPO's policy is to release data to the public 18 months after the patent application date, regardless of whether a patent has been granted. |
| JPO proteins | Protein sequences extracted from patent applications submitted to the Japan Patent Office (JPO). |
| KIPO proteins | Protein sequences extracted from patent applications submitted to the Korean Intellectual Property Office (KIPO). |
| USPTO proteins | Protein sequences extracted from patent applications submitted to the United States Patent and Trademark Office (USPTO). |
Data Search and Query
To search sequences against the patent protein databases, use the similarity and homology search tools. Remember to select the database you want from the database list before running the search. For text queries, use EBI Search.
Downloads
| Patent proteins | FASTA format | EMBL format |
|---|---|---|
| EPO proteins | EPOP | epo_prt.dat |
| JPO proteins | JPOP | jpo_prt.dat |
| KIPO proteins | KPOP | kipo_prt.dat |
| USPTO proteins | USPTO | uspto_prt.dat |
For documentation about the format of these patent sequence files, please refer to the EMBL format documentation.
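As a rough illustration of working with a downloaded EMBL-format file, the sketch below splits a flat file into records. It assumes only the general EMBL flat-file convention (a two-letter line code at the start of each line, and a `//` line closing each record); the function name `iter_flatfile_records`, the `seq` key used for sequence lines without a code, and the sample records are all hypothetical and made up for this example. The line codes actually present in the patent protein files should be checked against the EMBL format documentation.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List


def iter_flatfile_records(lines: Iterable[str]) -> Iterator[Dict[str, List[str]]]:
    """Yield one dict per record, mapping each line code to its line contents.

    Assumption: lines start with a two-letter code, sequence lines start
    with blanks (stored here under the hypothetical key "seq"), and a line
    starting with "//" terminates the record.
    """
    record: Dict[str, List[str]] = defaultdict(list)
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("//"):
            # End of the current record: emit it and start a new one.
            if record:
                yield dict(record)
            record = defaultdict(list)
            continue
        if not line.strip():
            continue  # ignore empty lines
        code = line[:2].strip() or "seq"
        record[code].append(line[2:].strip())
    if record:
        # Emit a trailing record that lacks a closing "//".
        yield dict(record)


# Made-up sample data, not taken from any real patent protein file.
sample = """\
ID   EXAMPLE1
DE   Hypothetical description line
SQ   Sequence 10 AA;
     MKTAYIAKQR
//
ID   EXAMPLE2
DE   Another hypothetical record
SQ   Sequence 5 AA;
     MKTAY
//
"""

records = list(iter_flatfile_records(sample.splitlines()))
for rec in records:
    print(rec["ID"][0], "".join(rec["seq"]))
```

To read a real download, the same generator could be fed an open file handle, for example `iter_flatfile_records(open("epo_prt.dat"))`, assuming the file has been saved locally under that name and uncompressed.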
Publications
Li W., McWilliam H., Richart de la Torre A., Grodowski A., Benediktovich I., Goujon M., Nauche S. and Lopez R. (2010) Non-redundant patent sequence databases with value-added annotations at two levels. Nucleic Acids Research. 38(Database issue):D52-D56. DOI: 10.1093/nar/gkp960
