--- 
- 
  aux_for_url: 0
  base_id_url: https://www.ebi.ac.uk/chembldb/compound/inspect/
  base_id_url_available: 1
  description: A database of bioactive drug-like small molecules and bioactivities abstracted from the scientific literature.
  name: chembl
  name_label: ChEMBL
  name_long: ChEMBL
  src_compound_id: 
    - CHEMBL373204
  src_id: 1
  src_url: https://www.ebi.ac.uk/chembl/
- 
  aux_for_url: 0
  base_id_url: http://www.genome.jp/dbget-bin/www_bget?
  base_id_url_available: 1
  description: KEGG LIGAND is a composite DB consisting of COMPOUND, GLYCAN, REACTION, RPAIR, RCLASS, and ENZYME DBs, whose entries are identified by C, G, R, RP, RC, and EC numbers, respectively.
  name: kegg_ligand
  name_label: KEGG Ligand
  name_long: KEGG (Kyoto Encyclopedia of Genes and Genomes) Ligand
  src_compound_id: 
    - C10984
  src_id: 6
  src_url: http://www.genome.jp/kegg/ligand.html
- 
  aux_for_url: 0
  base_id_url: http://www.ebi.ac.uk/chebi/searchId.do?chebiId=CHEBI%3A
  base_id_url_available: 1
  description: ChEBI is a freely available dictionary of molecular entities focused on 'small' chemical compounds
  name: chebi
  name_label: ChEBI
  name_long: ChEBI (Chemical Entities of Biological Interest).
  src_compound_id: 
    - 4042
  src_id: 7
  src_url: http://www.ebi.ac.uk/chebi/downloadsForward.do
- 
  aux_for_url: 0
  base_id_url: http://www.emolecules.com/cgi-bin/more?vid=
  base_id_url_available: 1
  description: A free chemical structure search engine containing millions of public domain structures. Pricing, availabilities, and vendor information requires an eMolecules Plus subscription.
  name: emolecules
  name_label: eMolecules
  name_long: eMolecules
  src_compound_id: 
    - 1934429
  src_id: 10
  src_url: http://www.emolecules.com/
- 
  aux_for_url: 0
  base_id_url: http://www-935.ibm.com/services/us/gbs/bao/siip/nih/?sid=
  base_id_url_available: 1
  description: The data are provided by IBM-NIH and include all chemistry extracted by means of text and image mining from the patent corpus (USPTO, WIPO and EPO) for patent documents published through 31-12-2010. Identifiers in UniChem are IBM compound identifiers.
  name: ibm
  name_label: IBM Patent System
  name_long: IBM strategic IP insight platform and the National Institutes of Health
  src_compound_id: 
    - 0719DC552D187D1F8ABADF1EA874313B
  src_id: 11
  src_url: http://www-935.ibm.com/services/us/gbs/bao/siip/nih/
- 
  aux_for_url: 0
  base_id_url: http://worldwide.espacenet.com/searchResults?DB=EPODOC&locale=en_EP&ST=advanced&compact=false&PN=
  base_id_url_available: 1
  description: "The data are provided by IBM-NIH and include exact compounds extracted by means of text and image mining from the patent corpus (USPTO, WIPO and EPO) for patent documents published through 31-12-2010. Further filters included removal of: 1. All molecules mapping to > 10,000 patents, 2. Non-organic molecules, 3. Very small molecules (MW < 90, number of atoms < 7). In addition, for structures mapping to > 100 patents, only 100 randomly selected patent identifiers were included. Identifiers in UniChem are patent number identifiers."
  name: ibm_patents
  name_label: IBM Patents
  name_long: IBM strategic IP insight platform and the National Institutes of Health, Patents
  src_compound_id: 
    - US20090163516
    - EP0805805B1
    - WO2009002809A2
    - US20050228027
    - US20080202259
    - US20040076670
    - US6905699
    - US20030032811
    - US7442801
    - US20060147632
    - US4541963
    - US20090197768
    - EP1594878B1
    - US6191155
    - US20080182755
    - US20080227635
    - EP2261197A1
    - EP1546144A2
    - WO2010063986A2
    - US6746990
    - US7772281
    - EP1686851A1
    - EP2088858A1
    - US20090137395
    - EP1201646A1
    - EP0098561A2
    - EP0374796B1
    - US5945113
    - US20090297871
    - WO2009071450A1
    - US6828300
    - WO2002062144A2
    - US6045816
    - US5396730
    - EP1711474A1
    - EP1399019A1
    - WO2005053401A2
    - EP1131152B1
    - US20030206965
    - WO2008031512A1
    - WO2007066346A3
    - US20070298970
    - US20080193387
    - EP0801896B1
    - EP1931312A2
    - US7358214
    - US20060241098
    - EP2207421A1
    - WO2006063848A1
    - WO2003093223A1
    - US4379147
    - WO2007014913A1
    - EP1210877A1
    - EP0527037B1
    - EP0867186B1
    - US20100210461
    - EP0369614A1
    - EP0828734A1
    - US20080108115
    - WO2006128870A3
    - EP1139739B1
    - WO2008092580A2
    - US6716874
    - WO2006094792A1
    - WO2005035508A2
    - EP1599547A2
    - WO2009062905A1
    - EP1890536A1
    - US6849744
    - US5114461
    - US7064108
    - US20080108667
    - WO2004068945A1
    - US6479489
    - US20100120619
    - US20100152194
    - US20060246151
    - US20030229139
    - US5994386
    - EP2034840A2
    - EP1952690A2
    - EP0283173A1
    - WO1999059967A1
    - US6090415
    - EP0214936B1
    - EP0643040B1
    - EP2258181A2
    - US20080176746
    - US20100190646
    - EP1164849B1
    - US20060217447
    - WO2010094728A1
    - WO2009077197A1
    - EP1432308A1
    - US5922880
    - EP2031961A2
    - EP1490370A2
    - US4963584
    - WO2006124606A3
    - US20070270612
  src_id: 13
  src_url: http://www-935.ibm.com/services/us/gbs/bao/siip/nih/
- 
  aux_for_url: 0
  base_id_url: https://www.surechembl.org/chemical/
  base_id_url_available: 1
  description: SureChEMBL automatically extracts chemistry from the full text of all major patent authorities. Compounds are derived from either chemical names found in text or in chemical depictions. All SureChEMBL compounds are included, except those failing UniChem loading rules.
  name: surechembl
  name_label: SureChEMBL
  name_long: SureChEMBL
  src_compound_id: 
    - SCHEMBL21972
  src_id: 15
  src_url: https://www.surechembl.org
- 
  aux_for_url: 0
  base_id_url: http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?sid=
  base_id_url_available: 1
  description: "A subset of the PubChem DB: from the original depositor 'Thomson Pharma'."
  name: pubchem_tpharma
  name_label: "PubChem: Thomson Pharma "
  name_long: PubChem ('Thomson Pharma' subset)
  src_compound_id: 
    - 14806709
  src_id: 21
  src_url: http://www.thomson-pharma.com/
- 
  aux_for_url: 0
  base_id_url: http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=
  base_id_url_available: 1
  description: A database of normalized PubChem compounds (CIDs) from the PubChem Database.
  name: pubchem
  name_label: PubChem
  name_long: PubChem Compounds
  src_compound_id: 
    - 2912
  src_id: 22
  src_url: http://pubchem.ncbi.nlm.nih.gov
- 
  aux_for_url: 0
  base_id_url: http://actor.epa.gov/actor/GenericChemical?casrn=
  base_id_url_available: 1
  description: ACToR (Aggregated Computational Toxicology Resource)
  name: actor
  name_label: ACToR
  name_long: ACToR
  src_compound_id: 
    - 137497-61-1
    - 52315-07-8
  src_id: 26
  src_url: http://actor.epa.gov/actor/faces/ACToRHome.jsp
- 
  aux_for_url: 0
  base_id_url: https://www.molport.com/shop/molecule-link/
  base_id_url_available: 1
  description: MolPort. A database designed to assist users find commercial sources of compounds. Access requires (free) registration.
  name: molport
  name_label: MolPort
  name_long: MolPort
  src_compound_id: 
    - MolPort-003-846-106
  src_id: 28
  src_url: https://www.molport.com/shop/index
- 
  aux_for_url: 0
  base_id_url: http://nikkajiweb.jst.go.jp/nikkaji_web/pages/top_e.jsp?CONTENT=syosai&SN=
  base_id_url_available: 1
  description: " Nakkaji (The Japan Chemical Substance Dictionary) is an organic compound dictionary database prepared by the Japan Science and Technology Agency (JST)."
  name: nikkaji
  name_label: Nikkaji
  name_long: Nikkaji
  src_compound_id: 
    - J9.674A
  src_id: 29
  src_url: " http://nikkajiweb.jst.go.jp/nikkaji_web/pages/top_e.jsp"