spacer

SIFTS Statistics FTP

The FTP site provides access to data from the SIFTS initiative. The public ftp account is maintained by the European Bioinformatics Institute

The residue level cross reference data are available in XML format and are located in the XML directory. This directory contains file for each PDB entry.

The data for chain level mapping for all the PDB chains are also available in a tab delimited ASCII text format and are located in the "text" directory

The "text" directory contains a number of flat files exported from the MSD database. All of the files contain tab separated columns. The contents of each file are as follows:

pdb_chain_uniprot.lst A summary of the MSD to UniProt residue level mapping, showing the start and end residues of the mapping using SEQRES, PDB sequence and UniProt numbering.
pdb_chain_taxonomy.lst A summary of the NCBI tax_id(s),scientific_name(s) and chain type for each PDB chain that has been processed.
pdb_pubmed.lst A summary of the Pubmed id(s) associated with each PDB entry, together with an ordinal number.
pdb_chain_enzyme.lst A summary of the EC number(s) (derived via the UniProt mapping) for each PDB chain that has been processed.
pdb_chain_go.lst A summary of the GO identifier(s) (derived via the UniProt mapping) for each PDB chain that has been processed.
pdb_chain_interpro.lst A summary of the InterPro identifier(s) (derived via the UniProt mapping) for each PDB chain that has been processed.
pdb_chain_pfam.lst A summary of the Pfam domain identifier(s)(derived via the UniProt mapping) for each PDB chain that has been processed.
pdb_chain_cath_uniprot.lst A summary of the CATH identifier(s) and UniProt primary accession number(s) for each PDB chain that has been processed.
pdb_chain_scop_uniprot.lst A summary of the SCOP identifier(s) and UniProt primary accession number(s) for each PDB chain that has been processed.
Primary developers: Sameer Velankar, Harry Boutselakis, Phil McNeil, Dimitris Dimitropoulos, Antonio Suarez (MSD group) and Virginie Mittard, Daniel Barrell, Julius Jacobsen (Sequence database group).
Last modified: Tue March 12 11:02:10 BST 2008
TEMBLOR-European Community Contract No. QLRI-CT-2001-00015 Medical Research Council home page EMBL Heidelberg home page
spacer
spacer