STRUCTURAL GENOMICS

The Target Information Server is a tool for searching and tracking Structural Genomics (SG) targets. These include public-domain SG, PDB, and prereleased PDB sequences. Targets are mainly protein sequences; however, the results of a search query also contain links to related structural data (PDB).

The data in our database has been extracted from the following sites:

The Berkeley Structural Genomics Center

The Joint Center for Structural Genomics

The Midwest Center for Structural Genomics

The New York Structural Genomics Research Consortium

The Northeast Structural Genomics Consortium

The Southeast Collaboratory for Structural Genomics

The TB Structural Genomics Consortium

The S2F Structure 2 Function Project

The BSGI Bacterial Structural Genomics Initiative

The SGPP Structural Genomics for Pathogenic Protozoa

RIKEN Group

Yeast Structural Genomics (France)

BNL Group

Note that the sgt database of sequences is created from the public XML files. The XML differs somewhat between the sites, and not all the site identifiers are unique. Until these inconsistencies are resolved, the data will not be updated routinely.

1. Target data will be described using the XML syntax proposed by the International Task Force.

Target data will be described according to the skeleton DTD (updated 25-July-2001).

2. The protocol for exchanging target data with the registration site will follow the Task Force recommendations:

The targets file is a concatenation of individual target entries. Target data files are updated weekly. Each target entry represents a single protein, not a family. Target entries will not be deleted. Abandoned targets will be identified with a "work stopped" status code.
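As a rough illustration of how a concatenated targets file could be read, here is a minimal Python sketch. The element names (`target`, `id`, `name`, `status`) and the sample entries are hypothetical; the real names are defined by the Task Force DTD, which is not reproduced here. The sketch also assumes the individual entries carry no XML declaration of their own.

```python
import xml.etree.ElementTree as ET

# Hypothetical element names and made-up entries, for illustration only.
# The real structure is defined by the Task Force DTD.
SAMPLE = """
<target><id>T001</id><name>protein A</name><status>cloned</status></target>
<target><id>T002</id><name>protein B</name><status>work stopped</status></target>
<target><id>T003</id><name>protein C</name><status>in PDB</status></target>
"""

def parse_targets(text):
    """Parse a concatenation of target entries into a list of dicts."""
    # A bare concatenation has no single root element, so wrap it in one.
    root = ET.fromstring("<targets>" + text + "</targets>")
    return [
        {child.tag: (child.text or "").strip() for child in entry}
        for entry in root.findall("target")
    ]

def abandoned(targets):
    """Return the ids of targets whose status is 'work stopped'."""
    return [
        t["id"] for t in targets
        if t.get("status", "").lower() == "work stopped"
    ]

targets = parse_targets(SAMPLE)
print(len(targets), abandoned(targets))
```

Because entries are never deleted, a reader of such a file would typically filter on the status code rather than expect abandoned targets to disappear.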

Targets can be tracked with this server either by searching for sequence similarity between the user's sequence(s) and SG targets, or by a direct detailed search (i.e. searching for targets by status, protein name, organism source, etc.).

Getting Started

The Target Information Server (MSDtarget) offers two main options to search and track targets (Fig.1):

Search SG-targets by sequence similarity, entering the sequence(s) by pasting, typing, or loading a file.

This scans for similar target sequences and outputs a hit list, which in turn provides links to target data (status and sequence alignment with the user's sequence).

Search/Track SG-targets:

Status

Protein Name

General Query

Task 1:>

public targets DBase targetDB

Search SG-targets by sequence similarity (option A)

Paste a sequence

Fig.1

Submit

MGSSHHHHHHDYDIPTTENLYFQGHMKVKILVDSTADVPFSWMEKYDIDSIPLYVVWEDG
RSEPDEREPEEIMNFYKRIREAGSVPKTSQPSVEDFKKRYLKYKEEDYDVVLVLTLSSKL
SGTYNSAVLASKEVDIPVYVVDTLLASGAIPLPARVAREMLENGATIEEVLKKLDERMKN
KDFKAIFYVSNFDYLVKGGRVSKFQGFVGNLLKIRVCLHIENGELIPYRKVRGDKKAIEA
LIEKLREDTPEGSKLRVIGVHADNEAGVVELLNTLRKSYEVVDEIISPMGKVITTHVGPG
TVGFGIEVLERKR

Fig.2

Column Matching Targets:

Click

Fig.3

Column Last Status:

Column Seq. Ln:

Column Similarity:

Column DBase name:

Column DBase ID:

Task 2:>

Submit

Upload sequences

Upload FASTA sequence(s) file:

FASTA example

AA Codes

Case1:

Case2:

Submit

Fig.4

Task 3:> (for Case1)

Browse...

1.fasta

Submit

Task 4:> (for Case2)

Browse...

2.fasta

Submit
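Before uploading a FASTA file as in Tasks 3 and 4, it may help to check it locally. The following Python sketch reads one or more FASTA records and reports any characters outside an assumed amino-acid alphabet. The alphabet (20 standard one-letter codes plus X) is an assumption, since the codes accepted by the server are not listed here; the second record is deliberately malformed.

```python
import io

# The 20 standard one-letter amino-acid codes plus X (unknown residue).
# Assumption: the set of codes the server actually accepts is not stated.
AA_CODES = set("ACDEFGHIKLMNPQRSTVWYX")

def read_fasta(handle):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:].strip(), []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        yield header, "".join(chunks)

def invalid_residues(sequence):
    """Return the sorted characters of the sequence not found in AA_CODES."""
    return sorted(set(sequence) - AA_CODES)

example = io.StringIO(
    ">query1 first line of the Task 1 sequence\n"
    "MGSSHHHHHHDYDIPTTENLYFQGHMKVKILVDSTADVPFSWMEKYDIDSIPLYVVWEDG\n"
    ">query2 deliberately malformed\n"
    "MKV1LVD\n"
)
records = list(read_fasta(example))
for header, seq in records:
    print(header.split()[0], len(seq), invalid_residues(seq))
```

The same reader handles a file with a single sequence and a file with several, so one check covers both upload tasks.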

Search/Track SG-targets (option B)

This search option allows targets to be searched directly and offers six ways of querying (Fig.5). Note that the result page of any query is a hit list with links to the details of each target. However, if a query finds only one hit, the target details are shown immediately (see Task 6:>).

General Query:

Fig.6

Task 5:>

General Query:

oct-2003

Submit

Target ID:

Task 6:>

Target ID:

281950

Submit

Fig.7

Protein Name & Species:

Protein Name:

Task 7a:>

D-mannonate hydrolase

Submit

Fig.8

Task 7b:>

D-mannonate

Submit

Fig.9

Task 7c:>

hydrolase

Submit

Fig.10

Species:

Task 8:>

Species:

ESCHERICHIA COLI

Submit

ESCHERICHIA COLI

Note

General Query:

Protein Name:

hydrolase

General Query:

hydrolase

Protein Name

Fig.10

Fig.6

Status:

Selected

Cloned

Expressed

In PDB

Work Stopped

Status:

DTD

Status:

Task 9:>

Status:

In PDB

Submit

Fig.11

Fig.12

Structural Genomics Centers:

Task 10:>

Structural Genomics Centers:

Berkeley SG Center

Submit

Remark: Users are encouraged to try out different inputs as search arguments and examine the results.
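To experiment with the option B queries offline, the behaviour described above can be mimicked on a small local list of targets. This Python sketch is only an approximation: the records are made up, the field names are hypothetical, and case-insensitive substring matching is an assumption, since the matching rules of the server are not documented here. It does reproduce the rule that a single hit shows the target details directly, while several hits produce a hit list.

```python
# Illustrative records only; the IDs, names and statuses are made up
# and do not come from the server.
TARGETS = [
    {"id": "A1", "name": "D-mannonate hydrolase",
     "species": "ESCHERICHIA COLI", "status": "In PDB"},
    {"id": "A2", "name": "putative hydrolase",
     "species": "THERMOTOGA MARITIMA", "status": "Cloned"},
    {"id": "A3", "name": "D-mannonate oxidoreductase",
     "species": "ESCHERICHIA COLI", "status": "Work Stopped"},
]

def search(targets, **criteria):
    """Keep targets whose fields contain every given term (case-insensitive)."""
    def matches(target):
        return all(
            term.lower() in target.get(field, "").lower()
            for field, term in criteria.items()
        )
    return [t for t in targets if matches(t)]

def render(hits):
    """Mimic the result page: details for one hit, otherwise a hit list."""
    if len(hits) == 1:
        return "details: " + hits[0]["id"]
    return "hit list: " + ", ".join(h["id"] for h in hits)

print(render(search(TARGETS, name="D-mannonate hydrolase")))
print(render(search(TARGETS, name="hydrolase")))
print(render(search(TARGETS, species="escherichia coli", status="in pdb")))
```

Narrowing a protein name from a full name to a single word, as in Tasks 7a to 7c, widens the hit list in this sketch for the same reason: each shorter term is contained in more names.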
