spacer
spacer

Protein Databases


Sample UniProtKB/Swiss-Prot entry

Go to the main EBI website

ID   GRAA_HUMAN     STANDARD;      PRT;   262 AA.
AC   P12544;
DT   01-OCT-1989 (Rel. 12, Created)
DT   01-OCT-1989 (Rel. 12, Last sequence update)
DT   15-JUN-2002 (Rel. 41, Last annotation update)
DE   Granzyme A precursor (EC 3.4.21.78) (Cytotoxic T-lymphocyte proteinase
DE   1) (Hanukkah factor) (H factor) (HF) (Granzyme 1) (CTL tryptase)
DE   (Fragmentin 1).
GN   GZMA OR CTLA3 OR HFSP.
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX   NCBI_Taxid=9606;
RN   [1]
RP   SEQUENCE FROM N.A.
RC   TISSUE=T-CELL;
RX   MEDLINE=88125000; PubMed=3257574;
RA   Gershenfeld H.K., Hershberger R.J., Shows T.B., Weissman I.L.;
RT   "Cloning and chromosomal assignment of a human cDNA encoding a T cell-
RT   and natural killer cell-specific trypsin-like serine protease.";
RL   Proc. Natl. Acad. Sci. U.S.A. 85:1184-1188(1988).
RN   [2]
RP   SEQUENCE FROM N.A.
RC   TISSUE=BLOOD;
RA   Strausberg R.;
RL   Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases.
RN   [3]
RP   SEQUENCE OF 1-23 FROM N.A.
RA   Goralski T.J., Krensky A.M.;
RT   "The upstream region of the human granzyme A locus contains both
RT   positive and negative transcriptional regulatory elements.";
RL   Submitted (NOV-1995) to the EMBL/GenBank/DDBJ databases.
RN   [4]
RP   SEQUENCE OF 29-53.
RX   MEDLINE=88330824; PubMed=3047119;
RA   Poe M., Bennett C.D., Biddison W.E., Blake J.T., Norton G.P.,
RA   Rodkey J.A., Sigal N.H., Turner R.V., Wu J.K., Zweerink H.J.;
RT   "Human cytotoxic lymphocyte tryptase. Its purification from granules
RT   and the characterization of inhibitor and substrate specificity.";
RL   J. Biol. Chem. 263:13215-13222(1988).
RN   [5]
RP   SEQUENCE OF 29-40, AND CHARACTERIZATION.
RX   MEDLINE=89009866; PubMed=3262682;
RA   Hameed A., Lowrey D.M., Lichtenheld M., Podack E.R.;
RT   "Characterization of three serine esterases isolated from human IL-2
RT   activated killer cells.";
RL   J. Immunol. 141:3142-3147(1988).
RN   [6]
RP   SEQUENCE OF 29-39, AND CHARACTERIZATION.
RX   MEDLINE=89035468; PubMed=3263427;
RA   Kraehenbuhl O., Rey C., Jenne D.E., Lanzavecchia A., Groscurth P.,
RA   Carrel S., Tschopp J.;
RT   "Characterization of granzymes A and B isolated from granules of
RT   cloned human cytotoxic T lymphocytes.";
RL   J. Immunol. 141:3471-3477(1988).
RN   [7]
RP   3D-STRUCTURE MODELING.
RX   MEDLINE=89184501; PubMed=3237717;
RA   Murphy M.E.P., Moult J., Bleackley R.C., Gershenfeld H.,
RA   Weissman I.L., James M.N.G.;
RT   "Comparative molecular model building of two serine proteinases from
RT   cytotoxic T lymphocytes.";
RL   Proteins 4:190-204(1988).
CC   -!- FUNCTION: THIS ENZYME IS NECESSARY FOR TARGET CELL LYSIS IN CELL-
CC       MEDIATED IMMUNE RESPONSES. IT CLEAVES AFTER LYS OR ARG. MAY BE
CC       INVOLVED IN APOPTOSIS.
CC   -!- CATALYTIC ACTIVITY: HYDROLYSIS OF PROTEINS, INCLUDING FIBRONECTIN,
CC       TYPE IV COLLAGEN AND NUCLEOLIN. PREFERENTIAL CLEAVAGE: ARG-|-XAA,
CC       LYS-|-XAA >> PHE-|-XAA IN SMALL MOLECULE SUBSTRATES.
CC   -!- SUBUNIT: HOMODIMER; DISULFIDE-LINKED.
CC   -!- SUBCELLULAR LOCATION: CYtopLASMIC GRANULES.
CC   -!- SIMILARITY: BELONGS TO PEPTIDASE FAMILY S1. GRANZYME SUBFAMILY.
CC   --------------------------------------------------------------------------
CC   This Swiss-Prot entry is copyright. It is produced through a collaboration
CC   between  the Swiss Institute of Bioinformatics  and the  EMBL outstation -
CC   the European Bioinformatics Institute.  There are no  restrictions on  its
CC   use  by  non-profit  institutions as long  as its content  is  in  no  way
CC   modified and this statement is not removed.  Usage  by  and for commercial
CC   entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC   or send an email to license@isb-sib.ch).
CC   --------------------------------------------------------------------------
DR   EMBL; M18737; AAA52647.1; -.
DR   EMBL; BC015739; AAH15739.1; -.
DR   EMBL; U40006; AAD00009.1; -.
DR   PIR; A28943; A28943.
DR   PIR; A30525; A30525.
DR   PIR; A30526; A30526.
DR   PIR; A31372; A31372.
DR   PDB; 1HF1; 15-OCT-94.
DR   MEROPS; S01.135; -.
DR   Genew; HGNC:4708; GZMA.
DR   MIM; 140050; -.
DR   InterPro; IPR001254; Ser_protease_Try.
DR   Pfam; PF00089; trypsin; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
DR   PROSITE; PS00135; TRYPSIN_SER; 1.
KW   Hydrolase; Serine protease; Zymogen; Signal; T-cell; Cytolysis;
KW   Apoptosis; 3D-structure.
FT   SIGNAL        1     26
FT   PROPEP       27     28       ACTIVATION PEPTIDE.
FT   CHAIN        29    262       GRANZYME A.
FT   ACT_SITE     69     69       CHARGE RELAY SYSTEM (BY SIMILARITY).
FT   ACT_SITE    114    114       CHARGE RELAY SYSTEM (BY SIMILARITY).
FT   ACT_SITE    212    212       CHARGE RELAY SYSTEM (BY SIMILARITY).
FT   DISULFID     54     70       BY SIMILARITY.
FT   DISULFID    148    218       BY SIMILARITY.
FT   DISULFID    179    197       BY SIMILARITY.
FT   DISULFID    208    234       BY SIMILARITY.
FT   CARBOHYD    170    170       N-LINKED (GLCNAC...) (POTENTIAL).
SQ   SEQUENCE   262 AA;  28968 MW;  DA87363A0D92BAF4 CRC64;
     MRNSYRFLAS SLSVVVSLLL IPEDVCEKII GGNEVTPHSR PYMVLLSLDR KTICAGALIA
     KDWVLTAAHC NLNKRSQVIL GAHSITREEP TKQIMLVKKE FPYPCYDPAT REGDLKLLQL
     TEKAKINKYV TILHLPKKGD DVKPGTMCQV AGWGRTHNSA SWSDTLREVN ITIIDRKVCN
     DRNHYNFNPV IGMNMVCAGS LRGGRDSCNG DSGSPLLCEG VFRGVTSFGL ENKCGDPRGP
     GVYILLSKKH LNWIIMTIKG AV
//

Abbreviation Key:

entryname dataclass molecule sequence length (Amino Acids)
GRAA_HUMAN standard PRT (protein) 262 AA

  • The AC (Accession Number) line lists the accession numbers associated with this entry.

  • The DT (DaTe) line shows when an entry first appeared in the the database and when it was last updated.

  • The DE (DEscription) lines contain general descriptive information about the sequence stored.

  • The GN (Gene Name) line contains the name(s) of the gene(s) that code for the stored protein sequence.

  • The OS (Organism Species) line specifies the preferred scientific name of the organism which was the source of the stored sequence.

  • The OG (OrGanelle) line indicates if the gene coding for a protein originates from the mitochondria, the chloroplast, a cyanelle, or a plasmid.

  • The OC (Organism Classification) lines contain the taxonomic classification of the source organism.

  • The OX (Organism taxonomy cross-reference) line is used to indicate the identifier to a specific organism in a taxonomic database.

  • The RN (Reference Number) line gives a sequential number to each reference citation in an entry.

  • The RP (Reference Position) line describes the extent of the work carried out by the authors of the reference cited.

  • The RC (Reference Comment) lines are optional lines which are used to store comments relevant to the reference cited.

  • The RX (Reference cross-reference) line is an optional line which is used to indicate the identifier assigned to a specific reference in a bibliographic database.

  • The RA (Reference Author) lines list the authors of the paper (or other work) cited.

  • The RT (Reference Title) lines give the title of the paper (or other work) cited as exactly as possible given the limitations of the computer character set.

  • The RL (Reference Location) lines contain the conventional citation information for the reference.

  • The RG (Reference Group) lines lists the consortium name associated with a given citation. RG line is mainly used in submission reference blocks, but could also be used in paper reference if the working group is cited as an author in the paper.

    Note: RA (Reference Author) and RG line could be present in the same reference block; at least one RA or RG line is mandatory per reference block.

    The format for this line will be:

    RG Consortium_name;

    Examples:

    RG The C. elegans Sequencing Consortium;
    RG The Brazilian Network for HIV Isolation and Characterization.

  • The CC lines are free text comments on the entry, and are used to convey any useful information.

  • The DR (Database cross-Reference) lines are used as pointers to information related to UniProtKB/Swiss-Prot entries and found in data collections other than UniProtKB/Swiss-Prot.

  • The KW (KeyWord) lines provide information that can be used to generate indexes of the sequence entries based on functional, structural, or other categories.

  • The FT (Feature Table) lines provide a precise but simple means for the annotation of the sequence data. The table describes regions or sites of interest in the sequence. In general the feature table lists posttranslational modifications, binding sites, enzyme active sites, local secondary structure or other characteristics reported in the cited references.

  • The SQ (SeQuence header) line marks the beginning of the sequence data and gives a quick summary of its content.

  • The sequence data line has a line code consisting of two blanks rather than the two-letter codes used until now. The sequence counts 60 amino acids per line, in groups of 10 amino acids, beginning in position 6 of the line.

  • The // (terminator) line contains no data or comments and designates the end of an entry.

See the UniProtKB/Swiss-Prot user manual for a more detailed and comprehensive description of the entries.




spacer
spacer