Dbfetch

ID   AY085452; SV 1; linear; mRNA; STD; PLN; 687 BP.
XX
AC   AY085452;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-FEB-2006 (Rel. 86, Last updated, Version 4)
XX
DE   Arabidopsis thaliana clone 15222 mRNA, complete sequence.
XX
KW   FLI_CDNA.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
XX
RN   [1]
RP   1-687
RX   PUBMED; 12093376.
RA   Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N.,
RA   Feldmann K.A., Flavell R.B., White O., Salzberg S.L.;
RT   "Full-length messenger RNA sequences greatly improve genome annotation";
RL   Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002).
XX
RN   [2]
RP   1-687
RA   Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.;
RT   "Features of Arabidopsis genes and genome discovered using full-length
RT   cDNAs";
RL   Plant Mol. Biol. 60(1):71-87(2006).
XX
RN   [3]
RP   1-687
RA   Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.;
RT   ;
RL   Submitted (11-MAR-2002) to the INSDC.
RL   Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA
XX
DR   MD5; ce5ef46af382cfa1d60c29773ef5931d.
XX
CC   This clone sequence is one of 5,000 Ceres full-length cDNAs made
CC   available to TIGR and Genbank. The following quality assessment of
CC   this set was done by comparison with known proteins: two percent of
CC   the clones are estimated to be 5'-truncated; less than one percent
CC   are 3'-truncated; approximately two percent represent alternative
CC   splice variants, including unspliced introns and spliced exons; one
CC   percent may contain premature stop codons; five percent may have
CC   frame shifts in a coding region. A sequence is considered to be
CC   5'-truncated if it lacks the translation initiation start (ATG). A
CC   sequence is considered to be 3'-truncated if it lacks the
CC   C-terminal end of the encoded protein. Please note that these cDNA
CC   sequences are derived from the Ws or LAer ecotypes and therefore
CC   may contain polymorphisms when compared to sequences from Col-0.
CC   Genset carried out the library production and sequencing of the
CC   full-length clones. Ceres, Inc. carried out the clustering of the
CC   5' sequences, selection of clones, and sequence assembly.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..687
FT                   /organism="Arabidopsis thaliana"
FT                   /mol_type="mRNA"
FT                   /clone="15222"
FT                   /db_xref="taxon:3702"
FT   CDS             123..485
FT                   /codon_start=1
FT                   /product="unknown"
FT                   /db_xref="GOA:Q8LEF3"
FT                   /db_xref="InterPro:IPR007808"
FT                   /db_xref="InterPro:IPR038567"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q8LEF3"
FT                   /protein_id="AAM62678.1"
FT                   /translation="MGKRKSRAKPAPTKRMDKLDTIFSCPFCNHGSSVECIIDMKHLIG
FT                   KAACRICEESFSTTITALTEAIDIYSEWIDECERVNTAEDDVVQEEEEEVEEEEEEEEE
FT                   EDDEDDHVSVKRKYNF"
XX
SQ   Sequence 687 BP; 233 A; 120 C; 149 G; 185 T; 0 other;
     aaaaatataa acaaacaaac atctctcgcc gtcaggttac atctatcgcc accgcaaaga        60
     gaccaccgtc tcctccgcaa tcttcataac ctaaacaacc ctcatcccct ggtacttaaa       120
     caatgggaaa gaggaaatca agagcaaagc ctgctcctac gaagcgaatg gataagcttg       180
     acacaatctt tagttgtcct ttctgcaatc acgggtctag tgtcgaatgc atcattgata       240
     tgaagcatct gattggtaaa gcagcttgta gaatctgtga agaaagcttt agtactacta       300
     tcacagcttt gactgaagct atagacattt atagcgaatg gatcgatgag tgcgagaggg       360
     ttaataccgc ggaagatgat gttgtgcaag aagaggaaga agaagtagaa gaggaagaag       420
     aagaggaaga agaggaggat gatgaagatg accatgtctc tgtcaaaagg aagtataact       480
     tctgagacga gtgttttatc gaaaatcatg taagtcgtcg tcttagagtt atctgcttta       540
     tgttgtaata tctatctgat gaaatcacaa gaacaatctt tagtgttttc tcagtgtctg       600
     atagagaaac atacatttaa gtgaacaatc tttaatcaca ataacagtgt atgattatga       660
     tttgtaagtg gatttaaggc tttgctc                                           687
//