spacer
spacer

EBI Dbfetch

ID   AY087592; SV 1; linear; mRNA; STD; PLN; 1963 BP.
XX
AC   AY087592;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-FEB-2006 (Rel. 86, Last updated, Version 5)
XX
DE   Arabidopsis thaliana clone 36882 mRNA, complete sequence.
XX
KW   FLI_CDNA.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids;
OC   eurosids II; Brassicales; Brassicaceae; Arabidopsis.
XX
RN   [1]
RP   1-1963
RX   PUBMED; 12093376.
RA   Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N.,
RA   Feldmann K.A., Flavell R.B., White O., Salzberg S.L.;
RT   "Full-length messenger RNA sequences greatly improve genome annotation";
RL   Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002).
XX
RN   [2]
RP   1-1963
RA   Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.;
RT   "Features of Arabidopsis genes and genome discovered using full-length
RT   cDNAs";
RL   Plant Mol. Biol. 60(1):71-87(2006).
XX
RN   [3]
RP   1-1963
RA   Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.;
RT   ;
RL   Submitted (11-MAR-2002) to the EMBL/GenBank/DDBJ databases.
RL   Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA
XX
CC   This clone sequence is one of 5,000 Ceres full-length cDNAs made
CC   available to TIGR and Genbank. The following quality assessment of
CC   this set was done by comparison with known proteins: two percent of
CC   the clones are estimated to be 5'-truncated; less than one percent
CC   are 3'-truncated; approximately two percent represent alternative
CC   splice variants, including unspliced introns and spliced exons; one
CC   percent may contain premature stop codons; five percent may have
CC   frame shifts in a coding region. A sequence is considered to be
CC   5'-truncated if it lacks the translation initiation start (ATG). A
CC   sequence is considered to be 3'-truncated if it lacks the
CC   C-terminal end of the encoded protein. Please note that these cDNA
CC   sequences are derived from the Ws or LAer ecotypes and therefore
CC   may contain polymorphisms when compared to sequences from Col-0.
CC   Genset carried out the library production and sequencing of the
CC   full-length clones. Ceres, Inc. carried out the clustering of the
CC   5' sequences, selection of clones, and sequence assembly.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1963
FT                   /organism="Arabidopsis thaliana"
FT                   /mol_type="mRNA"
FT                   /clone="36882"
FT                   /db_xref="taxon:3702"
FT   CDS             85..1731
FT                   /codon_start=1
FT                   /product="putative beta-amylase"
FT                   /db_xref="GOA:Q9SMW0"
FT                   /db_xref="HSSP:1BYB"
FT                   /db_xref="InterPro:IPR013781"
FT                   /db_xref="UniProtKB/TrEMBL:Q9SMW0"
FT                   /protein_id="AAM65134.1"
FT                   /translation="MELTLNSSSSLIKRKDAKSSRNQESSSNNMTFAKMKPPTYQFQAK
FT                   NSVKEMKFTHEKTFTPEGETLEKWEKLHVLSYPHSKNDASVPVFVMLPLDTVTMSGHLN
FT                   KPRAMNASLMALKGAGVEGVMVDAWWGLVEKDGPMNYNWEGYAELIQMVQKHGLKLQVV
FT                   MSFHQCGGNVGDSCSIPLPPWVLEEISKNPDLVYTDKSGRRNPEYISLGCDSVPVLRGR
FT                   TPIQVYSDFMRSFRERFEGYIGGVIAEIQVGMGPCGELRYPSYPESNGTWRFPGIGEFQ
FT                   CYDKYMKSSLQAYAESIGKTNWGTSGPHDAGEYKNLPEDTEFFRRDGTWNSEYGKFFME
FT                   WYSGKLLEHGDQLLSSAKGIFQGSGAKLSGKVAGIHWHYNTRSHAAELTAGYYNTRNHD
FT                   GYLPIAKMFNKHGVVLNFTCMEMKDGEQPEHANCSPEGLVKQVQNATRQAGTELAGENA
FT                   LERYDSSAFGQVVATNRSDSGNGLTAFTYLRMNKRLFEGQNWQQLVEFVKNMKEGGHGR
FT                   RLSKEDTTGSDLYVGFVKGKIAENVEEAALV"
XX
SQ   Sequence 1963 BP; 627 A; 403 C; 476 G; 457 T; 0 other;
     aaacacaaac atatcttcta tcaaacacca acagctctat tctctacctc atttctcatc        60
     ataacaaaga gagagaaaaa aactatggaa ttgacactga attcctcgag ttctcttatc       120
     aaacgtaaag atgccaagag ttctagaaac caagaaagtt cctccaacaa catgaccttt       180
     gcgaagatga agccgccaac atatcaattc caagcaaaga actcggttaa ggaaatgaag       240
     ttcactcacg agaagacctt cacgccagaa ggtgaaaccc ttgagaaatg ggagaagctc       300
     cacgttctct catacccaca ctccaagaac gacgctagcg ttccggtgtt cgtcatgtta       360
     ccgctcgaca cagtaacaat gtcagggcat ttgaacaaac cacgagccat gaacgctagt       420
     ttgatggccc tgaaaggagc tggtgtggaa ggtgtgatgg tggatgcttg gtggggattg       480
     gtggagaaag atggacctat gaattataac tgggaaggct atgccgagct tatacagatg       540
     gttcaaaagc acggtctcaa actccaggtc gttatgtcat tccatcaatg tggaggaaac       600
     gtaggagact cttgcagtat ccccttgcct ccatgggtgc ttgaagagat cagcaagaac       660
     cctgatcttg tctacacaga caaatctggg agaaggaacc ctgaatatat ctccttggga       720
     tgtgattctg tgcctgtcct aagaggaaga acacctatcc aggtctactc agatttcatg       780
     aggagcttcc gtgaacgatt tgaaggctac ataggaggag ttattgcgga aattcaagta       840
     ggaatgggac cttgtggaga attgagatac ccatcatacc ctgagagcaa cgggacctgg       900
     agattccccg gaattggaga gttccagtgc tacgacaagt atatgaaatc gtcacttcaa       960
     gcatatgctg agtcaatcgg gaaaactaac tggggaacaa gtggacctca tgatgccggc      1020
     gagtacaaga acctcccaga agatactgaa tttttcagga gagacggaac atggaatagc      1080
     gagtatggaa agtttttcat ggaatggtac tccgggaagc tgctagaaca tggagaccaa      1140
     ctcctatctt cagcgaaagg tatctttcaa ggaagcggag caaagctatc aggaaaggta      1200
     gctggaattc actggcacta caacaccagg tcacacgcag ctgagctaac cgctggatac      1260
     tacaacacaa gaaaccatga cgggtatctg ccaatagcta agatgttcaa caaacatgga      1320
     gttgtgctca acttcacctg catggagatg aaagacgggg agcaacctga gcacgcgaat      1380
     tgctcaccag aaggtctggt caagcaagta cagaacgcga caaggcaggc cggaaccgaa      1440
     ctagcagggg agaacgcgct agaacgatat gactcaagcg cattcggaca agtggtagca      1500
     acaaataggt cagattctgg aaatgggtta accgcattta cttacctaag aatgaacaag      1560
     cggttatttg agggtcaaaa ttggcagcag ttagtggagt ttgttaagaa catgaaggaa      1620
     ggtggtcatg ggaggagact ctcaaaagaa gacacaactg gaagtgacct ttatgttgga      1680
     tttgtcaaag gcaagatcgc tgagaatgtg gaggaggctg ctttagtgta atttcccaca      1740
     taggtacata catatagtgt ggtgtttatt gtattcctgt ctgataaata actagagaga      1800
     tcaaaccagt aagagtgtta aagctataga tttgcacaat tctgggtcag agtcagagca      1860
     aagagaagca aaatcaagat gatgtacact tagatgtttc ctatgagttt tccttgtaca      1920
     tcatcttcat actcttaatc tcaaatacta tgcatttttc tcc                        1963
//


  
spacer
spacer