spacer
spacer

EBI Dbfetch

ID   AY085529; SV 1; linear; mRNA; STD; PLN; 1439 BP.
XX
AC   AY085529;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-FEB-2006 (Rel. 86, Last updated, Version 5)
XX
DE   Arabidopsis thaliana clone 156141 mRNA, complete sequence.
XX
KW   FLI_CDNA.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids;
OC   eurosids II; Brassicales; Brassicaceae; Arabidopsis.
XX
RN   [1]
RP   1-1439
RX   PUBMED; 12093376.
RA   Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N.,
RA   Feldmann K.A., Flavell R.B., White O., Salzberg S.L.;
RT   "Full-length messenger RNA sequences greatly improve genome annotation";
RL   Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002).
XX
RN   [2]
RP   1-1439
RA   Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.;
RT   "Features of Arabidopsis genes and genome discovered using full-length
RT   cDNAs";
RL   Plant Mol. Biol. 60(1):71-87(2006).
XX
RN   [3]
RP   1-1439
RA   Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.;
RT   ;
RL   Submitted (11-MAR-2002) to the EMBL/GenBank/DDBJ databases.
RL   Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA
XX
CC   This clone sequence is one of 5,000 Ceres full-length cDNAs made
CC   available to TIGR and Genbank. The following quality assessment of
CC   this set was done by comparison with known proteins: two percent of
CC   the clones are estimated to be 5'-truncated; less than one percent
CC   are 3'-truncated; approximately two percent represent alternative
CC   splice variants, including unspliced introns and spliced exons; one
CC   percent may contain premature stop codons; five percent may have
CC   frame shifts in a coding region. A sequence is considered to be
CC   5'-truncated if it lacks the translation initiation start (ATG). A
CC   sequence is considered to be 3'-truncated if it lacks the
CC   C-terminal end of the encoded protein. Please note that these cDNA
CC   sequences are derived from the Ws or LAer ecotypes and therefore
CC   may contain polymorphisms when compared to sequences from Col-0.
CC   Genset carried out the library production and sequencing of the
CC   full-length clones. Ceres, Inc. carried out the clustering of the
CC   5' sequences, selection of clones, and sequence assembly.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1439
FT                   /organism="Arabidopsis thaliana"
FT                   /mol_type="mRNA"
FT                   /clone="156141"
FT                   /db_xref="taxon:3702"
FT   CDS             61..1251
FT                   /codon_start=1
FT                   /product="alpha-galactosidase-like protein"
FT                   /db_xref="GOA:Q8RX86"
FT                   /db_xref="HSSP:1UAS"
FT                   /db_xref="InterPro:IPR000111"
FT                   /db_xref="InterPro:IPR002241"
FT                   /db_xref="InterPro:IPR013785"
FT                   /db_xref="InterPro:IPR017853"
FT                   /db_xref="UniProtKB/TrEMBL:Q8RX86"
FT                   /protein_id="AAM62753.1"
FT                   /translation="MVLLSFSLRFIAFTLTITLTQIADGFQSRMLMNNGLALSPQMGWN
FT                   SWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGELKRDSQGSLVAKASTFP
FT                   SGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAKTFASWGIDYLKYDNC
FT                   ENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIGNSWRTTGDIQDNWK
FT                   SMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIWALAKAPLLIGCDL
FT                   RSMDKVTFELLSNKEVIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKKRVAVILWNRGSA
FT                   SANITARWAEIGLNSSDIVNARDLWEHSTYSCVKKQLSALVEPHACKMYTLTRRKA"
XX
SQ   Sequence 1439 BP; 467 A; 267 C; 331 G; 374 T; 0 other;
     ctctctctct caatctgttt atagccattg aaggaaacaa gaaaaaagct tgatcaagca        60
     atggttcttc ttagtttctc cttaagattc attgcgttca ctctgactat aactctgact       120
     cagattgctg atgggtttca gagtcgaatg ttgatgaaca atggacttgc tctctctcct       180
     caaatgggat ggaacagctg gaatcatttt cagtgtaata tcaatgaaac ccttatcaaa       240
     caaaccgctg atgcaatggt ttcaagtggt ctttctgcaa taggttacaa gtatatcaac       300
     atagatgatt gttggggtga actcaagaga gattctcagg ggagtttggt tgcaaaagca       360
     tcaacatttc cttcaggaat caaagcttta tcagattatg ttcatagcaa agggctaaaa       420
     cttgggatat actctgatgc cgggactctt acctgcagcc aaaccatgcc gggatcactc       480
     gggcatgaag aacaagatgc caagacattt gcctcatggg ggattgatta cctcaagtat       540
     gataactgcg aaaatacagg gacaagtcca agagaaagat accctaagat gagtaaagcg       600
     ctgttaaatt caggaagatc catattcttc tctctgtgcg aatggggaca agaggatcca       660
     gcaacttggg caggagatat tggcaacagt tggagaacaa caggagatat ccaagataat       720
     tggaaaagca tgacattgat agcggatcag aatgatcgat gggcatctta tgcaagacca       780
     ggatcgtgga acgatccaga catgcttgaa gtgggaaatg gaggcatgac taaggaagaa       840
     tacatgtcgc atttcagcat ctgggcattg gcaaaagctc ctctgctgat cggctgcgat       900
     cttagatcga tggacaaagt tacatttgaa ttgcttagca acaaagaggt gattgctgtt       960
     aatcaagaca aacttgggat ccagggaaag aaagtaaaga aagaaggtga tcttgaggta      1020
     tgggcaggtc cactaagcaa gaaacgagta gcggtcatcc tatggaacag aggatcagca      1080
     tctgcaaaca tcacagctcg atgggctgag attggcctca attcatcaga tattgttaat      1140
     gctcgtgact tatgggagca ttcgacatat tcatgtgtta aaaagcagct atctgcccta      1200
     gtggagcctc atgcctgcaa aatgtatact cttacaagac gcaaggcata agatgccaat      1260
     ctggtgcaga taaagttatg gaagtaaaac tgatcaatag caaaatcctg ttgaaagcag      1320
     aagattgtac caaaacatct tctggtttca gtttgtatct actatgctga atataagatg      1380
     ccagaaacag aaatactcaa tccaacttgc attatattaa gaataatttg agttatggc       1439
//


  
spacer
spacer