EBI Dbfetch

ID   AY087036; SV 1; linear; mRNA; STD; PLN; 1895 BP.
XX
AC   AY087036;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-FEB-2006 (Rel. 86, Last updated, Version 5)
XX
DE   Arabidopsis thaliana clone 30798 mRNA, complete sequence.
XX
KW   FLI_CDNA.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids;
OC   eurosids II; Brassicales; Brassicaceae; Arabidopsis.
XX
RN   [1]
RP   1-1895
RX   PUBMED; 12093376.
RA   Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N.,
RA   Feldmann K.A., Flavell R.B., White O., Salzberg S.L.;
RT   "Full-length messenger RNA sequences greatly improve genome annotation";
RL   Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002).
XX
RN   [2]
RP   1-1895
RA   Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.;
RT   "Features of Arabidopsis genes and genome discovered using full-length
RT   cDNAs";
RL   Plant Mol. Biol. 60(1):71-87(2006).
XX
RN   [3]
RP   1-1895
RA   Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.;
RT   ;
RL   Submitted (11-MAR-2002) to the EMBL/GenBank/DDBJ databases.
RL   Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA
XX
CC   This clone sequence is one of 5,000 Ceres full-length cDNAs made
CC   available to TIGR and Genbank. The following quality assessment of
CC   this set was done by comparison with known proteins: two percent of
CC   the clones are estimated to be 5'-truncated; less than one percent
CC   are 3'-truncated; approximately two percent represent alternative
CC   splice variants, including unspliced introns and spliced exons; one
CC   percent may contain premature stop codons; five percent may have
CC   frame shifts in a coding region. A sequence is considered to be
CC   5'-truncated if it lacks the translation initiation start (ATG). A
CC   sequence is considered to be 3'-truncated if it lacks the
CC   C-terminal end of the encoded protein. Please note that these cDNA
CC   sequences are derived from the Ws or LAer ecotypes and therefore
CC   may contain polymorphisms when compared to sequences from Col-0.
CC   Genset carried out the library production and sequencing of the
CC   full-length clones. Ceres, Inc. carried out the clustering of the
CC   5' sequences, selection of clones, and sequence assembly.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1895
FT                   /organism="Arabidopsis thaliana"
FT                   /mol_type="mRNA"
FT                   /clone="30798"
FT                   /db_xref="taxon:3702"
FT   CDS             178..1788
FT                   /codon_start=1
FT                   /product="beta-amylase-like proten"
FT                   /db_xref="GOA:Q8VYW2"
FT                   /db_xref="HSSP:1FA2"
FT                   /db_xref="InterPro:IPR013781"
FT                   /db_xref="UniProtKB/TrEMBL:Q8VYW2"
FT                   /protein_id="AAM64597.1"
FT                   /translation="MEVSVIGNPQARICRAELAYRELGFRFGSDVISGESRNRVSFCNQ
FT                   SSKWKEIAIRCSSRSVKCEAIVSDDASPFLKSTPKSKSLESVKLFVGLPLDTVSDCNNV
FT                   NHLKAITAGLKALKLLGVEGIELPIFWGVVEKEAAGKYEWSGYLAVAEIVKKVGLKLHA
FT                   SLSFHGSKQTEIGLPDWVAKIGDAEPGIYFTDRYGQQYKDCLSFAVDDVPVLDGKTPME
FT                   VYRGFCESFKSAFADYMGNTITGITLGLGPDGELKYPSHQHNAKLSGAGEFQCYDKHML
FT                   SALKGYAESTGNPLWGLGGPHDAPAYDQQPNSSSFFSDGGSWESQYGDFFLSWYSSLLT
FT                   SHADRVLSVASSAFSGIGVPLCGKLPLLHQWHKLRSHPSELTAGFYSSNGQDRYEAIAE
FT                   IFAKNSCRMIIPGMDLSDEHQSPESLSSPESLLGHIKTSCKKQGVVVSGQNSSTPVPGG
FT                   FERIVENLKDENVGIDLFTYQRMGALFFSPEHFHAFTVFVRNLSQFELSSDDQASEAEV
FT                   EAETASIGSGTGAPSLQTA"
XX
SQ   Sequence 1895 BP; 500 A; 408 C; 416 G; 571 T; 0 other;
     aaaccaaatc cataatcctc accatatcca caatcctcaa gctatctcct ttgtgtccat        60
     aatcgaaacg ctttttgcga tttatcctaa tcttttttct atcttctgct acaaatttgg       120
     ttcttttttg gttgaatttg gataaaatca taatctcctc ttcaattttc ttgtgaaatg       180
     gaagtttcag tgattggaaa tcctcaagcg aggatctgca gagcagaatt agcttacaga       240
     gagcttggat ttagatttgg ctctgatgta atctccggtg aatcgagaaa tagggttagt       300
     ttctgcaacc aaagctctaa atggaaagag atcgcgatac gttgctcttc gagatctgtc       360
     aaatgtgaag ccatcgtctc cgatgacgct tctccgtttc tcaaatccac tccaaaatct       420
     aaatcgctcg agagtgtaaa attatttgtt gggcttccgt tagacacagt ttcagactgc       480
     aacaatgtga atcacttgaa agctattaca gctgggctta aagctttgaa gctacttggt       540
     gtagaaggta ttgagttacc tatcttttgg ggagttgttg agaaagaagc tgctgggaaa       600
     tatgaatggt ctgggtactt ggcagtagct gagattgtta agaaagtggg acttaagctt       660
     catgcttcac tttctttcca tggatcgaaa caaacagaga taggtcttcc tgattgggtg       720
     gcaaagattg gtgatgctga accagggatc tattttacag atagatatgg acaacagtac       780
     aaagattgtt tgtcgtttgc tgttgatgat gttcctgttc ttgatgggaa gactcctatg       840
     gaggtttaca gaggtttctg tgagagcttc aagtctgctt tcgcagatta catgggcaac       900
     acaatcacgg gaatcacatt aggtttggga ccagacggtg agctgaaata tccttctcat       960
     caacataatg ccaagctctc tggcgcggga gagttccagt gttacgacaa acacatgctt      1020
     tctgctctta aaggctacgc tgaatccact ggaaaccctc tttggggtct cggtggtcct      1080
     cacgatgctc ctgcttacga tcaacagcct aattcctctt cattcttctc agacggcggg      1140
     tcatgggaat ctcagtacgg cgatttcttc ttgtcttggt attcgtctct tctcacctcc      1200
     cacgcagacc gagtcctctc cgttgcttca tctgcattta gcgggattgg agtgcctcta      1260
     tgtgggaagc tacctctctt acaccaatgg cacaagctaa gatctcatcc ttctgagtta      1320
     acagctggat tctacagctc taatggtcag gacaggtacg aggctatcgc agagatcttt      1380
     gcaaagaact cttgtagaat gataatacca ggaatggacc tatccgacga gcaccaatca      1440
     cctgaatctc tctcgagccc cgagtcatta cttggccaca tcaaaacttc ctgcaagaaa      1500
     caaggagtcg ttgtctcagg gcaaaactca tccaccccgg ttcctggtgg gtttgagagg      1560
     atcgttgaga atctgaagga tgagaatgta ggaattgatc tgttcactta ccagagaatg      1620
     ggagcacttt tcttctctcc agagcatttc catgctttca cagtctttgt ccggaacctg      1680
     agccaattcg agttgtcctc agacgatcaa gcctcagagg ctgaggttga ggccgagaca      1740
     gctagcatag gttcaggcac tggtgcacct agtttgcaaa ccgcttaatg aaatgcaaaa      1800
     tcataatttt ttatgtaaat aagaaaagtc tcctgttgtt ttagcattta tatagaatgt      1860
     catattactg taaaattatt tgcaaccatt tcttc                                 1895
//
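
The CC block of the record defines a sequence as 5'-truncated when it lacks the translation initiation start (ATG) and as 3'-truncated when it lacks the C-terminal end of the encoded protein. A minimal Python sketch of such a check, given a CDS location like `178..1788`, might look like the following. The function name `check_cds` and the toy sequence are hypothetical and not part of the record, and treating "has the C-terminal end" as "ends with an in-frame stop codon" is a simplifying assumption rather than the submitters' actual method, which compared clones against known proteins.

```python
# Hypothetical helper, not part of the EMBL record or any library.
STOP_CODONS = {"taa", "tag", "tga"}

def check_cds(sequence, start, end):
    """Run simple truncation checks on a CDS.

    start and end are 1-based, inclusive coordinates, as in an
    EMBL feature location such as 178..1788.
    """
    cds = sequence.lower()[start - 1:end]
    return {
        # A complete reading frame spans a whole number of codons.
        "length_multiple_of_3": len(cds) % 3 == 0,
        # Missing ATG is the record's stated criterion for 5' truncation.
        "starts_with_atg": cds.startswith("atg"),
        # Assumption: a terminal stop codon stands in for an intact C-terminus.
        "ends_with_stop": cds[-3:] in STOP_CODONS,
    }

# Toy sequence (not taken from the record): 4 bases of 5' UTR,
# then atg aaa taa, then 2 bases of 3' UTR.
toy = "ccccatgaaataagg"
result = check_cds(toy, 5, 13)
print(result)
```

To apply this to the record itself, the sequence would first have to be extracted from the SQ block with the spaces and position counters removed, then passed as `check_cds(seq, 178, 1788)`. The annotated CDS spans 1788 - 178 + 1 = 1611 bases, which is a multiple of three.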