EBI Dbfetch

ID   AY087592; SV 1; linear; mRNA; STD; PLN; 1963 BP.
XX
AC   AY087592;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-FEB-2006 (Rel. 86, Last updated, Version 5)
XX
DE   Arabidopsis thaliana clone 36882 mRNA, complete sequence.
XX
KW   FLI_CDNA.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids;
OC   eurosids II; Brassicales; Brassicaceae; Arabidopsis.
XX
RN   [1]
RP   1-1963
RX   PUBMED; 12093376.
RA   Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N.,
RA   Feldmann K.A., Flavell R.B., White O., Salzberg S.L.;
RT   "Full-length messenger RNA sequences greatly improve genome annotation";
RL   Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002).
XX
RN   [2]
RP   1-1963
RA   Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.;
RT   "Features of Arabidopsis genes and genome discovered using full-length
RT   cDNAs";
RL   Plant Mol. Biol. 60(1):71-87(2006).
XX
RN   [3]
RP   1-1963
RA   Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.;
RT   ;
RL   Submitted (11-MAR-2002) to the EMBL/GenBank/DDBJ databases.
RL   Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA
XX
CC   This clone sequence is one of 5,000 Ceres full-length cDNAs made
CC   available to TIGR and Genbank. The following quality assessment of
CC   this set was done by comparison with known proteins: two percent of
CC   the clones are estimated to be 5'-truncated; less than one percent
CC   are 3'-truncated; approximately two percent represent alternative
CC   splice variants, including unspliced introns and spliced exons; one
CC   percent may contain premature stop codons; five percent may have
CC   frame shifts in a coding region. A sequence is considered to be
CC   5'-truncated if it lacks the translation initiation start (ATG). A
CC   sequence is considered to be 3'-truncated if it lacks the
CC   C-terminal end of the encoded protein. Please note that these cDNA
CC   sequences are derived from the Ws or LAer ecotypes and therefore
CC   may contain polymorphisms when compared to sequences from Col-0.
CC   Genset carried out the library production and sequencing of the
CC   full-length clones. Ceres, Inc. carried out the clustering of the
CC   5' sequences, selection of clones, and sequence assembly.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1963
FT                   /organism="Arabidopsis thaliana"
FT                   /mol_type="mRNA"
FT                   /clone="36882"
FT                   /db_xref="taxon:3702"
FT   CDS             85..1731
FT                   /codon_start=1
FT                   /product="putative beta-amylase"
FT                   /db_xref="GOA:Q9SMW0"
FT                   /db_xref="HSSP:1BYB"
FT                   /db_xref="InterPro:IPR013781"
FT                   /db_xref="UniProtKB/TrEMBL:Q9SMW0"
FT                   /protein_id="AAM65134.1"
FT                   /translation="MELTLNSSSSLIKRKDAKSSRNQESSSNNMTFAKMKPPTYQFQAK
FT                   NSVKEMKFTHEKTFTPEGETLEKWEKLHVLSYPHSKNDASVPVFVMLPLDTVTMSGHLN
FT                   KPRAMNASLMALKGAGVEGVMVDAWWGLVEKDGPMNYNWEGYAELIQMVQKHGLKLQVV
FT                   MSFHQCGGNVGDSCSIPLPPWVLEEISKNPDLVYTDKSGRRNPEYISLGCDSVPVLRGR
FT                   TPIQVYSDFMRSFRERFEGYIGGVIAEIQVGMGPCGELRYPSYPESNGTWRFPGIGEFQ
FT                   CYDKYMKSSLQAYAESIGKTNWGTSGPHDAGEYKNLPEDTEFFRRDGTWNSEYGKFFME
FT                   WYSGKLLEHGDQLLSSAKGIFQGSGAKLSGKVAGIHWHYNTRSHAAELTAGYYNTRNHD
FT                   GYLPIAKMFNKHGVVLNFTCMEMKDGEQPEHANCSPEGLVKQVQNATRQAGTELAGENA
FT                   LERYDSSAFGQVVATNRSDSGNGLTAFTYLRMNKRLFEGQNWQQLVEFVKNMKEGGHGR
FT                   RLSKEDTTGSDLYVGFVKGKIAENVEEAALV"
XX
SQ   Sequence 1963 BP; 627 A; 403 C; 476 G; 457 T; 0 other;
     aaacacaaac atatcttcta tcaaacacca acagctctat tctctacctc atttctcatc        60
     ataacaaaga gagagaaaaa aactatggaa ttgacactga attcctcgag ttctcttatc       120
     aaacgtaaag atgccaagag ttctagaaac caagaaagtt cctccaacaa catgaccttt       180
     gcgaagatga agccgccaac atatcaattc caagcaaaga actcggttaa ggaaatgaag       240
     ttcactcacg agaagacctt cacgccagaa ggtgaaaccc ttgagaaatg ggagaagctc       300
     cacgttctct catacccaca ctccaagaac gacgctagcg ttccggtgtt cgtcatgtta       360
     ccgctcgaca cagtaacaat gtcagggcat ttgaacaaac cacgagccat gaacgctagt       420
     ttgatggccc tgaaaggagc tggtgtggaa ggtgtgatgg tggatgcttg gtggggattg       480
     gtggagaaag atggacctat gaattataac tgggaaggct atgccgagct tatacagatg       540
     gttcaaaagc acggtctcaa actccaggtc gttatgtcat tccatcaatg tggaggaaac       600
     gtaggagact cttgcagtat ccccttgcct ccatgggtgc ttgaagagat cagcaagaac       660
     cctgatcttg tctacacaga caaatctggg agaaggaacc ctgaatatat ctccttggga       720
     tgtgattctg tgcctgtcct aagaggaaga acacctatcc aggtctactc agatttcatg       780
     aggagcttcc gtgaacgatt tgaaggctac ataggaggag ttattgcgga aattcaagta       840
     ggaatgggac cttgtggaga attgagatac ccatcatacc ctgagagcaa cgggacctgg       900
     agattccccg gaattggaga gttccagtgc tacgacaagt atatgaaatc gtcacttcaa       960
     gcatatgctg agtcaatcgg gaaaactaac tggggaacaa gtggacctca tgatgccggc      1020
     gagtacaaga acctcccaga agatactgaa tttttcagga gagacggaac atggaatagc      1080
     gagtatggaa agtttttcat ggaatggtac tccgggaagc tgctagaaca tggagaccaa      1140
     ctcctatctt cagcgaaagg tatctttcaa ggaagcggag caaagctatc aggaaaggta      1200
     gctggaattc actggcacta caacaccagg tcacacgcag ctgagctaac cgctggatac      1260
     tacaacacaa gaaaccatga cgggtatctg ccaatagcta agatgttcaa caaacatgga      1320
     gttgtgctca acttcacctg catggagatg aaagacgggg agcaacctga gcacgcgaat      1380
     tgctcaccag aaggtctggt caagcaagta cagaacgcga caaggcaggc cggaaccgaa      1440
     ctagcagggg agaacgcgct agaacgatat gactcaagcg cattcggaca agtggtagca      1500
     acaaataggt cagattctgg aaatgggtta accgcattta cttacctaag aatgaacaag      1560
     cggttatttg agggtcaaaa ttggcagcag ttagtggagt ttgttaagaa catgaaggaa      1620
     ggtggtcatg ggaggagact ctcaaaagaa gacacaactg gaagtgacct ttatgttgga      1680
     tttgtcaaag gcaagatcgc tgagaatgtg gaggaggctg ctttagtgta atttcccaca      1740
     taggtacata catatagtgt ggtgtttatt gtattcctgt ctgataaata actagagaga      1800
     tcaaaccagt aagagtgtta aagctataga tttgcacaat tctgggtcag agtcagagca      1860
     aagagaagca aaatcaagat gatgtacact tagatgtttc ctatgagttt tccttgtaca      1920
     tcatcttcat actcttaatc tcaaatacta tgcatttttc tcc                        1963
//
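
The record above uses two-letter line codes (ID, AC, FT, SQ, and so on), followed by the sequence block and a `//` terminator. The following is a minimal sketch, not an official parser, of how such a record could be read in Python when each line code sits on its own line. The names `parse_embl_record`, `simple_span_length`, and `DEMO` are hypothetical, and `DEMO` is a made-up 12-base record used only for illustration, not data from this entry.

```python
from collections import Counter, defaultdict


def parse_embl_record(text):
    """Split one EMBL-style record into line-code fields and the raw sequence.

    Assumption: one line code per line, as in the standard flat-file layout.
    Returns (fields, sequence), where fields maps a two-letter code to the
    list of its line bodies.
    """
    fields = defaultdict(list)
    seq_parts = []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):
            break  # end of record
        if in_sequence:
            # Keep letters only: drops spaces and the trailing position counter.
            seq_parts.append("".join(ch for ch in line if ch.isalpha()))
            continue
        code = line[:2]
        # Skip blank lines, XX spacers, and anything that is not "CC<space>...".
        if not code.strip() or code == "XX":
            continue
        if len(line) > 2 and line[2] != " ":
            continue
        fields[code].append(line[2:].strip())
        if code == "SQ":
            in_sequence = True
    return dict(fields), "".join(seq_parts)


def simple_span_length(location):
    """Length of a simple 'start..end' location (no joins or complements)."""
    start, end = location.split("..")
    return int(end) - int(start) + 1


# Hypothetical toy record, for illustration only.
DEMO = "\n".join([
    "ID   DEMO1; SV 1; linear; mRNA; STD; PLN; 12 BP.",
    "XX",
    "FT   CDS             1..12",
    "SQ   Sequence 12 BP; 4 A; 2 C; 3 G; 3 T; 0 other;",
    "     atggcatagt ca                                                            12",
    "//",
])

fields, sequence = parse_embl_record(DEMO)
base_counts = Counter(sequence)  # compare against the counts on the SQ line
```

Applied to this entry, `simple_span_length("85..1731")` gives 1647, a multiple of three, which is consistent with a CDS annotated with `/codon_start=1`. The base counts from `Counter` can likewise be compared with the composition stated on the SQ line as a quick integrity check.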