![]() |
EBI DbfetchID AY085529; SV 1; linear; mRNA; STD; PLN; 1439 BP. XX AC AY085529; XX DT 14-JUN-2002 (Rel. 72, Created) DT 24-FEB-2006 (Rel. 86, Last updated, Version 5) XX DE Arabidopsis thaliana clone 156141 mRNA, complete sequence. XX KW FLI_CDNA. XX OS Arabidopsis thaliana (thale cress) OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons; rosids; OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. XX RN [1] RP 1-1439 RX PUBMED; 12093376. RA Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N., RA Feldmann K.A., Flavell R.B., White O., Salzberg S.L.; RT "Full-length messenger RNA sequences greatly improve genome annotation"; RL Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002). XX RN [2] RP 1-1439 RA Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.; RT "Features of Arabidopsis genes and genome discovered using full-length RT cDNAs"; RL Plant Mol. Biol. 60(1):71-87(2006). XX RN [3] RP 1-1439 RA Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.; RT ; RL Submitted (11-MAR-2002) to the EMBL/GenBank/DDBJ databases. RL Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA XX CC This clone sequence is one of 5,000 Ceres full-length cDNAs made CC available to TIGR and Genbank. The following quality assessment of CC this set was done by comparison with known proteins: two percent of CC the clones are estimated to be 5'-truncated; less than one percent CC are 3'-truncated; approximately two percent represent alternative CC splice variants, including unspliced introns and spliced exons; one CC percent may contain premature stop codons; five percent may have CC frame shifts in a coding region. A sequence is considered to be CC 5'-truncated if it lacks the translation initiation start (ATG). A CC sequence is considered to be 3'-truncated if it lacks the CC C-terminal end of the encoded protein. Please note that these cDNA CC sequences are derived from the Ws or LAer ecotypes and therefore CC may contain polymorphisms when compared to sequences from Col-0. CC Genset carried out the library production and sequencing of the CC full-length clones. Ceres, Inc. carried out the clustering of the CC 5' sequences, selection of clones, and sequence assembly. XX FH Key Location/Qualifiers FH FT source 1..1439 FT /organism="Arabidopsis thaliana" FT /mol_type="mRNA" FT /clone="156141" FT /db_xref="taxon:3702" FT CDS 61..1251 FT /codon_start=1 FT /product="alpha-galactosidase-like protein" FT /db_xref="GOA:Q8RX86" FT /db_xref="HSSP:1UAS" FT /db_xref="InterPro:IPR000111" FT /db_xref="InterPro:IPR002241" FT /db_xref="InterPro:IPR013785" FT /db_xref="InterPro:IPR017853" FT /db_xref="UniProtKB/TrEMBL:Q8RX86" FT /protein_id="AAM62753.1" FT /translation="MVLLSFSLRFIAFTLTITLTQIADGFQSRMLMNNGLALSPQMGWN FT SWNHFQCNINETLIKQTADAMVSSGLSAIGYKYINIDDCWGELKRDSQGSLVAKASTFP FT SGIKALSDYVHSKGLKLGIYSDAGTLTCSQTMPGSLGHEEQDAKTFASWGIDYLKYDNC FT ENTGTSPRERYPKMSKALLNSGRSIFFSLCEWGQEDPATWAGDIGNSWRTTGDIQDNWK FT SMTLIADQNDRWASYARPGSWNDPDMLEVGNGGMTKEEYMSHFSIWALAKAPLLIGCDL FT RSMDKVTFELLSNKEVIAVNQDKLGIQGKKVKKEGDLEVWAGPLSKKRVAVILWNRGSA FT SANITARWAEIGLNSSDIVNARDLWEHSTYSCVKKQLSALVEPHACKMYTLTRRKA" XX SQ Sequence 1439 BP; 467 A; 267 C; 331 G; 374 T; 0 other; ctctctctct caatctgttt atagccattg aaggaaacaa gaaaaaagct tgatcaagca 60 atggttcttc ttagtttctc cttaagattc attgcgttca ctctgactat aactctgact 120 cagattgctg atgggtttca gagtcgaatg ttgatgaaca atggacttgc tctctctcct 180 caaatgggat ggaacagctg gaatcatttt cagtgtaata tcaatgaaac ccttatcaaa 240 caaaccgctg atgcaatggt ttcaagtggt ctttctgcaa taggttacaa gtatatcaac 300 atagatgatt gttggggtga actcaagaga gattctcagg ggagtttggt tgcaaaagca 360 tcaacatttc cttcaggaat caaagcttta tcagattatg ttcatagcaa agggctaaaa 420 cttgggatat actctgatgc cgggactctt acctgcagcc aaaccatgcc gggatcactc 480 gggcatgaag aacaagatgc caagacattt gcctcatggg ggattgatta cctcaagtat 540 gataactgcg aaaatacagg gacaagtcca agagaaagat accctaagat gagtaaagcg 600 ctgttaaatt caggaagatc catattcttc tctctgtgcg aatggggaca agaggatcca 660 gcaacttggg caggagatat tggcaacagt tggagaacaa caggagatat ccaagataat 720 tggaaaagca tgacattgat agcggatcag aatgatcgat gggcatctta tgcaagacca 780 ggatcgtgga acgatccaga catgcttgaa gtgggaaatg gaggcatgac taaggaagaa 840 tacatgtcgc atttcagcat ctgggcattg gcaaaagctc ctctgctgat cggctgcgat 900 cttagatcga tggacaaagt tacatttgaa ttgcttagca acaaagaggt gattgctgtt 960 aatcaagaca aacttgggat ccagggaaag aaagtaaaga aagaaggtga tcttgaggta 1020 tgggcaggtc cactaagcaa gaaacgagta gcggtcatcc tatggaacag aggatcagca 1080 tctgcaaaca tcacagctcg atgggctgag attggcctca attcatcaga tattgttaat 1140 gctcgtgact tatgggagca ttcgacatat tcatgtgtta aaaagcagct atctgcccta 1200 gtggagcctc atgcctgcaa aatgtatact cttacaagac gcaaggcata agatgccaat 1260 ctggtgcaga taaagttatg gaagtaaaac tgatcaatag caaaatcctg ttgaaagcag 1320 aagattgtac caaaacatct tctggtttca gtttgtatct actatgctga atataagatg 1380 ccagaaacag aaatactcaa tccaacttgc attatattaa gaataatttg agttatggc 1439 // ![]() |