Dbfetch
ID AY085452; SV 1; linear; mRNA; STD; PLN; 687 BP. XX AC AY085452; XX DT 14-JUN-2002 (Rel. 72, Created) DT 24-FEB-2006 (Rel. 86, Last updated, Version 4) XX DE Arabidopsis thaliana clone 15222 mRNA, complete sequence. XX KW FLI_CDNA. XX OS Arabidopsis thaliana (thale cress) OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae; OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis. XX RN [1] RP 1-687 RX PUBMED; 12093376. RA Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N., RA Feldmann K.A., Flavell R.B., White O., Salzberg S.L.; RT "Full-length messenger RNA sequences greatly improve genome annotation"; RL Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002). XX RN [2] RP 1-687 RA Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.; RT "Features of Arabidopsis genes and genome discovered using full-length RT cDNAs"; RL Plant Mol. Biol. 60(1):71-87(2006). XX RN [3] RP 1-687 RA Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.; RT ; RL Submitted (11-MAR-2002) to the INSDC. RL Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA XX DR MD5; ce5ef46af382cfa1d60c29773ef5931d. XX CC This clone sequence is one of 5,000 Ceres full-length cDNAs made CC available to TIGR and Genbank. The following quality assessment of CC this set was done by comparison with known proteins: two percent of CC the clones are estimated to be 5'-truncated; less than one percent CC are 3'-truncated; approximately two percent represent alternative CC splice variants, including unspliced introns and spliced exons; one CC percent may contain premature stop codons; five percent may have CC frame shifts in a coding region. A sequence is considered to be CC 5'-truncated if it lacks the translation initiation start (ATG). A CC sequence is considered to be 3'-truncated if it lacks the CC C-terminal end of the encoded protein. Please note that these cDNA CC sequences are derived from the Ws or LAer ecotypes and therefore CC may contain polymorphisms when compared to sequences from Col-0. CC Genset carried out the library production and sequencing of the CC full-length clones. Ceres, Inc. carried out the clustering of the CC 5' sequences, selection of clones, and sequence assembly. XX FH Key Location/Qualifiers FH FT source 1..687 FT /organism="Arabidopsis thaliana" FT /mol_type="mRNA" FT /clone="15222" FT /db_xref="taxon:3702" FT CDS 123..485 FT /codon_start=1 FT /product="unknown" FT /db_xref="GOA:Q8LEF3" FT /db_xref="InterPro:IPR007808" FT /db_xref="InterPro:IPR038567" FT /db_xref="UniProtKB/Swiss-Prot:Q8LEF3" FT /protein_id="AAM62678.1" FT /translation="MGKRKSRAKPAPTKRMDKLDTIFSCPFCNHGSSVECIIDMKHLIG FT KAACRICEESFSTTITALTEAIDIYSEWIDECERVNTAEDDVVQEEEEEVEEEEEEEEE FT EDDEDDHVSVKRKYNF" XX SQ Sequence 687 BP; 233 A; 120 C; 149 G; 185 T; 0 other; aaaaatataa acaaacaaac atctctcgcc gtcaggttac atctatcgcc accgcaaaga 60 gaccaccgtc tcctccgcaa tcttcataac ctaaacaacc ctcatcccct ggtacttaaa 120 caatgggaaa gaggaaatca agagcaaagc ctgctcctac gaagcgaatg gataagcttg 180 acacaatctt tagttgtcct ttctgcaatc acgggtctag tgtcgaatgc atcattgata 240 tgaagcatct gattggtaaa gcagcttgta gaatctgtga agaaagcttt agtactacta 300 tcacagcttt gactgaagct atagacattt atagcgaatg gatcgatgag tgcgagaggg 360 ttaataccgc ggaagatgat gttgtgcaag aagaggaaga agaagtagaa gaggaagaag 420 aagaggaaga agaggaggat gatgaagatg accatgtctc tgtcaaaagg aagtataact 480 tctgagacga gtgttttatc gaaaatcatg taagtcgtcg tcttagagtt atctgcttta 540 tgttgtaata tctatctgat gaaatcacaa gaacaatctt tagtgttttc tcagtgtctg 600 atagagaaac atacatttaa gtgaacaatc tttaatcaca ataacagtgt atgattatga 660 tttgtaagtg gatttaaggc tttgctc 687 //