Dbfetch

ID   AY086511; SV 1; linear; mRNA; STD; PLN; 1043 BP.
XX
AC   AY086511;
XX
DT   14-JUN-2002 (Rel. 72, Created)
DT   24-FEB-2006 (Rel. 86, Last updated, Version 4)
XX
DE   Arabidopsis thaliana clone 25530 mRNA, complete sequence.
XX
KW   FLI_CDNA.
XX
OS   Arabidopsis thaliana (thale cress)
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
XX
RN   [1]
RP   1-1043
RX   PUBMED; 12093376.
RA   Haas B.J., Volfovsky N., Town C.D., Troukhan M., Alexandrov N.,
RA   Feldmann K.A., Flavell R.B., White O., Salzberg S.L.;
RT   "Full-length messenger RNA sequences greatly improve genome annotation";
RL   Genome Biol. 3(6):RESEARCH0029-RESEARCH0029(2002).
XX
RN   [2]
RP   1-1043
RA   Alexandrov N.A., Troukhan M.E., Brover V.V., Flavell R.B., Feldmann K.A.;
RT   "Features of Arabidopsis genes and genome discovered using full-length
RT   cDNAs";
RL   Plant Mol. Biol. 60(1):71-87(2006).
XX
RN   [3]
RP   1-1043
RA   Brover V., Troukhan M., Alexandrov N., Lu Y.-P., Flavell R., Feldmann K.;
RT   ;
RL   Submitted (11-MAR-2002) to the INSDC.
RL   Ceres, Inc, 3007 Malibu Canyon Road, Malibu, CA 90265, USA
XX
DR   MD5; 44aaaab30ff82aab32bbca7023fceb7f.
XX
CC   This clone sequence is one of 5,000 Ceres full-length cDNAs made
CC   available to TIGR and Genbank. The following quality assessment of
CC   this set was done by comparison with known proteins: two percent of
CC   the clones are estimated to be 5'-truncated; less than one percent
CC   are 3'-truncated; approximately two percent represent alternative
CC   splice variants, including unspliced introns and spliced exons; one
CC   percent may contain premature stop codons; five percent may have
CC   frame shifts in a coding region. A sequence is considered to be
CC   5'-truncated if it lacks the translation initiation start (ATG). A
CC   sequence is considered to be 3'-truncated if it lacks the
CC   C-terminal end of the encoded protein. Please note that these cDNA
CC   sequences are derived from the Ws or LAer ecotypes and therefore
CC   may contain polymorphisms when compared to sequences from Col-0.
CC   Genset carried out the library production and sequencing of the
CC   full-length clones. Ceres, Inc. carried out the clustering of the
CC   5' sequences, selection of clones, and sequence assembly.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1043
FT                   /organism="Arabidopsis thaliana"
FT                   /mol_type="mRNA"
FT                   /clone="25530"
FT                   /db_xref="taxon:3702"
FT   CDS             136..927
FT                   /codon_start=1
FT                   /product="putative ABC transporter"
FT                   /db_xref="GOA:Q9C9W0"
FT                   /db_xref="InterPro:IPR003439"
FT                   /db_xref="InterPro:IPR003593"
FT                   /db_xref="InterPro:IPR005670"
FT                   /db_xref="InterPro:IPR017871"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9C9W0"
FT                   /protein_id="AAM63511.1"
FT                   /translation="MPSLWSNESDGSLREHLVDVVVSGSEPKIRVHDLTRVADDGSRIL
FT                   KGVTIDIPKGMIVGVIGPSGSGKSTFLRSLNRLWEPPESTVFLDGEDITNVDVIALRRR
FT                   VGMLFQLPVLFQGTVADNVRYGPNLRGEKLSDEEVYKLLSLADLDASFAKKTGAELSVG
FT                   QAQRVALARTLANEPEVLLLDEPTSALDPISTENIEDVIVKLKKQRGITTVIVSHSIKQ
FT                   IQKVADIVCLVVDGEIVEVLKPSELSHATHPMAQRFLQLSS"
XX
SQ   Sequence 1043 BP; 288 A; 180 C; 252 G; 322 T; 1 other;
     aaaagttggc cactaagtgg tgtaagaata ataaattgtc aatatcaatt gactgattct        60
     tattgttcat atacggacac aaatcttttc gttaagtgag tgttttgatc aaaaaaagtt       120
     gaaaacttta aagccatgcc atcactttgg tctaacgaat ccgatggatc attacgagag       180
     catcttgtgg atgttgttgt gtctggttca gaaccaaaga ttcgggtaca tgatctgacc       240
     cgagtcgccg atgatggatc tcggatcttg aaaggagtta cgatagatat accaaaaggt       300
     atgatcgttg gtgtgattgg acctagtgga agtggaaagt caacgttttt gagatctctg       360
     aatcgtcttt gggaaccacc ggagtcaact gtgttcttgg acggtgaaga tataaccaac       420
     gttgatgtta ttgctcttcg tcgtagagtt ggaatgcttt ttcagcttcc tgttcttttt       480
     caagggactg ttgcggataa tgtgagatat ggtccgaatt tgagagggga gaaactaagt       540
     gacgaagagg tttataagct gctaagtctt gcagaccttg atgcttcctt tgctaagaag       600
     actggtgcag agttatctgt gggtcaagct caacgagtag cacttgcaag gactctagcc       660
     aacgagcctg aggtgttgct gctcgatgaa ccaacaagtg ctcttgatcc gatatcgaca       720
     gagaacattg aggatgttat agtgaaactg aagaagcaga gagggattac tactgtgatt       780
     gtttctcaca gtatcaagca gattcagaaa gttgctgata tcgtttgcct tgttgtcgac       840
     ggagagattg ttgaagttct taaaccaagt gagctttcgc acgctacgca tccaatggca       900
     cagaggtttc ttcaactcag ttcttgagac cattttctca ttgatggttc ctgcaagtta       960
     tttgctattt tgcttgaatc ttaataatct ctttcaagag aggaacaaat gctggttgaa      1020
     tgtaacaacm cctttcatgg ttt                                              1043
//