spacer
spacer

EBI Dbfetch

ID   AF303267; SV 1; linear; mRNA; STD; INV; 1518 BP.
XX
AC   AF303267;
XX
DT   18-JAN-2001 (Rel. 66, Created)
DT   18-JAN-2001 (Rel. 66, Last updated, Version 1)
XX
DE   Caenorhabditis elegans kynureninase (XG726) mRNA, complete cds.
XX
KW   Worm Transcriptome Project.
XX
OS   Caenorhabditis elegans
OC   Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;
OC   Rhabditidae; Peloderinae; Caenorhabditis.
XX
RN   [1]
RP   1-1518
RA   Kohara Y., Shin'i T., Suzuki Y., Sugano S., Potdevin M., Thierry-Mieg Y.,
RA   Thierry-Mieg D., Thierry-Mieg J.;
RT   "Expressed genes in C. elegans";
RL   Unpublished.
XX
RN   [2]
RP   1-1518
RA   Kohara Y., Shin'i T., Vidal M., Sugano S., Suzuki Y., Thierry-Mieg D.,
RA   Thierry-Mieg J.;
RT   ;
RL   Submitted (21-AUG-2000) to the EMBL/GenBank/DDBJ databases.
RL   National Institute of Genetics, Mishima, Japan
XX
CC   Complete mRNA for kynureninase gene XG726 (YK6646) in C. elegans.
CC   This gene is located on chromosome X on the reverse strand, its
CC   3' end is in megabase 7, near kilobase 726. It is encoded by cosmid
CC   C15H9 from base 20421 to base 18009. Its extrapolated genetic
CC   position is (X; -3.51). No phenotype has yet been reported to our
CC   knowledge; this gene's function is yet unknown.
CC   Clones
CC   This mRNA is represented by 6 clones, all compatible with a single
CC   mRNA type. Clones match the mRNA starting on base:
CC   yc707D10 -> base 1 ;  yc708E8 -> base 1 ;
CC   yk502b11 -> base 1 ;  yk462d11 -> base 8 ;
CC   yc713B10 -> base 75 ;  dvpl6g83 -> base 911 ;
CC   Transcripts from this gene are present at all stages : 2 of 3
CC   staged clones come from young embryos, 1 of 3 staged clones comes
CC   from mixed stage.
CC   3 clones are from a capped library, 2 clones start with an exact
CC   SL1 transpliced leader (beware, they both contain point mutations.
CC   use yk502b11 to produce the protein).
CC   The 5'UTR contains 3 bp:
CC   ATC
CC   The transpliced leader SL1 (GGTTTAATTACCCAAGTTTGAG) is present in
CC   front of the submitted sequence. The first ATG starts at nucleotide
CC   4.
CC   Splicing
CC   Comparison to the genome sequence confirms genome sequence accuracy
CC   and shows that the 8/8 introns follow the consensus [gt-ag] rule.
CC   Coordinates of the introns in the template genomic DNA, where 1
CC   denotes the first genomic base matching the RNA, are
CC   [ type ]      start     end     length
CC   [ gt_ag ]        94     141      48 bp
CC   [ gt_ag ]       260     729     470 bp
CC   [ gt_ag ]       851     895      45 bp
CC   [ gt_ag ]      1041    1086      46 bp
CC   [ gt_ag ]      1381    1467      87 bp
CC   [ gt_ag ]      1694    1749      56 bp
CC   [ gt_ag ]      1836    1884      49 bp
CC   [ gt_ag ]      2119    2212      94 bp
CC   The 3'UTR contains about 78 bp
CC   TTCAATTGGCTCGCAATGTAGTAGAATAATGTTGAACGTATTGTATGAGT
CC   TTTATATGAAAAAAATGATCTGAATGTG
CC   The standard AATAAA polyadenylation signal does not occur, but the
CC   variant :
CC   AAAAAA is seen, starting 20 to 35 bp before the polyA.
CC   Conceptual translation
CC   The protein encoded between the first in frame amino acid and the
CC   stop codon contains 478 residues (MW: 54.0 kDa; pI: 5.28). The
CC   sequence is identical to the predicted CDS C15H9.7.
CC   Homology
CC   BLASTp search run at NCBI on database nr on Aug 18, 2000 shows 13
CC   hits with expectancy < 0.001.
CC   There is a single hit in C. elegans, probably to self.
CC   Lineage report
CC   Based on S. Federhen and S. Shavirin's tax BLAST at NCBI, homologs
CC   to this gene are found in cellular organisms (actinomycetes,
CC   animals, chlamydias, crenarchaeotes, eubacteria, eukaryotes,
CC   euryarchaeotes, fungi, low GC Gram+, mammals, nematodes).
CC   PFAM analysis, carried on Sean Eddy's server at
CC   http://pfam.wustl.edu on Aug 18, 2000 shows 1 non relevant hit
CC   (negative score)
CC   Neighbors
CC   The XG726 gene is in a 40 kb region coding for 6 well transcribed
CC   genes. The closest neighbors are XG722 (on the same strand) and
CC   XG745 (on the opposite strand).
CC   Please mail any question to mieg@ncbi.nlm.nih.gov.
CC   Map location: XG726.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1518
FT                   /organism="Caenorhabditis elegans"
FT                   /mol_type="mRNA"
FT                   /db_xref="taxon:6239"
FT   CDS             4..1440
FT                   /codon_start=1
FT                   /gene="XG726"
FT                   /product="kynureninase"
FT                   /db_xref="GOA:Q18026"
FT                   /db_xref="InterPro:IPR015421"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q18026"
FT                   /experiment="experimental evidence, no additional details
FT                   recorded"
FT                   /protein_id="AAG50225.1"
FT                   /translation="MSDAPPQPENEQECMCTQDKVLQFLNKMADESGIKDLTDPALAEF
FT                   LSDSDALKEIRDLFHYPKAGTLPDADPSLVDPESDSIYLCGNSLGLMPKATGEVMKDHL
FT                   DKWAKMGVFGHMSGEVPWAHCDEYCLEGVGRLVGAKKEEVSVCNSLTVNIHVLLTSFYK
FT                   PTETRHKILLESKAFPSDHYAIESQIRLKGRTVQDSMVCLEPREGEETLRTEDILDYIE
FT                   KNGDEIAIVFFSGIQYYTGQLFDMRAITEAGHRKGCFVGFDLAHAFANVPLHLHWWDVD
FT                   FACWCSYKYGCTGAGSIGGLFVHERFLNDQRERMLGWWSHKMSSRFVMDNVLDLDEGAA
FT                   GYRISNPPIHTVAAMLGSLKVFDQVSLENLRSRSCYLTGYLEYLVKTLFGENSEQRTTK
FT                   LSISIITPEEFHQRGCQLSLKFSSPIDIIYPELVKRGCAVDKRYPNVIRVAPVHLYNNY
FT                   VDIRRFISVLQEVAHIVESE"
XX
SQ   Sequence 1518 BP; 420 A; 299 C; 356 G; 443 T; 0 other;
     atcatgtcgg acgccccacc gcagcctgaa aatgagcaag aatgcatgtg cactcaagat        60
     aaagttcttc agtttttaaa caaaatggct gatgaaagtg gaatcaaaga tttgacggac       120
     ccggctcttg cagaatttct ttccgatagt gatgcgctga aagaaattag agacttgttc       180
     cattatccga aagccggaac tcttccagat gctgatccaa gccttgttga tccagagagt       240
     gatagcattt atctatgcgg aaactcactc ggattgatgc caaaagcaac cggcgaagtc       300
     atgaaagatc atttggacaa atgggccaaa atgggagtat tcgggcatat gtctggagaa       360
     gttccatggg ctcattgtga tgagtactgc ttagaaggtg tgggacgtct tgttggtgcc       420
     aagaaagagg aagtttccgt gtgcaatagt ttaactgtta atatccacgt gctcttgact       480
     tccttctaca aacccacgga gactcgtcat aaaatccttc tcgaatccaa agcattccca       540
     tccgatcatt atgcaattga atctcagatt cgtttgaagg gaagaactgt tcaggacagt       600
     atggtttgtc ttgaaccacg agagggagag gaaactcttc gcacagaaga catccttgat       660
     tatattgaga aaaatggaga tgagattgca atcgtgtttt tctcgggaat ccagtactat       720
     accggacagc tttttgacat gagggctatt actgaagcgg gacatcgaaa gggatgcttt       780
     gttggcttcg atttggccca tgctttcgca aacgttcccc ttcatcttca ctggtgggat       840
     gttgactttg cctgttggtg ctcgtataag tacggatgta caggagctgg atcaattggt       900
     ggtctttttg tccatgaaag atttttgaat gaccaacggg agcgtatgtt gggatggtgg       960
     agtcataaaa tgtcatcgag atttgtcatg gacaatgttc tcgacttgga cgagggagct      1020
     gctggttaca gaatcagtaa ccctccaatt cataccgtcg ctgctatgct gggaagtttg      1080
     aaggttttcg atcaagtctc acttgaaaac cttcgtagtc gttcatgcta tttgaccggg      1140
     tacctggagt acttagtgaa aactttgttc ggtgaaaact ctgagcaaag aactactaag      1200
     ctctcaattt ctatcatcac tccagaagag ttccatcagc gtggttgcca actttctctg      1260
     aaattctctt ctccaatcga cattatttat cctgagcttg ttaaacgcgg atgtgcggtt      1320
     gacaaaagat acccgaatgt aatccgtgtc gctccagttc atctttacaa taactacgtg      1380
     gatattcgca gattcatttc cgtgctccaa gaagttgctc atattgtcga aagcgaataa      1440
     ttcaattggc tcgcaatgta gtagaataat gttgaacgta ttgtatgagt tttatatgaa      1500
     aaaaatgatc tgaatgtg                                                    1518
//


  
spacer
spacer