EBI Dbfetch

ID   AF303267; SV 1; linear; mRNA; STD; INV; 1518 BP.
XX
AC   AF303267;
XX
DT   18-JAN-2001 (Rel. 66, Created)
DT   18-JAN-2001 (Rel. 66, Last updated, Version 1)
XX
DE   Caenorhabditis elegans kynureninase (XG726) mRNA, complete cds.
XX
KW   Worm Transcriptome Project.
XX
OS   Caenorhabditis elegans
OC   Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;
OC   Rhabditidae; Peloderinae; Caenorhabditis.
XX
RN   [1]
RP   1-1518
RA   Kohara Y., Shin'i T., Suzuki Y., Sugano S., Potdevin M., Thierry-Mieg Y.,
RA   Thierry-Mieg D., Thierry-Mieg J.;
RT   "Expressed genes in C. elegans";
RL   Unpublished.
XX
RN   [2]
RP   1-1518
RA   Kohara Y., Shin'i T., Vidal M., Sugano S., Suzuki Y., Thierry-Mieg D.,
RA   Thierry-Mieg J.;
RT   ;
RL   Submitted (21-AUG-2000) to the EMBL/GenBank/DDBJ databases.
RL   National Institute of Genetics, Mishima, Japan
XX
CC   Complete mRNA for kynureninase gene XG726 (YK6646) in C. elegans.
CC   This gene is located on chromosome X on the reverse strand, its
CC   3' end is in megabase 7, near kilobase 726. It is encoded by cosmid
CC   C15H9 from base 20421 to base 18009. Its extrapolated genetic
CC   position is (X; -3.51). No phenotype has yet been reported to our
CC   knowledge; this gene's function is yet unknown.
CC   Clones
CC   This mRNA is represented by 6 clones, all compatible with a single
CC   mRNA type. Clones match the mRNA starting on base:
CC   yc707D10 -> base 1   ;  yc708E8  -> base 1   ;
CC   yk502b11 -> base 1   ;  yk462d11 -> base 8   ;
CC   yc713B10 -> base 75  ;  dvpl6g83 -> base 911 ;
CC   Transcripts from this gene are present at all stages : 2 of 3
CC   staged clones come from young embryos, 1 of 3 staged clones comes
CC   from mixed stage.
CC   3 clones are from a capped library, 2 clones start with an exact
CC   SL1 transpliced leader (beware, they both contain point mutations.
CC   use yk502b11 to produce the protein).
CC   The 5'UTR contains 3 bp:
CC   ATC
CC   The transpliced leader SL1 (GGTTTAATTACCCAAGTTTGAG) is present in
CC   front of the submitted sequence. The first ATG starts at nucleotide
CC   4.
CC   Splicing
CC   Comparison to the genome sequence confirms genome sequence accuracy
CC   and shows that the 8/8 introns follow the consensus [gt-ag] rule.
CC   Coordinates of the introns in the template genomic DNA, where 1
CC   denotes the first genomic base matching the RNA, are
CC   [ type  ]   start     end    length
CC   [ gt_ag ]      94     141     48 bp
CC   [ gt_ag ]     260     729    470 bp
CC   [ gt_ag ]     851     895     45 bp
CC   [ gt_ag ]    1041    1086     46 bp
CC   [ gt_ag ]    1381    1467     87 bp
CC   [ gt_ag ]    1694    1749     56 bp
CC   [ gt_ag ]    1836    1884     49 bp
CC   [ gt_ag ]    2119    2212     94 bp
CC   The 3'UTR contains about 78 bp
CC   TTCAATTGGCTCGCAATGTAGTAGAATAATGTTGAACGTATTGTATGAGT
CC   TTTATATGAAAAAAATGATCTGAATGTG
CC   The standard AATAAA polyadenylation signal does not occur, but the
CC   variant :
CC   AAAAAA is seen, starting 20 to 35 bp before the polyA.
CC   Conceptual translation
CC   The protein encoded between the first in frame amino acid and the
CC   stop codon contains 478 residues (MW: 54.0 kDa; pI: 5.28). The
CC   sequence is identical to the predicted CDS C15H9.7.
CC   Homology
CC   BLASTp search run at NCBI on database nr on Aug 18, 2000 shows 13
CC   hits with expectancy < 0.001.
CC   There is a single hit in C. elegans, probably to self.
CC   Lineage report
CC   Based on S. Federhen and S. Shavirin's tax BLAST at NCBI, homologs
CC   to this gene are found in cellular organisms (actinomycetes,
CC   animals, chlamydias, crenarchaeotes, eubacteria, eukaryotes,
CC   euryarchaeotes, fungi, low GC Gram+, mammals, nematodes).
CC   PFAM analysis, carried on Sean Eddy's server at
CC   http://pfam.wustl.edu on Aug 18, 2000 shows 1 non relevant hit
CC   (negative score)
CC   Neighbors
CC   The XG726 gene is in a 40 kb region coding for 6 well transcribed
CC   genes. The closest neighbors are XG722 (on the same strand) and
CC   XG745 (on the opposite strand).
CC   Please mail any question to mieg@ncbi.nlm.nih.gov.
CC   Map location: XG726.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..1518
FT                   /organism="Caenorhabditis elegans"
FT                   /mol_type="mRNA"
FT                   /db_xref="taxon:6239"
FT   CDS             4..1440
FT                   /codon_start=1
FT                   /gene="XG726"
FT                   /product="kynureninase"
FT                   /db_xref="GOA:Q18026"
FT                   /db_xref="InterPro:IPR015421"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q18026"
FT                   /experiment="experimental evidence, no additional details
FT                   recorded"
FT                   /protein_id="AAG50225.1"
FT                   /translation="MSDAPPQPENEQECMCTQDKVLQFLNKMADESGIKDLTDPALAEF
FT                   LSDSDALKEIRDLFHYPKAGTLPDADPSLVDPESDSIYLCGNSLGLMPKATGEVMKDHL
FT                   DKWAKMGVFGHMSGEVPWAHCDEYCLEGVGRLVGAKKEEVSVCNSLTVNIHVLLTSFYK
FT                   PTETRHKILLESKAFPSDHYAIESQIRLKGRTVQDSMVCLEPREGEETLRTEDILDYIE
FT                   KNGDEIAIVFFSGIQYYTGQLFDMRAITEAGHRKGCFVGFDLAHAFANVPLHLHWWDVD
FT                   FACWCSYKYGCTGAGSIGGLFVHERFLNDQRERMLGWWSHKMSSRFVMDNVLDLDEGAA
FT                   GYRISNPPIHTVAAMLGSLKVFDQVSLENLRSRSCYLTGYLEYLVKTLFGENSEQRTTK
FT                   LSISIITPEEFHQRGCQLSLKFSSPIDIIYPELVKRGCAVDKRYPNVIRVAPVHLYNNY
FT                   VDIRRFISVLQEVAHIVESE"
XX
SQ   Sequence 1518 BP; 420 A; 299 C; 356 G; 443 T; 0 other;
     atcatgtcgg acgccccacc gcagcctgaa aatgagcaag aatgcatgtg cactcaagat        60
     aaagttcttc agtttttaaa caaaatggct gatgaaagtg gaatcaaaga tttgacggac       120
     ccggctcttg cagaatttct ttccgatagt gatgcgctga aagaaattag agacttgttc       180
     cattatccga aagccggaac tcttccagat gctgatccaa gccttgttga tccagagagt       240
     gatagcattt atctatgcgg aaactcactc ggattgatgc caaaagcaac cggcgaagtc       300
     atgaaagatc atttggacaa atgggccaaa atgggagtat tcgggcatat gtctggagaa       360
     gttccatggg ctcattgtga tgagtactgc ttagaaggtg tgggacgtct tgttggtgcc       420
     aagaaagagg aagtttccgt gtgcaatagt ttaactgtta atatccacgt gctcttgact       480
     tccttctaca aacccacgga gactcgtcat aaaatccttc tcgaatccaa agcattccca       540
     tccgatcatt atgcaattga atctcagatt cgtttgaagg gaagaactgt tcaggacagt       600
     atggtttgtc ttgaaccacg agagggagag gaaactcttc gcacagaaga catccttgat       660
     tatattgaga aaaatggaga tgagattgca atcgtgtttt tctcgggaat ccagtactat       720
     accggacagc tttttgacat gagggctatt actgaagcgg gacatcgaaa gggatgcttt       780
     gttggcttcg atttggccca tgctttcgca aacgttcccc ttcatcttca ctggtgggat       840
     gttgactttg cctgttggtg ctcgtataag tacggatgta caggagctgg atcaattggt       900
     ggtctttttg tccatgaaag atttttgaat gaccaacggg agcgtatgtt gggatggtgg       960
     agtcataaaa tgtcatcgag atttgtcatg gacaatgttc tcgacttgga cgagggagct      1020
     gctggttaca gaatcagtaa ccctccaatt cataccgtcg ctgctatgct gggaagtttg      1080
     aaggttttcg atcaagtctc acttgaaaac cttcgtagtc gttcatgcta tttgaccggg      1140
     tacctggagt acttagtgaa aactttgttc ggtgaaaact ctgagcaaag aactactaag      1200
     ctctcaattt ctatcatcac tccagaagag ttccatcagc gtggttgcca actttctctg      1260
     aaattctctt ctccaatcga cattatttat cctgagcttg ttaaacgcgg atgtgcggtt      1320
     gacaaaagat acccgaatgt aatccgtgtc gctccagttc atctttacaa taactacgtg      1380
     gatattcgca gattcatttc cgtgctccaa gaagttgctc atattgtcga aagcgaataa      1440
     ttcaattggc tcgcaatgta gtagaataat gttgaacgta ttgtatgagt tttatatgaa      1500
     aaaaatgatc tgaatgtg                                                    1518
//
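
The coordinates quoted in the record can be cross-checked against one another. The following is a minimal Python sketch of that arithmetic, not part of the EMBL entry itself. All numbers are copied from the record above; the helper names (`span`, `check_record`) are hypothetical. It also assumes that the genomic footprint on cosmid C15H9 equals the mRNA length plus the total intron length, which the record does not state explicitly.

```python
# Figures copied from the AF303267 record; nothing here is fetched or parsed.
MRNA_LENGTH = 1518
CDS_START, CDS_END = 4, 1440
PROTEIN_RESIDUES = 478
COSMID_FROM, COSMID_TO = 20421, 18009          # reverse strand, so FROM > TO
BASE_COUNTS = {"A": 420, "C": 299, "G": 356, "T": 443}
# (start, end, stated length) for each gt_ag intron in genomic coordinates
INTRONS = [
    (94, 141, 48), (260, 729, 470), (851, 895, 45), (1041, 1086, 46),
    (1381, 1467, 87), (1694, 1749, 56), (1836, 1884, 49), (2119, 2212, 94),
]


def span(a: int, b: int) -> int:
    """Length of an inclusive 1-based interval, in either orientation."""
    return abs(b - a) + 1


def check_record() -> dict:
    """Recompute the derived figures and compare them with the stated ones."""
    cds_length = span(CDS_START, CDS_END)
    intron_total = sum(span(start, end) for start, end, _ in INTRONS)
    return {
        "cds_length": cds_length,
        # One codon is the stop codon, so codons = residues + 1.
        "cds_matches_protein": cds_length == 3 * (PROTEIN_RESIDUES + 1),
        "utr5_length": CDS_START - 1,
        "utr3_length": MRNA_LENGTH - CDS_END,
        "intron_lengths_ok": all(span(s, e) == n for s, e, n in INTRONS),
        "intron_total": intron_total,
        # Assumption: genomic footprint = spliced mRNA + introns.
        "genomic_span_ok": span(COSMID_FROM, COSMID_TO) == MRNA_LENGTH + intron_total,
        "composition_ok": sum(BASE_COUNTS.values()) == MRNA_LENGTH,
    }


if __name__ == "__main__":
    for name, value in check_record().items():
        print(f"{name}: {value}")
```

Under these assumptions the stated values agree: the CDS spans 1437 nt (479 codons, i.e. 478 residues plus the stop codon), the UTRs are 3 bp and 78 bp, and the eight introns sum to 895 bp, which together with the 1518 bp mRNA gives the 2413 bp interval on the cosmid.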