![]() |
EBI DbfetchID U70847; SV 1; linear; genomic DNA; STD; INV; 4069 BP. XX AC U70847; XX DT 01-OCT-1996 (Rel. 49, Created) DT 01-SEP-2005 (Rel. 85, Last updated, Version 13) XX DE Caenorhabditis elegans cosmid K09B3, complete sequence. XX KW HTG. XX OS Caenorhabditis elegans OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea; OC Rhabditidae; Peloderinae; Caenorhabditis. XX RN [1] RP 1-4069 RX DOI; 10.1126/science.282.5396.2012 RX PUBMED; 9851916. RG C. elegans Sequencing Consortium RA ; RT "Genome sequence of the nematode C. elegans: a platform for investigating RT biology"; RL Science 282(5396):2012-2018(1998). XX RN [2] RP 1-4069 RA Wohldmann P., Beck C.; RT "The sequence of C. elegans cosmid K09B3"; RL Unpublished. XX RN [3] RP 1-4069 RA Waterston R.; RT ; RL Submitted (15-SEP-1996) to the EMBL/GenBank/DDBJ databases. RL Department of Genetics, Washington University, Genome Sequencing Center, RL 4444 Forest Park Avenue, St. Louis, MO 63110, USA XX RN [4] RP 1-4069 RA Waterston R.; RT ; RL Submitted (23-MAY-2002) to the EMBL/GenBank/DDBJ databases. RL Department of Genetics, Washington University, Genome Sequencing Center, RL 4444 Forest Park Avenue, St. Louis, MO 63110, USA XX RN [5] RP 1-4069 RA Waterston R.; RT ; RL Submitted (28-AUG-2002) to the EMBL/GenBank/DDBJ databases. RL Department of Genetics, Washington University, Genome Sequencing Center, RL 4444 Forest Park Avenue, St. Louis, MO 63110, USA XX RN [6] RP 1-4069 RA Waterston R.; RT ; RL Submitted (19-NOV-2002) to the EMBL/GenBank/DDBJ databases. RL Department of Genetics, Washington University, Genome Sequencing Center, RL 4444 Forest Park Avenue, St. Louis, MO 63110, USA XX RN [7] RP 1-4069 RG WormBase Consortium RA ; RT ; RL Submitted (21-SEP-2004) to the EMBL/GenBank/DDBJ databases. RL Department of Genetics, Washington University, Genome Sequencing Center, RL 4444 Forest Park Avenue, St. Louis, MO 63110, USA XX RN [8] RP 1-4069 RG WormBase Consortium RA ; RT ; RL Submitted (31-AUG-2005) to the EMBL/GenBank/DDBJ databases. RL Department of Genetics, Washington University, Genome Sequencing Center, RL 4444 Forest Park Avenue, St. Louis, MO 63110, USA XX DR EMBL-CON; BX284604. XX CC Submitted by: CC Genome Sequencing Center CC Department of Genetics, Washington University CC St. Louis , MO 63110, USA, and CC Sanger Centre, Hinxton Hall CC Cambridge CB10 IRQ, England CC email: submissions@watson.wustl.edu and jes@sanger.ac.uk CC NOTICE: This sequence may not be the entire insert of this clone. CC It may be shorter because we only sequence overlapping sections CC once, or longer because we provide a small overlap between CC neighboring submissions. CC This sequence was finished as follows unless otherwise noted: all CC regions were double stranded, sequenced with an alternate chemistry CC or covered by high quality data (i.e., phred quality >= 30); an CC attempt was made to resolve all sequencing problems, such as CC compressions and repeats; all regions were covered by sequence from CC more than one m13 subclone. CC For a graphical representation of this clone sequence and its CC analysis see: CC http://www.wormbase.org/db/seq/sequence?name=K09B3;class=Sequence CC NEIGHBORING CLONE INFORMATION CC The 5' clone is W07G9, 1000 bp overlap; the 3' clone is M02B7, 700 CC bp overlap. Actual start of this clone is at base position 1 of CC K09B3; actual end is at 34292 of M02B7. CC NOTES: CC Coding seqences below are the result of integration and manual CC review of the following data : computer analysis using the program CC Genefinder (P. Green and L. Hillier, personal communication), the CC large scale EST projects of Yuji Kohara CC (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html) and The C. CC elegans ORFeome cloning project (http://worfdb.dfci.harvard.edu/), CC similarity to other proteins from BlastX analyses CC (http://blast.wustl.edu/), sequence conservation with C. briggsae CC using Jim Kent's WABA alignment program (Genome Research CC 10:1115-1125, 2000), individual C. elegans GenBank submissions, CC and personal communications with C. elegans researchers. tRNAs CC are predicted using the program tRNAscan-SE (Lowe, T.M. and CC Eddy, S.R., 1997, Nucl. Acids. Res., 25, 955-964). XX FH Key Location/Qualifiers FH FT source 1..4069 FT /organism="Caenorhabditis elegans" FT /chromosome="1" FT /strain="Bristol N2" FT /mol_type="genomic DNA" FT /clone="K09B3" FT /db_xref="taxon:6239" FT gene 588..949 FT /locus_tag="K09B3.2" FT CDS join(588..743,827..949) FT /codon_start=1 FT /locus_tag="K09B3.2" FT /standard_name="K09B3.2" FT /product="Hypothetical protein K09B3.2" FT /note="contains similarity to Pfam domain PF03360 FT (Glycosyltransferase family 43); coded for by the following FT C. elegans cDNAs: OSTF061B10_1, OSTR061B10_1" FT /db_xref="GOA:Q94267" FT /db_xref="InterPro:IPR005027" FT /db_xref="UniProtKB/TrEMBL:Q94267" FT /protein_id="AAB09106.1" FT /translation="MAGFAVNLKVVLNSDAVFGTACKRGGGAPETCLLEDMGLEREDIE FT PFGYEKDKDREILVWHTKTSTPNIVKSNKNSTKKAPPPDTFGYFVEA" FT gene 1889..3347 FT /locus_tag="K09B3.1" FT CDS join(1889..1913,2286..2436,2829..2926,2975..3034, FT 3082..3216,3259..3347) FT /codon_start=1 FT /locus_tag="K09B3.1" FT /standard_name="K09B3.1" FT /product="Hypothetical protein K09B3.1" FT /note="contains similarity to Pfam domain PF01549 (ShTK FT domain); coded for by the following C. elegans cDNAs: FT OSTF127F12_1, OSTR127F12_1, yk1283a12.3" FT /db_xref="InterPro:IPR003582" FT /db_xref="UniProtKB/TrEMBL:Q94268" FT /db_xref="WormBase:WBGene00019545" FT /protein_id="AAB09105.2" FT /translation="MRRLVLLLALCFSCSFAQYGLYGSGYNGYGGYGSGYGTGYGMGYG FT MGYPMNPYMNQVYGYGSNPMMSGGGGMYGAGGCMDLSPQCSVWASTGQCSTNPMMMRQT FT CAMSCGTCSTGVGALGSMGSYNPFGGYGGMGYPQAGLLDPLAQFIGRSIYETGILRQPT FT PSSSSKSIGILTKAKSEQLPSR" XX SQ Sequence 4069 BP; 1129 A; 844 C; 881 G; 1215 T; 0 other; gatcatgagg aaggtgtcgt gtactttgga gatgacgata attcgtatga tacgcggctt 60 tttacggagt atatcaggaa tgtgaaaacg ctcggaattt gggcagttgg tgagtttaat 120 aacggctgac gatatatggg gtttttggct ttatgggctc cgaaattgat aggcacgaca 180 tggtgcatcg gcgagaaaac tttgaaagct tttatctcca tggttttatg ttttattgaa 240 aagttatcaa ctgacaaaat atttgctata aaattgtcta caactttgta gttgagagct 300 ttttgatata ctcaaagcca acttagttat agactgctga gctccagcga atgtcatggt 360 gcatcgacat gaaaactttg actaaaacac ttttatctca gttgctatta gttttacttt 420 tgaaaattgt catttttcaa taattatacc tgaaccaact gaaataaata ctattcaatt 480 ttccaggcct ggtcggtggt actgtagtcg aagcgccaaa agtggtgggc ggcaaggtga 540 cggcatttaa cgtgaaatgg aacccgaagc ggcgatttgc ggtggatatg gccggttttg 600 cagtcaacct gaaagtggtc ctgaactctg atgcagtatt cggtaccgct tgtaaacgcg 660 gcggtggtgc cccggaaaca tgtctgctgg aggatatggg tctagaacga gaggacattg 720 agccattcgg ttatgagaag gatgttagtt tttaagtttt ttgtgtctga aattttttga 780 tattttgaca aaaaaaacag aaacataata tgaattaatt tttcagaaag accgtgaaat 840 cctggtatgg catacaaaaa ccagtacgcc gaacatcgtg aaatccaaca aaaattccac 900 gaaaaaagcc cctccgccag acaccttcgg gtactttgtt gaagcgtgat atgataattc 960 tttttctgtt aaataatttt cctacaaaaa ttttgatgac gggtgccatc gtgtaattct 1020 cttcccccaa cttccccctt gctagccgct tctaattcct cttttcgggt ttttctcttt 1080 ctgtgtactt tttacttgct tccgttctgt ttttgtattc ataaaaatgc tgattttaaa 1140 ttttgttgtt gacaatcaaa ccttgctgcc tgccgacgtg cctgcctatg tcagtttata 1200 aagcgactgt ttactgccta ctaggcagta aatataatac aaatatatct tcaaggcgtc 1260 cttgacctgc ctacggggca atccatgcct gtcaaccggg aaacctagat ttgcctaaaa 1320 agcggcatat gcctttataa gacgaaagca atgcctgcca gcccgcgagg cagtctagac 1380 ctacctacaa ggcaaattat gccatgccga catagacttg cctgcaattt ttgcctttaa 1440 ggcttttaag gcaatttaag cttgcctaca aggcgacctc gacttgcgtc caaggtaacc 1500 tagacctaga atttgctacg aaactatata tcactcagcc tacaaggcaa cctatgccta 1560 cctacatggc aaaactaaat gttttttttt aagtttttga aggggacctc tctctcttaa 1620 gtttccatat gttttacttt tttttcaaat tcgaaatttc ccaaaaaatt catcaccaaa 1680 ttttttgcct tccctgacta accggtcctt gacaagtaca tacaaccggg cggattactc 1740 gtgtaaaaac tgcgaagcac agatgacaaa aggattacga gaaacaactg aaaccccgag 1800 atggaaaaaa cttgttcata ccgggaaggt ataaaaggcg aaaagtaccg gatatttggt 1860 tttgaactta ttttttgaag cagatgagat gaggcggttg gtcttattac taggtgagaa 1920 ttttcaattt tttagactta acagtctaaa aagtatctcc gcggtcatgc aagctatcaa 1980 aaaagtgtca actaatcaaa tgttggaaat gtttaataat tagtttttga gttaacaata 2040 tttttatatc atgaataggc acatagacac agatgctact tacctctagt ttcgagtccc 2100 agacagtaat ctcacaatag tgataataat atcttttatt aatgagaatg ctatgaagag 2160 tagtaaatac aagagaccgt tggtcagaga gagatgctca atggctaggc attaggacta 2220 gaactatacc ggctgataag ccgcagtaat caaactatag gtccacacaa catccttttt 2280 ttcagcacta tgtttctcct gctcttttgc tcaatatggc ctctatggta gcggatataa 2340 tggctatgga gggtacggta gcggctatgg aactgggtac ggaatgggct atggaatggg 2400 ctatccaatg aatccgtata tgaatcaggt ttatgggtga gattttttaa agtttaactt 2460 tgagcgagta tatctctgcg tgggttggaa ctacagaaaa gttgtcagtt gtcaactgaa 2520 aattggcagc ccttgaattt tttttgttag ttgacaactt ttttctagtt ctcactgttg 2580 cggagctata acagtttccc gtcacgcgcg gtacatgagc gagagagact caaaaagacg 2640 cagaaaggtc aggcgctcta gcttctctct catctctccc ttcctttctc ttccacactt 2700 ctcgcatatc tcatctgttc gtttagaacc gacgcgacag aaattaccca acccaaccca 2760 aatcctccaa ctgaaaacca acgcgactgc cgcggcctgc cttccatgag atcacggagt 2820 ctttgcaggt atggtagcaa tccgatgatg tccggtggcg gcgggatgta tggagctggc 2880 ggatgcatgg atctgagccc tcaatgctcc gtttgggcga gcacaggtag accgaagcat 2940 aaagttctag cagaaatttt gtagtttttt tcaggccaat gctctacaaa tccaatgatg 3000 atgcggcaga cgtgtgcaat gtcctgtgga acttgtaagt agtcgttcag ggcgtgcgct 3060 actattagag ttttgtttca ggcagcacag gtgttggagc tctcgggtcc atggggtcct 3120 acaacccatt tggagggtac ggagggatgg gataccccca agccggtttg ttagaccccc 3180 ttgctcaatt catcgggaga tcgatttacg aaactggtga ggcttagagc gcacttgcaa 3240 ccttaaaatt ttcttcaggg atcctccgac aaccaactcc atcgagtagt tccaagagca 3300 ttggtatact gacgaaagcg aaaagtgagc agttgccgag ccggtgatca actggattag 3360 aatctattta tgctacctgt ataaaattga taaagatctc acattcatta attttattct 3420 cgatatcacc cgtgcaaact attcagcaaa tttctaaaca tcacaatgta cacacgaatt 3480 atttgggatt cagagaggct agtcgaggtg gtcgttttgc agattgagcc cgctcgtaga 3540 gagcatcagt gtagacgact ccgttgacaa gaccatcgaa tgttggggtc ttggatggaa 3600 atagcgattt ggaagttgat actgaaactt tttgactttt tgcaaggtct aaggggtcta 3660 actcacattt ttgatcactc ttgtacttgt tgatcacatt gatctgcttc actggcttga 3720 agtagctgtt tgccaggttt tccgagtcac ttttcgctag ccgctcgcgt agtttgtcgt 3780 caaagttaga ccgtaataag gtgttctctt gccgagcagg ttcttgcata taacctgaaa 3840 gatggaagaa taaagcaatc ccaaattgct agaataatcg gccgcttttt ggcaatcccg 3900 ccttacatgt ttcttggaaa agctcaggct tagcagtgag gcctgcaatc taatcaaatc 3960 cagacctacc attattagcc aacggcagaa tagtccgact catcgacatt tccggcaaaa 4020 tccatcgact ctcgtcttca ttccacaccg cctccttctt tatccgatc 4069 // ![]() |