
EBI Dbfetch

ID   U70847; SV 1; linear; genomic DNA; STD; INV; 4069 BP.
XX
AC   U70847;
XX
DT   01-OCT-1996 (Rel. 49, Created)
DT   01-SEP-2005 (Rel. 85, Last updated, Version 13)
XX
DE   Caenorhabditis elegans cosmid K09B3, complete sequence.
XX
KW   HTG.
XX
OS   Caenorhabditis elegans
OC   Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea;
OC   Rhabditidae; Peloderinae; Caenorhabditis.
XX
RN   [1]
RP   1-4069
RX   DOI; 10.1126/science.282.5396.2012
RX   PUBMED; 9851916.
RG   C. elegans Sequencing Consortium
RA   ;
RT   "Genome sequence of the nematode C. elegans: a platform for investigating
RT   biology";
RL   Science 282(5396):2012-2018(1998).
XX
RN   [2]
RP   1-4069
RA   Wohldmann P., Beck C.;
RT   "The sequence of C. elegans cosmid K09B3";
RL   Unpublished.
XX
RN   [3]
RP   1-4069
RA   Waterston R.;
RT   ;
RL   Submitted (15-SEP-1996) to the EMBL/GenBank/DDBJ databases.
RL   Department of Genetics, Washington University, Genome Sequencing Center,
RL   4444 Forest Park Avenue, St. Louis, MO 63110, USA
XX
RN   [4]
RP   1-4069
RA   Waterston R.;
RT   ;
RL   Submitted (23-MAY-2002) to the EMBL/GenBank/DDBJ databases.
RL   Department of Genetics, Washington University, Genome Sequencing Center,
RL   4444 Forest Park Avenue, St. Louis, MO 63110, USA
XX
RN   [5]
RP   1-4069
RA   Waterston R.;
RT   ;
RL   Submitted (28-AUG-2002) to the EMBL/GenBank/DDBJ databases.
RL   Department of Genetics, Washington University, Genome Sequencing Center,
RL   4444 Forest Park Avenue, St. Louis, MO 63110, USA
XX
RN   [6]
RP   1-4069
RA   Waterston R.;
RT   ;
RL   Submitted (19-NOV-2002) to the EMBL/GenBank/DDBJ databases.
RL   Department of Genetics, Washington University, Genome Sequencing Center,
RL   4444 Forest Park Avenue, St. Louis, MO 63110, USA
XX
RN   [7]
RP   1-4069
RG   WormBase Consortium
RA   ;
RT   ;
RL   Submitted (21-SEP-2004) to the EMBL/GenBank/DDBJ databases.
RL   Department of Genetics, Washington University, Genome Sequencing Center,
RL   4444 Forest Park Avenue, St. Louis, MO 63110, USA
XX
RN   [8]
RP   1-4069
RG   WormBase Consortium
RA   ;
RT   ;
RL   Submitted (31-AUG-2005) to the EMBL/GenBank/DDBJ databases.
RL   Department of Genetics, Washington University, Genome Sequencing Center,
RL   4444 Forest Park Avenue, St. Louis, MO 63110, USA
XX
DR   EMBL-CON; BX284604.
XX
CC   Submitted by:
CC          Genome Sequencing Center
CC          Department of Genetics, Washington University
CC          St. Louis, MO 63110, USA, and
CC          Sanger Centre, Hinxton Hall
CC          Cambridge CB10 1RQ, England
CC          email: submissions@watson.wustl.edu and jes@sanger.ac.uk
CC   NOTICE:  This sequence may not be the entire insert of this clone.
CC   It may be shorter because we only sequence overlapping sections
CC   once, or longer because we provide a small overlap between
CC   neighboring submissions.
CC   This sequence was finished as follows unless otherwise noted: all
CC   regions were double stranded, sequenced with an alternate chemistry
CC   or covered by high quality data (i.e., phred quality >= 30); an
CC   attempt was made to resolve all sequencing problems, such as
CC   compressions and repeats; all regions were covered by sequence from
CC   more than one m13 subclone.
CC   For a graphical representation of this clone sequence and its
CC   analysis see:
CC   http://www.wormbase.org/db/seq/sequence?name=K09B3;class=Sequence
CC               NEIGHBORING CLONE INFORMATION
CC   The 5' clone is W07G9, 1000 bp overlap; the 3' clone is M02B7, 700
CC   bp overlap. Actual start of this clone is at base position 1 of
CC   K09B3; actual end is at 34292 of M02B7.
CC   NOTES:
CC   Coding sequences below are the result of integration and manual
CC   review of the following data: computer analysis using the program
CC   Genefinder (P. Green and L. Hillier, personal communication), the
CC   large scale EST projects of Yuji Kohara
CC   (http://www.ddbj.nig.ac.jp/c-elegans/html/CE_INDEX.html) and The C.
CC   elegans ORFeome cloning project (http://worfdb.dfci.harvard.edu/),
CC   similarity to other proteins from BlastX analyses
CC   (http://blast.wustl.edu/), sequence conservation with C. briggsae
CC   using Jim Kent's WABA alignment program (Genome Research
CC   10:1115-1125, 2000), individual C. elegans GenBank submissions,
CC   and personal communications with C. elegans researchers. tRNAs
CC   are predicted using the program tRNAscan-SE (Lowe, T.M. and
CC   Eddy, S.R., 1997, Nucl. Acids. Res., 25, 955-964).
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4069
FT                   /organism="Caenorhabditis elegans"
FT                   /chromosome="1"
FT                   /strain="Bristol N2"
FT                   /mol_type="genomic DNA"
FT                   /clone="K09B3"
FT                   /db_xref="taxon:6239"
FT   gene            588..949
FT                   /locus_tag="K09B3.2"
FT   CDS             join(588..743,827..949)
FT                   /codon_start=1
FT                   /locus_tag="K09B3.2"
FT                   /standard_name="K09B3.2"
FT                   /product="Hypothetical protein K09B3.2"
FT                   /note="contains similarity to Pfam domain PF03360
FT                   (Glycosyltransferase family 43); coded for by the following
FT                   C. elegans cDNAs: OSTF061B10_1, OSTR061B10_1"
FT                   /db_xref="GOA:Q94267"
FT                   /db_xref="InterPro:IPR005027"
FT                   /db_xref="UniProtKB/TrEMBL:Q94267"
FT                   /protein_id="AAB09106.1"
FT                   /translation="MAGFAVNLKVVLNSDAVFGTACKRGGGAPETCLLEDMGLEREDIE
FT                   PFGYEKDKDREILVWHTKTSTPNIVKSNKNSTKKAPPPDTFGYFVEA"
FT   gene            1889..3347
FT                   /locus_tag="K09B3.1"
FT   CDS             join(1889..1913,2286..2436,2829..2926,2975..3034,
FT                   3082..3216,3259..3347)
FT                   /codon_start=1
FT                   /locus_tag="K09B3.1"
FT                   /standard_name="K09B3.1"
FT                   /product="Hypothetical protein K09B3.1"
FT                   /note="contains similarity to Pfam domain PF01549 (ShTK
FT                   domain); coded for by the following C. elegans cDNAs:
FT                   OSTF127F12_1, OSTR127F12_1, yk1283a12.3"
FT                   /db_xref="InterPro:IPR003582"
FT                   /db_xref="UniProtKB/TrEMBL:Q94268"
FT                   /db_xref="WormBase:WBGene00019545"
FT                   /protein_id="AAB09105.2"
FT                   /translation="MRRLVLLLALCFSCSFAQYGLYGSGYNGYGGYGSGYGTGYGMGYG
FT                   MGYPMNPYMNQVYGYGSNPMMSGGGGMYGAGGCMDLSPQCSVWASTGQCSTNPMMMRQT
FT                   CAMSCGTCSTGVGALGSMGSYNPFGGYGGMGYPQAGLLDPLAQFIGRSIYETGILRQPT
FT                   PSSSSKSIGILTKAKSEQLPSR"
XX
SQ   Sequence 4069 BP; 1129 A; 844 C; 881 G; 1215 T; 0 other;
     gatcatgagg aaggtgtcgt gtactttgga gatgacgata attcgtatga tacgcggctt        60
     tttacggagt atatcaggaa tgtgaaaacg ctcggaattt gggcagttgg tgagtttaat       120
     aacggctgac gatatatggg gtttttggct ttatgggctc cgaaattgat aggcacgaca       180
     tggtgcatcg gcgagaaaac tttgaaagct tttatctcca tggttttatg ttttattgaa       240
     aagttatcaa ctgacaaaat atttgctata aaattgtcta caactttgta gttgagagct       300
     ttttgatata ctcaaagcca acttagttat agactgctga gctccagcga atgtcatggt       360
     gcatcgacat gaaaactttg actaaaacac ttttatctca gttgctatta gttttacttt       420
     tgaaaattgt catttttcaa taattatacc tgaaccaact gaaataaata ctattcaatt       480
     ttccaggcct ggtcggtggt actgtagtcg aagcgccaaa agtggtgggc ggcaaggtga       540
     cggcatttaa cgtgaaatgg aacccgaagc ggcgatttgc ggtggatatg gccggttttg       600
     cagtcaacct gaaagtggtc ctgaactctg atgcagtatt cggtaccgct tgtaaacgcg       660
     gcggtggtgc cccggaaaca tgtctgctgg aggatatggg tctagaacga gaggacattg       720
     agccattcgg ttatgagaag gatgttagtt tttaagtttt ttgtgtctga aattttttga       780
     tattttgaca aaaaaaacag aaacataata tgaattaatt tttcagaaag accgtgaaat       840
     cctggtatgg catacaaaaa ccagtacgcc gaacatcgtg aaatccaaca aaaattccac       900
     gaaaaaagcc cctccgccag acaccttcgg gtactttgtt gaagcgtgat atgataattc       960
     tttttctgtt aaataatttt cctacaaaaa ttttgatgac gggtgccatc gtgtaattct      1020
     cttcccccaa cttccccctt gctagccgct tctaattcct cttttcgggt ttttctcttt      1080
     ctgtgtactt tttacttgct tccgttctgt ttttgtattc ataaaaatgc tgattttaaa      1140
     ttttgttgtt gacaatcaaa ccttgctgcc tgccgacgtg cctgcctatg tcagtttata      1200
     aagcgactgt ttactgccta ctaggcagta aatataatac aaatatatct tcaaggcgtc      1260
     cttgacctgc ctacggggca atccatgcct gtcaaccggg aaacctagat ttgcctaaaa      1320
     agcggcatat gcctttataa gacgaaagca atgcctgcca gcccgcgagg cagtctagac      1380
     ctacctacaa ggcaaattat gccatgccga catagacttg cctgcaattt ttgcctttaa      1440
     ggcttttaag gcaatttaag cttgcctaca aggcgacctc gacttgcgtc caaggtaacc      1500
     tagacctaga atttgctacg aaactatata tcactcagcc tacaaggcaa cctatgccta      1560
     cctacatggc aaaactaaat gttttttttt aagtttttga aggggacctc tctctcttaa      1620
     gtttccatat gttttacttt tttttcaaat tcgaaatttc ccaaaaaatt catcaccaaa      1680
     ttttttgcct tccctgacta accggtcctt gacaagtaca tacaaccggg cggattactc      1740
     gtgtaaaaac tgcgaagcac agatgacaaa aggattacga gaaacaactg aaaccccgag      1800
     atggaaaaaa cttgttcata ccgggaaggt ataaaaggcg aaaagtaccg gatatttggt      1860
     tttgaactta ttttttgaag cagatgagat gaggcggttg gtcttattac taggtgagaa      1920
     ttttcaattt tttagactta acagtctaaa aagtatctcc gcggtcatgc aagctatcaa      1980
     aaaagtgtca actaatcaaa tgttggaaat gtttaataat tagtttttga gttaacaata      2040
     tttttatatc atgaataggc acatagacac agatgctact tacctctagt ttcgagtccc      2100
     agacagtaat ctcacaatag tgataataat atcttttatt aatgagaatg ctatgaagag      2160
     tagtaaatac aagagaccgt tggtcagaga gagatgctca atggctaggc attaggacta      2220
     gaactatacc ggctgataag ccgcagtaat caaactatag gtccacacaa catccttttt      2280
     ttcagcacta tgtttctcct gctcttttgc tcaatatggc ctctatggta gcggatataa      2340
     tggctatgga gggtacggta gcggctatgg aactgggtac ggaatgggct atggaatggg      2400
     ctatccaatg aatccgtata tgaatcaggt ttatgggtga gattttttaa agtttaactt      2460
     tgagcgagta tatctctgcg tgggttggaa ctacagaaaa gttgtcagtt gtcaactgaa      2520
     aattggcagc ccttgaattt tttttgttag ttgacaactt ttttctagtt ctcactgttg      2580
     cggagctata acagtttccc gtcacgcgcg gtacatgagc gagagagact caaaaagacg      2640
     cagaaaggtc aggcgctcta gcttctctct catctctccc ttcctttctc ttccacactt      2700
     ctcgcatatc tcatctgttc gtttagaacc gacgcgacag aaattaccca acccaaccca      2760
     aatcctccaa ctgaaaacca acgcgactgc cgcggcctgc cttccatgag atcacggagt      2820
     ctttgcaggt atggtagcaa tccgatgatg tccggtggcg gcgggatgta tggagctggc      2880
     ggatgcatgg atctgagccc tcaatgctcc gtttgggcga gcacaggtag accgaagcat      2940
     aaagttctag cagaaatttt gtagtttttt tcaggccaat gctctacaaa tccaatgatg      3000
     atgcggcaga cgtgtgcaat gtcctgtgga acttgtaagt agtcgttcag ggcgtgcgct      3060
     actattagag ttttgtttca ggcagcacag gtgttggagc tctcgggtcc atggggtcct      3120
     acaacccatt tggagggtac ggagggatgg gataccccca agccggtttg ttagaccccc      3180
     ttgctcaatt catcgggaga tcgatttacg aaactggtga ggcttagagc gcacttgcaa      3240
     ccttaaaatt ttcttcaggg atcctccgac aaccaactcc atcgagtagt tccaagagca      3300
     ttggtatact gacgaaagcg aaaagtgagc agttgccgag ccggtgatca actggattag      3360
     aatctattta tgctacctgt ataaaattga taaagatctc acattcatta attttattct      3420
     cgatatcacc cgtgcaaact attcagcaaa tttctaaaca tcacaatgta cacacgaatt      3480
     atttgggatt cagagaggct agtcgaggtg gtcgttttgc agattgagcc cgctcgtaga      3540
     gagcatcagt gtagacgact ccgttgacaa gaccatcgaa tgttggggtc ttggatggaa      3600
     atagcgattt ggaagttgat actgaaactt tttgactttt tgcaaggtct aaggggtcta      3660
     actcacattt ttgatcactc ttgtacttgt tgatcacatt gatctgcttc actggcttga      3720
     agtagctgtt tgccaggttt tccgagtcac ttttcgctag ccgctcgcgt agtttgtcgt      3780
     caaagttaga ccgtaataag gtgttctctt gccgagcagg ttcttgcata taacctgaaa      3840
     gatggaagaa taaagcaatc ccaaattgct agaataatcg gccgcttttt ggcaatcccg      3900
     ccttacatgt ttcttggaaa agctcaggct tagcagtgag gcctgcaatc taatcaaatc      3960
     cagacctacc attattagcc aacggcagaa tagtccgact catcgacatt tccggcaaaa      4020
     tccatcgact ctcgtcttca ttccacaccg cctccttctt tatccgatc                  4069
//


  