LOCUS       NM_068123               1011 bp    mRNA    linear   INV 22-NOV-2023
DEFINITION  Caenorhabditis elegans Nematode cuticle collagen N-terminal
            domain-containing protein (col-33), mRNA.
ACCESSION   NM_068123
VERSION     NM_068123.5
DBLINK      BioProject: PRJNA158
KEYWORDS    RefSeq.
SOURCE      Caenorhabditis elegans
  ORGANISM  Caenorhabditis elegans
            Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
            Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
            Caenorhabditis.
REFERENCE   1  (bases 1 to 1011)
  AUTHORS   Sulson,J.E. and Waterston,R.
  CONSRTM   Caenorhabditis elegans Sequencing Consortium
  TITLE     Genome sequence of the nematode C. elegans: a platform for
            investigating biology
  JOURNAL   Science 282 (5396), 2012-2018 (1998)
   PUBMED   9851916
  REMARK    Erratum:[Science 1999 Jan 1;283(5398):35]
REFERENCE   2  (bases 1 to 1011)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (22-NOV-2023) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 1011)
  AUTHORS   WormBase.
  CONSRTM   WormBase Consortium
  TITLE     Direct Submission
  JOURNAL   Submitted (29-OCT-2023) WormBase Group, European Bioinformatics
            Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org
REFERENCE   4  (bases 1 to 1011)
  AUTHORS   Sulson,J.E. and Waterston,R.
  TITLE     Direct Submission
  JOURNAL   Submitted (03-MAR-2003) Nematode Sequencing Project: Sanger
            Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute
            at Washington University, St. Louis, MO 63110, USA
COMMENT     REVIEWED REFSEQ: This record has been curated by WormBase. This
            record is derived from an annotated genomic sequence (NC_003282).
            
            On Sep 4, 2019 this sequence version replaced NM_068123.4.
FEATURES             Location/Qualifiers
     source          1..1011
                     /organism="Caenorhabditis elegans"
                     /mol_type="mRNA"
                     /strain="Bristol N2"
                     /db_xref="taxon:6239"
                     /chromosome="IV"
     gene            1..1011
                     /gene="col-33"
                     /locus_tag="CELE_F36A4.6"
                     /db_xref="GeneID:177191"
                     /db_xref="WormBase:WBGene00000610"
     CDS             49..963
                     /gene="col-33"
                     /locus_tag="CELE_F36A4.6"
                     /standard_name="F36A4.6"
                     /note="Confirmed by transcript evidence"
                     /codon_start=1
                     /product="Nematode cuticle collagen N-terminal
                     domain-containing protein"
                     /protein_id="NP_500524.2"
                     /db_xref="EnsemblGenomes-Gn:WBGene00000610"
                     /db_xref="EnsemblGenomes-Tr:F36A4.6"
                     /db_xref="GeneID:177191"
                     /db_xref="GOA:Q20091"
                     /db_xref="InterPro:IPR002486"
                     /db_xref="InterPro:IPR008160"
                     /db_xref="UniProtKB/TrEMBL:Q20091"
                     /db_xref="WormBase:WBGene00000610"
                     /translation="MEVQEIKNRMKAYRFVAYSAVAFSVVAVISVCVTLPMVYNYVHH
                     VKRSMQNEIVYCRGSAKDIWSEVRTLKTALEPIQNRTARQAYADAAVHGGGGGGGNCE
                     ACCLPGPAGPAGAPGNPGRPGKPGAPGLPGNPGKPPVQPCEPITPPPCKPCPDGPAGP
                     PGPPGAPGDAGTNGAPGAPGGDAPPGEAGPKGPPGPPGSPGAPGEPGRPGDDAPSEPL
                     IPGEPGPQGPPGPPGQAGPDGQPGAPGGPGTPGSKGPPGNPGAPGADGKPGAPGQAGP
                     AGAKGEKGICPKYCAIDGGVFFEDGTRR"
ORIGIN      
        1 aatggacact tttttcttta ttcttactga aaaagtgcga agctagtcat ggaggtccag
       61 gagattaaga atcggatgaa agcctatcgg tttgtagcct actccgcagt ggcattctcg
      121 gtggttgcag ttatttcggt atgcgtcaca cttccaatgg tgtacaacta cgtgcatcat
      181 gtcaagagaa gcatgcaaaa tgagattgtc tattgtaggg gctccgcaaa agacatctgg
      241 tcagaagtgc gcaccctgaa aaccgctctg gaaccgatcc aaaaccgtac agcccgtcag
      301 gcgtacgcag atgccgcagt tcacggagga ggcggtggag gcggaaactg tgaggcctgc
      361 tgccttccag gacccgctgg acccgccggt gccccaggaa accctgggag accgggtaaa
      421 cctggagctc caggacttcc aggaaaccct ggaaagccac cagttcagcc ttgtgagcca
      481 attactccac caccatgcaa accatgtcca gatggacccg cgggaccacc aggaccacct
      541 ggagcacctg gagatgctgg aacaaacgga gctccgggag cgcctggagg tgatgctcca
      601 cccggtgagg caggaccgaa aggaccacca ggaccaccag gatcccccgg agcacccgga
      661 gagccaggaa gacctggaga cgacgcacca agtgagccac tcatcccagg ggagccagga
      721 ccacaaggac caccgggacc acctggacaa gccggaccag acggacaacc tggagcaccc
      781 ggaggacccg gaactcctgg atcaaaagga ccaccgggta acccaggagc tcctggagcc
      841 gatggaaagc ctggagcacc aggacaagct ggaccagctg gagcgaaggg agagaaggga
      901 atctgtccga aatattgtgc gattgacgga ggtgtcttct tcgaggacgg cactcggcga
      961 taaaggattt tatttattca tgattatgac aataaacatt ggctttaact t
//