Dbfetch
LOCUS NM_171332 2046 bp mRNA linear INV 22-NOV-2023
DEFINITION Caenorhabditis elegans Leucine-rich repeat protein soc-2 (soc-2),
mRNA.
ACCESSION NM_171332
VERSION NM_171332.5
DBLINK BioProject: PRJNA158
KEYWORDS RefSeq.
SOURCE Caenorhabditis elegans
ORGANISM Caenorhabditis elegans
Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
Caenorhabditis.
REFERENCE 1 (bases 1 to 2046)
AUTHORS Sulson,J.E. and Waterston,R.
CONSRTM Caenorhabditis elegans Sequencing Consortium
TITLE Genome sequence of the nematode C. elegans: a platform for
investigating biology
JOURNAL Science 282 (5396), 2012-2018 (1998)
PUBMED 9851916
REMARK Erratum:[Science 1999 Jan 1;283(5398):35]
REFERENCE 2 (bases 1 to 2046)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (22-NOV-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 2046)
AUTHORS WormBase.
CONSRTM WormBase Consortium
TITLE Direct Submission
JOURNAL Submitted (29-OCT-2023) WormBase Group, European Bioinformatics
Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org
REFERENCE 4 (bases 1 to 2046)
AUTHORS Sulson,J.E. and Waterston,R.
TITLE Direct Submission
JOURNAL Submitted (03-MAR-2003) Nematode Sequencing Project: Sanger
Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute
at Washington University, St. Louis, MO 63110, USA
COMMENT REVIEWED REFSEQ: This record has been curated by WormBase. This
record is derived from an annotated genomic sequence (NC_003282).
On Apr 15, 2020 this sequence version replaced NM_171332.4.
FEATURES Location/Qualifiers
source 1..2046
/organism="Caenorhabditis elegans"
/mol_type="mRNA"
/strain="Bristol N2"
/db_xref="taxon:6239"
/chromosome="IV"
gene 1..2046
/gene="soc-2"
/locus_tag="CELE_AC7.2"
/db_xref="GeneID:177286"
/db_xref="WormBase:WBGene00004929"
CDS 9..1685
/gene="soc-2"
/locus_tag="CELE_AC7.2"
/standard_name="AC7.2b"
/note="Confirmed by transcript evidence"
/codon_start=1
/product="Leucine-rich repeat protein soc-2"
/protein_id="NP_741391.2"
/db_xref="EnsemblGenomes-Gn:WBGene00004929"
/db_xref="EnsemblGenomes-Tr:AC7.2a.1"
/db_xref="EnsemblGenomes-Tr:AC7.2a.2"
/db_xref="GeneID:177286"
/db_xref="WormBase:WBGene00004929"
/translation="MRVLQKLGFCLEKQKRETPPTTANTGVSATKRVSVIATDRDRAY
FLRQKNMRNNKGHAEKDLLKEFHKCKEAQDQRLDLSSIEITSIPSPIKELTQLTELFL
YKNKLTCLPTEIGQLVNLKKLGLSENALTSLPDSLASLESLETLDLRHNKLTEVPSVI
YKIGSLETLWLRYNRIVAVDEQIGNLSKLKMLDVRENKIRELPSAIGKLTSLVVCLVS
YNHLTRVPEEIGDCHSLTQLDLQHNDLSELPYSIGKLVNLVRIGIRYNKIRCIPSELE
SCQQLEEFIVESNHLQLLPPNLLTMLPKIHTVNLSRNELTAFPAGGPQQFVSTVTINM
EHNQISKIPIGIFSKATRLTKLNLKENELVSLPLDMGSWTSITELNLSTNQLKVLPED
IEKLVNLEILVLSNNQLKKLPNQIGNLNKLRELDLEENELETVPTEIGFLQHLTKLWV
QSNKILTLPRSIGNLCSLQDLRLGENNLTAIPEEIGHLDSLKSLYLNDNSSLHNLPFE
LALCQSLEIMSIENSPLSQIPPEITAGGPSLVIQYLKMQGPYRGVVMNSQ"
ORIGIN
1 aaaaaataat gcgtgtactt caaaaattgg gattctgcct tgagaaacag aaacgggaaa
61 cacctccaac tacagcaaat acgggagtat cggcgacgaa gcgtgtttcc gtgattgcca
121 ctgacagaga tagagcctat tttttgaggc agaagaatat gaggaacaat aagggccacg
181 ctgaaaagga cctcctaaaa gaatttcaca aatgcaaaga ggcgcaggat cagagattag
241 atttgagctc aatcgagatc acgagcattc cgtcgccgat caaagagctc acacagctga
301 cagaattgtt cttgtacaag aacaagttga cgtgcttgcc aacggaaata ggtcaactgg
361 tgaatctcaa gaaacttggt ctctctgaaa atgcgcttac atctcttccg gattcacttg
421 cttctctgga atcactggaa acattggatt tacggcacaa caagttgaca gaggttccat
481 cggtcattta caaaatcggg tcgctcgaaa cattatggct gaggtacaat cgaattgtgg
541 cagttgacga acaaattgga aatctgtcaa aattgaaaat gttggatgtt cgtgagaata
601 agattcgaga gttaccatct gcaattggaa aactgacgtc actggttgtg tgtcttgtct
661 cttataatca tttaacacgg gttcctgaag aaatcggtga ctgccattcc ctgactcaac
721 tcgatcttca acacaacgac ctctcagaac taccgtactc aataggaaaa ctcgtgaatc
781 ttgttcgaat cggaattcga tacaataaga ttcgatgtat tccaagtgaa ttggaaagtt
841 gtcagcagct cgaggaattt attgtagaga gcaatcattt gcaattacta ccgccaaacc
901 tgctcacaat gcttccaaaa atccacacag tgaatctctc acggaacgag ttgactgcat
961 tcccggcagg cggacctcaa caatttgtgt ccacagtcac aattaatatg gaacacaatc
1021 agatttcaaa gattccaatc ggaatattct cgaaagcaac acgattaaca aaactgaatt
1081 tgaaggaaaa tgagctggtc tcgttgcctt tggacatggg atcttggaca tcaatcaccg
1141 agctcaatct ctccacaaat caattgaaag ttttgccaga agatatcgaa aaacttgtga
1201 atctggaaat ccttgtgctg tccaacaatc aactgaaaaa gcttccaaat caaataggaa
1261 atctcaataa actccgcgag ctggatctcg aggaaaatga attggagacc gttccaactg
1321 aaatcggatt tttacaacat cttacgaaac tgtgggttca gtcaaacaag attttgactc
1381 taccaagatc cattggaaat ttgtgttcgc ttcaagattt gcgattggga gagaacaatt
1441 tgacagcgat tcccgaggaa attggccacc tcgactcatt gaaatctcta tacctcaacg
1501 acaactcctc tcttcacaat ttgccatttg agttggcact gtgccaatcg cttgaaataa
1561 tgtcaatcga aaactctcca ctttctcaga ttccacctga aatcactgct ggtggtcctt
1621 cacttgtgat acaatatctt aaaatgcaag gtccctatcg aggagttgtg atgaattctc
1681 aataattccc ccaatattct acttcaattc aaaaaaccat gtttcttgtc ttctacgggt
1741 tgcaaaactt gttttcgctc caaaatgttt tttttaatgt tttatttttt tgaaactgaa
1801 aagtcacttt ttcttctcaa aatatcatat taatatggta cttttccaat tcaattgtgt
1861 cttcccgatt ttttctctca aaaaaggttt tgtttattaa ttattttcaa attgtgaatt
1921 ccaaaaactt tctctcgtcc agtttcccag ctcccccgtt gcatttcccc cttcttccaa
1981 aaattttttt ataatttttt gttccaaaat tcttgtttct tttaatgaat tctaacaaaa
2041 tcagaa
//