LOCUS NM_001026088 2128 bp mRNA linear INV 22-NOV-2023
DEFINITION Caenorhabditis elegans Leucine-rich repeat protein soc-2 (soc-2),
mRNA.
ACCESSION NM_001026088
VERSION NM_001026088.5
DBLINK BioProject: PRJNA158
KEYWORDS RefSeq.
SOURCE Caenorhabditis elegans
ORGANISM Caenorhabditis elegans
Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
Caenorhabditis.
REFERENCE 1 (bases 1 to 2128)
AUTHORS Sulston,J.E. and Waterston,R.
CONSRTM Caenorhabditis elegans Sequencing Consortium
TITLE Genome sequence of the nematode C. elegans: a platform for
investigating biology
JOURNAL Science 282 (5396), 2012-2018 (1998)
PUBMED 9851916
REMARK Erratum:[Science 1999 Jan 1;283(5398):35]
REFERENCE 2 (bases 1 to 2128)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (22-NOV-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 2128)
AUTHORS WormBase.
CONSRTM WormBase Consortium
TITLE Direct Submission
JOURNAL Submitted (29-OCT-2023) WormBase Group, European Bioinformatics
Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org
REFERENCE 4 (bases 1 to 2128)
AUTHORS Sulston,J.E. and Waterston,R.
TITLE Direct Submission
JOURNAL Submitted (03-MAR-2003) Nematode Sequencing Project: Sanger
Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute
at Washington University, St. Louis, MO 63110, USA
COMMENT REVIEWED REFSEQ: This record has been curated by WormBase. This
record is derived from an annotated genomic sequence (NC_003282).
On Feb 2, 2021 this sequence version replaced NM_001026088.4.
FEATURES Location/Qualifiers
source 1..2128
/organism="Caenorhabditis elegans"
/mol_type="mRNA"
/strain="Bristol N2"
/db_xref="taxon:6239"
/chromosome="IV"
gene 1..2128
/gene="soc-2"
/locus_tag="CELE_AC7.2"
/db_xref="GeneID:177286"
/db_xref="WormBase:WBGene00004929"
CDS 88..1767
/gene="soc-2"
/locus_tag="CELE_AC7.2"
/standard_name="AC7.2a"
/note="Confirmed by transcript evidence"
/codon_start=1
/product="Leucine-rich repeat protein soc-2"
/protein_id="NP_001021259.1"
/db_xref="GeneID:177286"
/db_xref="WormBase:WBGene00004929"
/translation="METSKEFEFRPAKETSRSKSPGGIVGRLSNFARNKARHSLSEKG
SNSVGGSGGAGFDKPRKDLLKEFHKCKEAQDQRLDLSSIEITSIPSPIKELTQLTELF
LYKNKLTCLPTEIGQLVNLKKLGLSENALTSLPDSLASLESLETLDLRHNKLTEVPSV
IYKIGSLETLWLRYNRIVAVDEQIGNLSKLKMLDVRENKIRELPSAIGKLTSLVVCLV
SYNHLTRVPEEIGDCHSLTQLDLQHNDLSELPYSIGKLVNLVRIGIRYNKIRCIPSEL
ESCQQLEEFIVESNHLQLLPPNLLTMLPKIHTVNLSRNELTAFPAGGPQQFVSTVTIN
MEHNQISKIPIGIFSKATRLTKLNLKENELVSLPLDMGSWTSITELNLSTNQLKVLPE
DIEKLVNLEILVLSNNQLKKLPNQIGNLNKLRELDLEENELETVPTEIGFLQHLTKLW
VQSNKILTLPRSIGNLCSLQDLRLGENNLTAIPEEIGHLDSLKSLYLNDNSSLHNLPF
ELALCQSLEIMSIENSPLSQIPPEITAGGPSLVIQYLKMQGPYRGVVMNSQ"
ORIGIN
1 acactggatc acgcgcacgc cggttccaag agtgtctcat cggcccgatt accggacgcg
61 ttggcactgt gtctgaaggc tgcgcgaatg gagacatcga aggagttcga attccgtccg
121 gccaaggaga cgtcacgctc caagagtccc ggtggaatcg tcggaagact ttcgaatttt
181 gcgcgaaaca aggcgaggca ttcgttgagt gaaaaaggtt caaattcggt tggtggaagt
241 ggtggagcag gttttgataa accgagaaag gacctcctaa aagaatttca caaatgcaaa
301 gaggcgcagg atcagagatt agatttgagc tcaatcgaga tcacgagcat tccgtcgccg
361 atcaaagagc tcacacagct gacagaattg ttcttgtaca agaacaagtt gacgtgcttg
421 ccaacggaaa taggtcaact ggtgaatctc aagaaacttg gtctctctga aaatgcgctt
481 acatctcttc cggattcact tgcttctctg gaatcactgg aaacattgga tttacggcac
541 aacaagttga cagaggttcc atcggtcatt tacaaaatcg ggtcgctcga aacattatgg
601 ctgaggtaca atcgaattgt ggcagttgac gaacaaattg gaaatctgtc aaaattgaaa
661 atgttggatg ttcgtgagaa taagattcga gagttaccat ctgcaattgg aaaactgacg
721 tcactggttg tgtgtcttgt ctcttataat catttaacac gggttcctga agaaatcggt
781 gactgccatt ccctgactca actcgatctt caacacaacg acctctcaga actaccgtac
841 tcaataggaa aactcgtgaa tcttgttcga atcggaattc gatacaataa gattcgatgt
901 attccaagtg aattggaaag ttgtcagcag ctcgaggaat ttattgtaga gagcaatcat
961 ttgcaattac taccgccaaa cctgctcaca atgcttccaa aaatccacac agtgaatctc
1021 tcacggaacg agttgactgc attcccggca ggcggacctc aacaatttgt gtccacagtc
1081 acaattaata tggaacacaa tcagatttca aagattccaa tcggaatatt ctcgaaagca
1141 acacgattaa caaaactgaa tttgaaggaa aatgagctgg tctcgttgcc tttggacatg
1201 ggatcttgga catcaatcac cgagctcaat ctctccacaa atcaattgaa agttttgcca
1261 gaagatatcg aaaaacttgt gaatctggaa atccttgtgc tgtccaacaa tcaactgaaa
1321 aagcttccaa atcaaatagg aaatctcaat aaactccgcg agctggatct cgaggaaaat
1381 gaattggaga ccgttccaac tgaaatcgga tttttacaac atcttacgaa actgtgggtt
1441 cagtcaaaca agattttgac tctaccaaga tccattggaa atttgtgttc gcttcaagat
1501 ttgcgattgg gagagaacaa tttgacagcg attcccgagg aaattggcca cctcgactca
1561 ttgaaatctc tatacctcaa cgacaactcc tctcttcaca atttgccatt tgagttggca
1621 ctgtgccaat cgcttgaaat aatgtcaatc gaaaactctc cactttctca gattccacct
1681 gaaatcactg ctggtggtcc ttcacttgtg atacaatatc ttaaaatgca aggtccctat
1741 cgaggagttg tgatgaattc tcaataattc ccccaatatt ctacttcaat tcaaaaaacc
1801 atgtttcttg tcttctacgg gttgcaaaac ttgttttcgc tccaaaatgt tttttttaat
1861 gttttatttt tttgaaactg aaaagtcact ttttcttctc aaaatatcat attaatatgg
1921 tacttttcca attcaattgt gtcttcccga ttttttctct caaaaaaggt tttgtttatt
1981 aattattttc aaattgtgaa ttccaaaaac tttctctcgt ccagtttccc agctcccccg
2041 ttgcatttcc cccttcttcc aaaaattttt ttataatttt ttgttccaaa attcttgttt
2101 cttttaatga attctaacaa aatcagaa
//