Dbfetch
LOCUS NM_001027491 5611 bp mRNA linear INV 22-NOV-2023
DEFINITION Caenorhabditis elegans Collagen alpha-1(IV) chain (emb-9), mRNA.
ACCESSION NM_001027491
VERSION NM_001027491.7
DBLINK BioProject: PRJNA158
KEYWORDS RefSeq.
SOURCE Caenorhabditis elegans
ORGANISM Caenorhabditis elegans
Eukaryota; Metazoa; Ecdysozoa; Nematoda; Chromadorea; Rhabditida;
Rhabditina; Rhabditomorpha; Rhabditoidea; Rhabditidae; Peloderinae;
Caenorhabditis.
REFERENCE 1 (bases 1 to 5611)
AUTHORS Sulson,J.E. and Waterston,R.
CONSRTM Caenorhabditis elegans Sequencing Consortium
TITLE Genome sequence of the nematode C. elegans: a platform for
investigating biology
JOURNAL Science 282 (5396), 2012-2018 (1998)
PUBMED 9851916
REMARK Erratum:[Science 1999 Jan 1;283(5398):35]
REFERENCE 2 (bases 1 to 5611)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (22-NOV-2023) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 5611)
AUTHORS WormBase.
CONSRTM WormBase Consortium
TITLE Direct Submission
JOURNAL Submitted (29-OCT-2023) WormBase Group, European Bioinformatics
Institute, Cambridge, CB10 1SA, UK. Email: help@wormbase.org
REFERENCE 4 (bases 1 to 5611)
AUTHORS Sulson,J.E. and Waterston,R.
TITLE Direct Submission
JOURNAL Submitted (03-MAR-2003) Nematode Sequencing Project: Sanger
Institute, Hinxton, Cambridge CB10 1SA, UK and The Genome Institute
at Washington University, St. Louis, MO 63110, USA
COMMENT REVIEWED REFSEQ: This record has been curated by WormBase. This
record is derived from an annotated genomic sequence (NC_003281).
On May 27, 2020 this sequence version replaced NM_001027491.6.
FEATURES Location/Qualifiers
source 1..5611
/organism="Caenorhabditis elegans"
/mol_type="mRNA"
/strain="Bristol N2"
/db_xref="taxon:6239"
/chromosome="III"
gene 1..5611
/gene="emb-9"
/locus_tag="CELE_K04H4.1"
/db_xref="GeneID:176314"
/db_xref="WormBase:WBGene00001263"
CDS 50..5329
/gene="emb-9"
/locus_tag="CELE_K04H4.1"
/standard_name="K04H4.1a"
/note="Confirmed by transcript evidence"
/codon_start=1
/product="Collagen alpha-1(IV) chain"
/protein_id="NP_001022662.2"
/db_xref="EnsemblGenomes-Gn:WBGene00001263"
/db_xref="GeneID:176314"
/db_xref="WormBase:WBGene00001263"
/translation="MSRLSLLGLTAAVVLLSSFCQDRIHVDAAAACKGCAPPCVCPGT
KGERGNPGFGGEPGHPGAPGQDGPEGAPGAPGMFGAEGDFGDMGSKGARGDRGLPGSP
GHPGLQGLDGLPGLKGEEGIPGCNGTDGFPGMPGLAGPPGQSGQNGNPGRPGLSGPPG
EGGVNSQGRKGVKGESGRSGVPGLPGNSGYPGLKGAKGDPGPYGLPGFPGVSGLKGRM
GVRTSGVKGEKGLPGPPGPPGQPGSYPWASKPIEMEVLQGPVGPAGVKGEKGRDGPVG
PPGMLGLDGPPGYPGLKGQKGDLGDAGQRGKRGKDGVPGNYGEKGSQGEQGLGGTPGY
PGTKGGAGEPGYPGRPGFEGDCGPEGPLGEGTGEAGPHGAQGFDGVQGGKGLPGHDGL
PGPVGPRGPVGAPGAPGQPGIDGMPGYTEKGDRGEDGYPGFAGEPGLPGEPGDCGYPG
EDGLPGYDIQGPPGLDGQSGRDGFPGIPGDIGDPGYSGEKGFPGTGVNKVGPPGMTGL
PGEPGMPGRIGVDGYPGPPGNNGERGEDCGYCPDGVPGNAGDPGFPGMNGYPGPPGPN
GDHGDCGMPGAPGKPGSAGSDGLSGSPGLPGIPGYPGMKGEAGEIVGPMENPAGIPGL
KGDHGLPGLPGRPGSDGLPGYPGGPGQNGFPGLQGEPGLAGIDGKRGRQGSLGIPGLQ
GPPGDSFPGQPGTPGYKGERGADGLPGLPGAQGPRGIPAPLRIVNQVAGQPGVDGMPG
LPGDRGADGLPGLPGPVGPDGYPGTPGERGMDGLPGFPGLHGEPGMRGQQGEVGFNGI
DGDCGEPGLDGYPGAPGAPGAPGETGFGFPGQVGYPGPNGDAGAAGLPGPDGYPGRDG
LPGTPGYPGEAGMNGQDGAPGQPGSRGESGLVGIDGKKGRDGTPGTRGQDGGPGYSGE
AGAPGQNGMDGYPGAPGDQGYPGSPGQDGYPGPSGIPGEDGLVGFPGLRGEHGDNGLP
GLEGECGEEGSRGLDGVPGYPGEHGTDGLPGLPGADGQPGFVGEAGEPGTPGYRGQPG
EPGNLAYPGQPGDVGYPGPDGPPGLPGQDGLPGLNGERGDNGDSYPGNPGLSGQPGDA
GYDGLDGVPGPPGYPGITGMPGLKGESGLPGLPGRQGNDGIPGQPGLEGECGEDGFPG
SPGQPGYPGQQGREGEKGYPGIPGENGLPGLRGQDGQPGLKGENGLDGQPGYPGSAGQ
LGTPGDVGYPGAPGENGDNGNQGRDGQPGLRGESGQPGQPGLPGRDGQPGPVGPPGDD
GYPGAPGQDIYGPPGQAGQDGYPGLDGLPGAPGLNGEPGSPGQYGMPGLPGGPGESGL
PGYPGERGLPGLDGKRGHDGLPGAPGVPGVEGVPGLEGDCGEDGYPGAPGAPGSNGYP
GERGLPGVPGQQGRSGDNGYPGAPGQPGIKGPRGDDGFPGRDGLDGLPGRPGREGLPG
PMAMAVRNPPGQPGENGYPGEKGYPGLPGDNGLSGPPGKAGYPGAPGTDGYPGPPGLS
GMPGHGGDQGFQGAAGRTGNPGLPGTPGYPGSPGGWAPSRGFTFAKHSQTTAVPQCPP
GASQLWEGYSLLYVQGNGRASGQDLGQPGSCLSKFNTMPFMFCNMNSVCHVSSRNDYS
FWLSTDEPMTPMMNPVTGTAIRPYISRCAVCEVPTQIIAVHSQDTSVPQCPQGWSGMW
TGYSFVMHTAAGAEGTGQSLQSPGSCLEEFRAVPFIECHGRGTCNYYATNHGFWLSIV
DQDKQFRKPMSQTLKAGGLKDRVSRCQVCLKNR"
ORIGIN
1 aacgacacca atctttgggt aacgagaagt cgatcgccgt gactggaaga tgtcacgctt
61 atcgcttctc ggcttgacgg cagccgtagt gctactgtcg tcgttctgtc aagataggat
121 acatgtggat gctgccgccg catgcaaggg atgtgctcca ccatgtgttt gcccaggaac
181 caaaggagaa cgtggtaatc caggatttgg tggtgaacca ggacatcctg gagcaccagg
241 acaagatgga ccagaaggag caccaggagc tccaggaatg tttggagccg aaggagattt
301 tggagatatg ggatctaagg gagctcgtgg agatcgtggt cttcctggat cgccaggtca
361 tccaggtctt caaggacttg acggattacc aggactgaaa ggagaagaag gaattccagg
421 atgcaatgga acagatggtt tccctggaat gcccggactt gctggacctc cagggcaatc
481 tggacaaaac ggaaaccctg gacgaccagg actctccgga ccaccaggag aaggaggtgt
541 caattcacaa ggacgcaaag gagttaaagg agaatctgga agatcaggag ttccaggtct
601 tccaggaaac tctggttatc ctggattgaa aggagcaaag ggagatccag gaccgtacgg
661 tcttcccgga ttccccggag tttctggatt gaagggaaga atgggagtga gaacttctgg
721 tgttaaggga gagaagggtt tgcctggacc accaggacca ccaggacaac cgggatctta
781 tccatgggct tcaaagccaa ttgagatgga agtcttgcaa ggacctgtcg gaccagctgg
841 agtgaaagga gaaaagggac gtgatggacc agtaggacca ccaggaatgc tcggacttga
901 cggaccacca ggatatcctg gattaaaggg acagaaggga gatttgggag atgctggaca
961 acgtggtaaa cgtggaaagg acggagttcc aggaaattat ggagaaaagg gatcccaagg
1021 agaacaagga cttggaggaa ctccaggata cccaggaact aagggagggg ctggagaacc
1081 aggataccca ggaagaccag gtttcgaagg agactgtgga ccggaaggac cacttggaga
1141 aggaactggt gaggctggac cacatggagc tcaaggattc gacggagttc aaggaggcaa
1201 aggattgcca ggacatgatg gtctcccagg accagtaggt ccaagaggtc cagttggagc
1261 tccaggagcc ccaggacagc ccggtatcga tggaatgcca ggatacaccg aaaaaggaga
1321 tagaggagag gacggttacc caggattcgc tggcgaacca ggactcccag gagaaccagg
1381 agattgtggt tatccaggag aggatggtct tccaggatac gatattcaag gaccacctgg
1441 acttgacgga caatccggaa gagatggatt ccctggtatt ccaggagaca tcggagatcc
1501 aggttattct ggagaaaagg gattcccagg aactggagtt aacaaagttg gaccaccagg
1561 aatgaccggt ttgccaggag agccaggaat gccaggacgt attggagttg acggttatcc
1621 aggaccacca ggaaacaatg gagaaagagg agaagactgt ggatattgtc cagatggtgt
1681 tccaggaaat gctggagatc caggattccc cggaatgaat ggatatccag gaccaccagg
1741 acctaatggt gatcatggag actgcggaat gccaggagct ccaggaaagc caggatccgc
1801 tggatcagat ggactttcag gatcaccagg acttccagga attccaggct acccaggaat
1861 gaagggagaa gccggagaga tcgttggacc aatggaaaac ccagctggaa ttccaggtct
1921 taagggagat catggtcttc caggactacc aggacgacca ggaagtgacg gactccctgg
1981 ttacccagga ggaccaggac aaaatggatt cccaggactc caaggtgaac caggtcttgc
2041 tggaattgat gggaagagag gacgtcaagg atctctcgga atcccaggac ttcaaggacc
2101 acctggagac tctttcccag gacagccagg aacacctggt tacaagggag aacgaggtgc
2161 tgatggtctt ccaggacttc caggagctca aggaccacgg ggaattccag caccattgag
2221 aattgtcaat caagttgctg gacaaccagg tgttgacggc atgccaggtc ttccaggaga
2281 tagaggagct gatggtcttc caggactccc aggaccggtt ggaccagacg gatacccagg
2341 aacaccagga gagcgcggta tggatggtct tccaggattc ccaggactcc atggagagcc
2401 tggaatgcgt ggacagcaag gagaagttgg tttcaacgga attgatgggg actgtggaga
2461 gccaggtctt gacggatacc ccggagcacc aggagctcca ggagctccag gagagactgg
2521 atttggcttc ccaggacaag tcggataccc aggaccaaat ggagatgctg gggcagctgg
2581 acttccagga ccagacggct acccaggaag agacggtctt ccaggaactc ctggataccc
2641 aggagaagca ggaatgaacg gacaagatgg agccccagga cagccaggat ctcgtggaga
2701 gtctggactt gttggaattg atggaaagaa aggacgcgat ggaaccccag gaacacgtgg
2761 acaagacggt ggaccgggat attccggaga ggctggagcc ccaggacaaa atggaatgga
2821 tggataccca ggagcaccag gagatcaagg atatccagga tccccagggc aagacggata
2881 cccagggcca agtggaattc caggagagga cggtcttgtc ggattcccag gacttcgtgg
2941 tgaacacgga gacaatggtc ttccaggatt ggagggagaa tgcggagagg aaggatccag
3001 aggacttgat ggtgttccag gttacccagg agaacacgga accgacgggc taccagggct
3061 tccaggagct gatggtcaac caggttttgt tggagaagcc ggagagccag gtacaccagg
3121 atacagaggt cagccaggag aaccaggaaa cctcgcatat ccaggacagc caggagatgt
3181 tggataccca ggaccagatg gaccaccagg tcttccagga caagacggcc ttccaggact
3241 gaatggtgaa cgaggagaca atggagatag ttatccagga aacccaggat tgagcggcca
3301 accaggagat gctggatatg acggacttga tggagtccca ggaccaccag gatacccagg
3361 aatcactggg atgccagggc tcaagggaga atctggactt ccaggacttc caggacgtca
3421 aggaaatgac ggcattccag gtcaaccagg acttgaagga gaatgtggtg aagatgggtt
3481 cccaggatct ccagggcaac caggttatcc tgggcagcaa ggacgtgaag gagagaaggg
3541 atatccagga attccaggag aaaatggtct ccctggtctt cgtggacaag acggccagcc
3601 aggtctcaag ggagaaaatg gtttggatgg tcagccaggt tatccaggat ccgcaggaca
3661 attgggaact ccaggagacg ttggatatcc aggagcacct ggagaaaatg gagacaacgg
3721 aaatcaggga cgtgatggac aacctggact tcgtggagaa tcaggtcagc caggacagcc
3781 aggattgcca ggaagagatg gtcagccagg accagttgga ccaccaggag acgatggata
3841 cccaggagca cccggacaag acatatatgg cccaccaggt caagctggac aagatggcta
3901 cccaggactt gatggactcc caggagcccc aggacttaat ggagaaccag gatctccagg
3961 acaatacgga atgccaggtc ttccaggagg cccaggagaa tcaggactac caggatatcc
4021 aggagaacgt ggtcttcccg gactcgacgg aaagcgcggt catgatggac tcccaggagc
4081 accaggtgta ccaggagttg agggagttcc aggacttgaa ggagattgtg gagaagacgg
4141 ttatccagga gctccaggag ccccaggaag caacggatat ccaggagagc gtggtcttcc
4201 aggtgtccca ggacaacaag gacgatctgg tgacaacggt tatccaggag caccaggaca
4261 acctggaatc aagggaccac gtggagatga tggattccca ggacgcgacg gactcgacgg
4321 acttccaggt cgaccaggac gtgaaggatt gcccggacca atggctatgg cagttagaaa
4381 cccgccaggc caaccaggag aaaatggtta tccaggagag aagggatacc caggacttcc
4441 aggagataac ggactttccg gaccaccagg aaaagccgga tacccaggag ctccaggaac
4501 agacggctat ccaggaccac caggtctttc aggaatgcct ggacacggtg gagatcaagg
4561 attccaagga gctgctggac gtactggtaa tccaggactt ccaggtactc caggataccc
4621 aggatcacca ggaggatggg caccaagtcg aggattcact tttgccaagc actctcagac
4681 taccgcagta ccacagtgcc caccaggagc ttctcaactt tgggaaggat attctcttct
4741 ttacgttcaa ggaaacggtc gtgccagtgg acaagatctt ggtcaaccag gatcttgcct
4801 ctccaaattc aacacaatgc cattcatgtt ctgtaatatg aacagcgtct gccacgtttc
4861 cagccgtaac gattactcgt tctggttgtc aactgacgag ccaatgacac caatgatgaa
4921 tccagtcacc ggaacagcaa ttcgtccata catttctcgt tgtgccgtct gtgaagttcc
4981 aactcaaatc atcgcagttc actctcaaga cacatcagtt ccacaatgtc cacaaggatg
5041 gtcgggtatg tggaccggat attcctttgt catgcacact gccgctggag ccgaaggaac
5101 tggacaaagt cttcaatcgc ctggttcctg tctcgaagag ttccgtgccg ttccattcat
5161 tgagtgccac ggaagaggaa catgcaatta ttatgccacc aatcacggat tctggctttc
5221 catcgttgat caggacaagc aattccgtaa accaatgtca cagacactca aagctggagg
5281 acttaaagat cgtgtatcca gatgccaagt gtgtctcaag aaccgataat catcccaatc
5341 atcactcgct ttttactatt tatcaagttc tacttaaatc tttctttata ttttctctct
5401 attaaatatc aaccattttc tatgcttttt acaatgtttc ttcaacaaaa aatcttccat
5461 actaacaaaa ctgccaaaag agagcaagtt ctgtgtgata cgaattactc ctgccccctt
5521 tgaaaacaag attctgttat tttattgtat ttccctaatg ttcgtaatct gaaaataaaa
5581 atttatttga tttgaataaa atttttgaac t
//