ID   EU914103; SV 1; linear; genomic RNA; STD; VRL; 3383 BP.
XX
AC   EU914103;
XX
DT   31-JUL-2009 (Rel. 101, Created)
DT   04-DEC-2009 (Rel. 103, Last updated, Version 2)
XX
DE   Morogoro virus strain 3017/2004 segment S glycoprotein precursor (GPC) and
DE   nucleoprotein (NP) genes, complete cds.
XX
KW   .
XX
OS   Morogoro mammarenavirus
OC   Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes;
OC   Bunyavirales; Arenaviridae; unclassified Arenaviridae.
XX
RN   [1]
RP   1-3383
RX   DOI; 10.3201/eid1512.090864.
RX   PUBMED; 19961688.
RA   Gunther S., Hoofd G., Charrel R., Roser C., Becker-Ziaja B., Lloyd G.,
RA   Sabuni C., Verhagen R., van der Groen G., Kennis J., Katakweba A.,
RA   Machang'u R., Makundi R., Leirs H.;
RT   "Mopeia virus-related arenavirus in natal multimammate mice, Morogoro,
RT   Tanzania";
RL   Emerg. Infect. Dis. 15(12):2008-2012(2009).
XX
RN   [2]
RP   1-3383
RA   Gunther S., Leirs H.;
RT   ;
RL   Submitted (20-JUL-2008) to the INSDC.
RL   Virology, Bernhard-Nocht-Institute for Tropical Medicine,
RL   Bernhard-Nocht-Str. 74, Hamburg 20359, Germany
XX
DR   MD5; 99aef1c0c20390d384ff2d02cd90f3b3.
DR   EuropePMC; PMC3044542; 19961688.
DR   EuropePMC; PMC3053032; 21056893.
DR   EuropePMC; PMC4217180; 25352624.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3383
FT                   /organism="Morogoro mammarenavirus"
FT                   /segment="S"
FT                   /host="Mastomys natalensis"
FT                   /strain="3017/2004"
FT                   /mol_type="genomic RNA"
FT                   /country="Tanzania:Morogoro"
FT                   /collection_date="2004"
FT                   /db_xref="taxon:573900"
FT   gene            46..1515
FT                   /gene="GPC"
FT   CDS             46..1515
FT                   /codon_start=1
FT                   /gene="GPC"
FT                   /product="glycoprotein precursor"
FT                   /db_xref="GOA:C6ZK00"
FT                   /db_xref="InterPro:IPR001535"
FT                   /db_xref="PDB:5NFF"
FT                   /db_xref="UniProtKB/TrEMBL:C6ZK00"
FT                   /protein_id="ACJ24973.1"
FT                   /translation="MGQIVTFFQEVPHIIEEVMNIVLITLSLLAILKGIYNIMTCGIIG
FT                   LLTFLFLCGRSCSSIYKDNYQFLSLDLDMSGLNATMPLSCSKNNSHHYIQVRNDTGLEL
FT                   TLTNTSLLDHKFCNLSDAHKRNLYDKALMSIVTTFHLNIPNFNQYEVMSCDFNGGKITV
FT                   QYNLSHSSYVDAANHCGTIANGIMDTFRRMYWSNALSPSEYISGTTCIQTAYQYLIIQN
FT                   TTWEDHCVFSRPSPMGFLSLLSQRTKNFYISRRLLGLFTWTLSDSEGNDMPGGYCLTRS
FT                   MLIGMDLKCFGNTAVAKCNQQHDEEFCDMLRLFDFNKQAISRLKSEAQQSLNLITKAVN
FT                   SLINDQLIMKNHLRDLMGIPYCNYTKFWYLNDTRSGATSLPRCWLISNGSYLNETQFSR
FT                   DIEQEANNMLTDMLRKEYEKRQSTTPLGLVDLFVFSTSFYLISVFLHLIKIPTHRHIRG
FT                   KPCPKPHRINHMAICSCGFYKQPGIPTQWKR"
FT   gene            complement(1620..3332)
FT                   /gene="NP"
FT   CDS             complement(1620..3332)
FT                   /codon_start=1
FT                   /gene="NP"
FT                   /product="nucleoprotein"
FT                   /db_xref="GOA:C6ZK01"
FT                   /db_xref="InterPro:IPR000229"
FT                   /db_xref="InterPro:IPR035083"
FT                   /db_xref="InterPro:IPR035084"
FT                   /db_xref="InterPro:IPR038115"
FT                   /db_xref="UniProtKB/TrEMBL:C6ZK01"
FT                   /protein_id="ACJ24974.1"
FT                   /translation="MSNSKEVKSFLWTQSLRRELSGFCTNVKVQVIKDAQALLHGLDFS
FT                   EVSNVQRLMRKEKRDDSDLKRLRDLNQAVNNLVELKSTQQKNVLRVGTLTPDDLLVLAA
FT                   DLDRLKAKVIRSERPLAAGVYMGNLTAQQLEQRKVLLQMVGMGGGPLGREPPRDGIVRI
FT                   WDVRNPELLNNQFGTMPSLTIACMCKQGQTDLNDVIQSLTDLGLVYTAKYPNMSDLEKL
FT                   TQAHPILGVIEPKKSAINISGYNFSLSAAVKAGACLIDGGNMLETIRVSARNLDGILKA
FT                   TLKVKRSLGMFVSDTPGDRNPYENLLYKLCLSGEGWPYIASRTSILGRAWDNTTVDLSG
FT                   DGTQAPKPAGGNSTRVAQAQGMSAGLTYSQTMELKDCMLQLDPNAKTWVDIEGRAEDPV
FT                   EVAIYQPSNGQYIHFYREPTDIKQFKQDAKHSHGIDIQDLFSVQPGLTSAVIEGLPRNM
FT                   VLTCQGVDDIRKLLDSQGRRDIKLIDVSMQKEEARKYEDSIWDEYKHLCTMHTGIVTQK
FT                   KKRGGKEEVTPHCALMDCLMFEAATVGSSKLTTPRPVLSKDLVFRMSTPKVVL"
XX
SQ   Sequence 3383 BP; 874 A; 839 C; 701 G; 969 T; 0 other;
     atttttggtt gcgctttgct ttgttcacaa attagtgatc aagaaatggg tcagattgtg        60
     accttctttc aggaagtgcc acatataata gaggaagtga tgaatatagt cctcataact       120
     ctatcactct tggccattct aaaaggaatc tacaacatca tgacctgcgg catcattggc       180
     ctcctgacat tcctcttttt gtgtgggaga tcctgctcaa gcatctataa agacaattat       240
     cagtttctct cactggatct ggatatgtca ggactcaatg caacaatgcc tctctcttgc       300
     tcaaagaaca attcacacca ctacatacaa gtgaggaacg atacagggct agagttaact       360
     ctcacaaaca ccagcctctt ggatcacaag ttttgtaacc tttcagatgc tcacaagagg       420
     aacctctatg acaaggcttt aatgtcaatc gtaacaacat tccacctcaa tatccctaat       480
     ttcaatcagt atgaggtcat gtcttgtgac ttcaatggtg ggaaaatcac tgtccagtac       540
     aacctatcgc actcatccta tgttgatgct gcaaaccact gtggaaccat cgccaatggt       600
     atcatggata cctttaggag gatgtactgg tccaatgcac tgtcaccttc tgaatacatc       660
     tctggaacaa catgcatcca gacagcatat cagtacttga tcattcagaa cacaacgtgg       720
     gaggatcact gtgttttctc caggccatcc cccatgggat tccttagcct attatctcaa       780
     cggacaaaga acttttacat atcaaggagg ctgttgggct tgttcacttg gactctaagt       840
     gattctgaag gaaatgatat gcctggtggc tactgtctga caagatctat gctaataggg       900
     atggacctga agtgctttgg caacactgct gttgcgaagt gcaatcaaca acatgatgaa       960
     gaattctgtg atatgctcag attatttgat ttcaacaagc aagccatctc aaggcttaag      1020
     tcagaagcac agcagagcct aaacctaata acaaaggcag tgaactcatt gatcaatgat      1080
     cagctgataa tgaaaaatca tcttagggat ttgatgggga ttccttattg caattataca      1140
     aaattttggt acttgaacga cacgaggtca ggcgccactt cgctgccaag atgttggttg      1200
     atttctaatg ggtcttactt aaatgaaact caatttagta gggacataga gcaggaggcc      1260
     aacaacatgc tgacagatat gttaaggaaa gaatatgaga aaaggcaaag cacaacacca      1320
     cttggtttag tggacctctt tgtcttctct acaagcttct atcttataag tgtatttttg      1380
     cacctcatta aaatcccaac acatcgacat ataagaggca agccctgccc caaaccacac      1440
     aggattaatc acatggcaat ctgctcatgt ggcttttaca aacaacctgg catcccaact      1500
     caatggaaga gatagtccga tggaccctcc gagacgcacc gcccgaggcg gtgcgtctcg      1560
     ggggcccgcc ggcccccgca gcccaccacc ttgcggtggt gggctgcggg ggtgtccatt      1620
     tacaggacaa cctttggtgt gctcatccta aagaccagat ccttgctcag cactggcctt      1680
     ggtgttgtca gcttcgaact ccccacagta gcggcctcaa acattagaca gtccatcaag      1740
     gcacaatggg gagtcacttc ctccttgcca cccctctttt tcttctgagt gactatgcct      1800
     gtgtgcatag tgcagaggtg cttgtactca tcccagatgc tgtcttcata tttccttgcc      1860
     tcttctttct gcatagacac atcgatcaat ttgatgtctc ttctcccttg tgagtccaac      1920
     agcttcctta tgtcatcaac accttggcag gtgagcacca tgtttctagg tagcccctct      1980
     ataacagcac ttgtcagccc tggctgaaca gagaacaagt cttggatgtc aatcccatga      2040
     gaatgttttg cgtcctgctt gaattgcttg atgtcagtgg gttctctata aaagtgaatg      2100
     tactgcccgt tggatggctg atatatggcc acttcaacag ggtcctcagc tctaccctct      2160
     atatcgaccc aggttttggc atttgggtcc aattgcagca tgcagtcctt cagctccatt      2220
     gtctgagaat aggttaatcc agctgacatg ccctgtgcct gtgcaaccct ggtagaattg      2280
     cctccagcag gcttgggtgc ctgtgtcccg tccccactta ggtcaactgt tgtgttgtcc      2340
     catgccctcc cgaggatgga tgttcttgat gcaatgtagg gccaaccctc cccagagaga      2400
     cacaacttgt aaaggagatt ctcataggga ttcctatccc ctggcgtgtc tgagacgaac      2460
     atgcccaaag acctctttac ctttagggta gctttgagga ttccatcaag gtttctggct      2520
     gaaaccctga tggtttcaag catgtttcca ccatcaatca aacatgcacc agccttcaca      2580
     gctgcagaca ggctgaagtt gtatccagat atgtttatgg cacttttttt gggttctata      2640
     actcccagga ttgggtgtgc ttgggtcagt ttctccagat cagacatatt cggatatttt      2700
     gctgtgtaca caagaccaag gtctgttaac gactggatga catcatttaa gtcagtctga      2760
     ccttgtttgc acatgcatgc tattgttaga cttggcatcg tgccaaactg attgttaagc      2820
     aattctgggt ttctaacatc ccagattctt acaatgccat ctctaggtgg ctccctaccc      2880
     agtggaccac ctcccattcc aaccatttgc aacagaacct ttctttgttc taattgttgt      2940
     gctgtcagat ttcccatgta gacacctgct gccagcggcc tttcacttct gatgaccttg      3000
     gctttgagtc tgtccaagtc ggctgcaaga acaagcaagt catcaggagt taatgttcca      3060
     acccttaaga cattcttttg ctgagtggat ttcaactcaa caagattgtt aacagcctga      3120
     tttagatctc ttaatctttt tagatctgag tcatccctct tttctttcct catcaacctc      3180
     tgcacattgc tgacctctga gaaatccagc ccgtgaagaa gggcttgagc atccttgatg      3240
     acttgcactt tcacattcgt gcagaagcct gagagctctc tccttagact ctgagtccaa      3300
     aggaaagatt tcacctcctt tgagttagac atcttcacac cttgttcctc agcaacttga      3360
     tctgcaattg gcgcagtcaa taa                                              3383
//