ID   AY012687; SV 1; linear; genomic RNA; STD; VRL; 3343 BP.
XX
AC   AY012687;
XX
DT   02-JAN-2001 (Rel. 66, Created)
DT   11-DEC-2001 (Rel. 70, Last updated, Version 2)
XX
DE   Allpahuayo virus strain CLHP-2472 nucleocapsid protein (N) and glycoprotein
DE   precursor (GPC) genes, complete cds.
XX
KW   .
XX
OS   Allpahuayo mammarenavirus
OC   Viruses; Riboviria; Negarnaviricota; Polyploviricotina; Ellioviricetes;
OC   Bunyavirales; Arenaviridae; Mammarenavirus.
XX
RN   [1]
RP   1-3343
RX   DOI; 10.1006/viro.2000.0803.
RX   PUBMED; 11384226.
RA   Moncayo A.C., Hice C.L., Watts D.M., Travassos de Rosa A.P.A., Guzman H.,
RA   Russell K.L., Calampa C., Gozalo A., Popov V.L., Weaver S.C., Tesh R.B.;
RT   "Allpahuayo virus: a newly recognized arenavirus (arenaviridae) from
RT   arboreal rice rats (oecomys bicolor and oecomys paricola) in northeastern
RT   peru";
RL   Virology 284(2):277-286(2001).
XX
RN   [2]
RP   1-3343
RA   Moncayo A.C., Hice C.L., Watts D.M., Travassos de Rosa A.P.A., Guzman H.,
RA   Russell K.L., Calampa C., Gozalo A., Popov V.L., Weaver S.C., Tesh R.B.;
RT   ;
RL   Submitted (07-NOV-2000) to the INSDC.
RL   Pathology, University of Texas Medical Branch, 301 University Blvd.,
RL   Galveston, TX 77555-0609, USA
XX
DR   MD5; 3592e2869a597603c01181cbad6e3b62.
DR   EuropePMC; PMC2374749; 18304671.
DR   EuropePMC; PMC2658809; 19343214.
DR   EuropePMC; PMC2680969; 19478873.
DR   EuropePMC; PMC2714187; 18454597.
DR   EuropePMC; PMC3053032; 21056893.
DR   EuropePMC; PMC3210867; 21982818.
DR   EuropePMC; PMC3309595; 22377271.
DR   EuropePMC; PMC4111098; 24161346.
DR   EuropePMC; PMC4136298; 24920811.
DR   EuropePMC; PMC4217180; 25352624.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..3343
FT                   /organism="Allpahuayo mammarenavirus"
FT                   /host="Oecomys bicolor"
FT                   /strain="CLHP-2472"
FT                   /mol_type="genomic RNA"
FT                   /country="Peru: Northeast"
FT                   /db_xref="taxon:144752"
FT   CDS             33..1718
FT                   /codon_start=1
FT                   /gene="N"
FT                   /product="nucleocapsid protein"
FT                   /db_xref="GOA:Q9DK04"
FT                   /db_xref="InterPro:IPR000229"
FT                   /db_xref="InterPro:IPR035083"
FT                   /db_xref="InterPro:IPR035084"
FT                   /db_xref="InterPro:IPR038115"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9DK04"
FT                   /protein_id="AAG42532.1"
FT                   /translation="MSSENVPSFRWTQSLRRGLSNWTHAVKGDVLADARAIVSALDFHQ
FT                   VAQVQRMMRKDKRSEADLTRLRDMNKEVDALMMMRSAQKDNILKVGGLSKDELMELASD
FT                   LDKLRKKVQRTEGGGQPGVYAGNLTSSQLNQRSEILKMMGMGTGPRGPVGGVVKVWDIK
FT                   DSSLLVNQFGSMPALTIACMTQQGGEQMNDVVQALTSLGLVYTVKYPNLSDLEKLTEKH
FT                   PCLKLITQEPAQINISGYNLSLSAAVKADACMIDGGNMLETLQVKPSMFSTLIKTILEV
FT                   KNREGMFVSPSPGQRNPYENILYKVCLSGDGWPYIGSRSQIKGRAWENTTVDLEGKPSV
FT                   NHPPVRNGGSPDLKQIPKTKEDEVIRAIEQLDPRGTTWVDIEGPPGDPVELALFQPETG
FT                   NYLHCYRRPHNENAFKDQSKFSHGLLLKDLADTQPGLISCIIRHLPNNMVLTAQGNDDI
FT                   IKLLEMHGRRDIKVLDVKLSSDQARLMEDVVWERYNMLCVKHTGLVIKKKKKGAAPGSA
FT                   NPHCALLDCIMFDATVTGYLRDQKPKRLLPLDTLYRDNANLINL"
FT   CDS             complement(1789..3312)
FT                   /codon_start=1
FT                   /gene="GPC"
FT                   /product="glycoprotein precursor"
FT                   /db_xref="GOA:Q9DK03"
FT                   /db_xref="InterPro:IPR001535"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q9DK03"
FT                   /protein_id="AAG42531.1"
FT                   /translation="MGQVVTFLQSLPEVINEAINIALIAISIICILKGLVNFWKCGVVQ
FT                   LAIFLCLAGRKCDGLMIDRRHELSHVELNLTRMFDNLPQSCSKNNTHHYYKGPKGTTWG
FT                   IELTLTNTSLDSYANMSRIRSLAFGNITNCDKTGEAGHTLKWLLNELHFNVLHVTRHVG
FT                   ARCRVSEGAGLLIQYNLTIGDHGGEVGRHLIASLAQIIGDNKAAWVGKCDSHCTMDGKC
FT                   NYTNCEGFTHYNYLIIQNTTWENHCSYSPMSTIRMALNKVAYSSVSRQLLGFFTWDISD
FT                   SSGAHVPGGYCLEQWAIVWAGIKCFDNAVMAKCNKDHNVEFCDTMRLFDFNQNAIKTLQ
FT                   LNVENSVNLLKRSINGLISDSLVIRNSLKQLAKIPYCNYTKFWYVNDTITGKHSLPQCW
FT                   LMRNGSYLNETHFKNEWLWESQNLYNEMLLKEYEDRQGKTPIALTDICFWSLVFFTSTV
FT                   FLQLVGIPTHRHLVGEGCPKPHRITSNSLCACGYYKIPKRPTRWVRKGK"
XX
SQ   Sequence 3343 BP; 966 A; 721 C; 770 G; 886 T; 0 other;
     tttaggataa cgctttgcta acgatctcga acatgagttc tgagaatgta ccctcattcc        60
     gctggactca atcccttaga aggggtctgt ccaactggac ccatgctgtg aagggagacg       120
     ttcttgcgga tgcgagggcc atagtctctg cgcttgactt ccaccaagtt gcacaagtgc       180
     aaagaatgat gaggaaggat aaaaggagcg aggcggatct gaccagattg agagacatga       240
     ataaagaggt tgacgctctt atgatgatga ggtctgcaca aaaagacaat atcttgaaag       300
     tgggtggatt gtcaaaagat gagttgatgg agctggcttc tgacctagac aaactgagga       360
     agaaggtcca gaggacagaa gggggtggtc aaccaggtgt atatgcaggg aacctaactt       420
     catctcagct gaatcagaga tcagaaatcc tgaagatgat ggggatgggt acaggtccaa       480
     gaggcccagt tggaggtgtt gtgaaggttt gggacatcaa ggacagcagc ctccttgtca       540
     atcaatttgg ctctatgcca gcactaacca tcgcctgcat gacacagcaa ggaggagaac       600
     agatgaatga tgtggttcaa gccctaacat ctctgggact tgtgtacaca gtgaagtatc       660
     cgaacttgtc tgacttagag aaattgaccg agaaacatcc ctgtctgaag ctcatcacac       720
     aagagcctgc tcagatcaac atttcggggt acaatctaag cttgtctgct gcagtcaagg       780
     cggatgcttg tatgattgat ggtggcaaca tgttggaaac tttacaggtt aaaccctcaa       840
     tgtttagcac ccttatcaag acaattcttg aagtgaaaaa cagagaaggc atgtttgtga       900
     gtccatcccc tgggcagaga aatccctacg aaaacatatt atacaaggtt tgtctgtcag       960
     gagatggttg gccgtacatt ggctcaagat cacaaatcaa aggaagagca tgggagaaca      1020
     ccactgttga tcttgagggc aaaccatcag tgaaccaccc tccagtgagg aatggtggtt      1080
     caccagatct aaaacaaata cccaagacaa aggaagatga ggtgataaga gccattgaac      1140
     agcttgaccc gagaggaaca acctgggtgg atattgaggg tccaccaggt gatcctgttg      1200
     aacttgctct atttcaacca gaaactggca attacttgca ttgctacaga agaccacaca      1260
     atgaaaacgc attcaaggac cagagcaagt tctctcatgg cttgttgctc aaagatctgg      1320
     ctgatacaca gccaggactc atctcgtgca taataagaca ccttccaaac aacatggtgt      1380
     tgactgcaca aggtaatgat gacataatca aactacttga gatgcatgga agaagagaca      1440
     ttaaggtgtt ggatgtgaag ttgtcatcag atcaagctcg tctaatggag gatgtggtat      1500
     gggaacgtta caacatgctc tgcgtgaaac acactgggct cgtcattaag aagaagaaga      1560
     agggtgcagc tccaggatca gccaatcccc attgcgcatt gttggactgt ataatgtttg      1620
     atgccactgt cacaggctat cttcgagacc aaaaaccgaa aagacttctt ccacttgaca      1680
     ctctatacag agacaatgct aatctaatta acctttaaaa ccttctaatt gaaggcctcg      1740
     atgtcacacc ccctaggggg tgtgacatcg aggcgctggg agagcgaatt acttgccttt      1800
     tctgacccat ctagtcggtc tttttggaat tttatagtaa ccgcaggcac acaatgagtt      1860
     tgatgtaatc ctatggggtt ttgggcaccc ttcgccgact aagtgtcgat gtgttgggat      1920
     gccaacaagc tgcaagaaca ctgtgctggt gaagaaaacc aaggaccaga agcagatgtc      1980
     agttaaggca atgggtgtct ttccctgcct atcctcatat tctttcagca gcatctcatt      2040
     gtacaaattt tgactttccc acaaccactc atttttaaag tgtgtctcat ttaagtaaga      2100
     gccattgcgc ataagccaac attgaggcaa actatgtttg cctgtgatgg tgtcattcac      2160
     ataccagaac ttagtgtaat tacagtaagg aatcttggct agctgcttca aactattcct      2220
     aatcaccaaa ctatctgata tcaaaccatt aatgctcctc ttcagaagat ttacactgtt      2280
     ctcaacattc agctgaaggg ttttaatggc gttttggttg aagtcaaaca gtctcattgt      2340
     gtcacaaaac tcaacattgt ggtccttgtt gcactttgcc atcacagcat tatcaaaaca      2400
     ttttattcct gcccatacta ttgcccattg ttctaaacaa tatccacctg ggacatgtgc      2460
     tccgcttgag tccgagatgt cccatgtgaa gaatccaagt agttgtctcg acacagaact      2520
     ataagctact ttgttcaaag ccatccgaat tgttgacatt ggtgagtaag agcaatgatt      2580
     ctcccaggtt gtattctgaa tgatgaggta attatagtgt gtgaatccct cacagttggt      2640
     ataattgcat ttgccatcca ttgtgcaatg gctgtcacac tttcccaccc aagctgcttt      2700
     gttgtcacca atgatctgag caagagaagc aatgagatgt ctgccgactt cacctccatg      2760
     atctccaata gtgaggttgt attgtatgag cagcccagca ccctctgaca ctctgcatct      2820
     cgctccaaca tgtctggtga catggagcac attgaagtga agctcattaa gcagccactt      2880
     gagtgtgtgc cctgcttcac ctgtcttgtc acagttggtg atgttcccga atgccaaact      2940
     tctgatcctg ctcatattag cataactgtc aagagaagtg tttgtaagag tcagttcaat      3000
     tccccaggtg gttccttttg ggcctttata atagtggtgt gtgttgtttt tgctgcaaga      3060
     ttgagggagg ttgtcaaaca ttctagttag gtttagttcg acgtgtgaga gctcatgtct      3120
     tctgtcaatc atcagaccat cacattttct acctgcaagg caaaggaaaa ttgcgagttg      3180
     cacaacccca cacttccaaa agttgaccaa tcctttcaga atacagataa tagaaatggc      3240
     aatcaaggca atgttaattg cctcattgat cacctcgggt agagactgta gaaatgtgac      3300
     aacttgtccc attttggagt cacacaggcg tgatcaagga aat                        3343
//