ID   HM145750; SV 1; linear; genomic DNA; STD; VRL; 4944 BP.
XX
AC   HM145750;
XX
DT   21-JUL-2010 (Rel. 105, Created)
DT   13-AUG-2010 (Rel. 105, Last updated, Version 2)
XX
DE   Bocavirus gorilla/GBoV1/2009, complete genome.
XX
KW   .
XX
OS   Bocavirus gorilla/GBoV1/2009
OC   Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Bocavirus.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-4944
RX   DOI; 10.1371/journal.pone.0011948.
RX   PUBMED; 20668709.
RA   Kapoor A., Mehta N., Esper F., Poljsak-Prijatelj M., Quan P.L., Qaisar N.,
RA   Delwart E., Lipkin W.I.;
RT   "Identification and characterization of a new bocavirus species in
RT   gorillas";
RL   PLoS One 5(7):E11948-E11948(2010).
XX
RN   [2]
RP   1-4944
RA   Kapoor A., Lipkin W.I.;
RT   ;
RL   Submitted (23-APR-2010) to the INSDC.
RL   Center for Infection and Immunity, Columbia University, 722 West 168th
RL   Street, New York, NY 10032, USA
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4944
FT                   /organism="Bocavirus gorilla/GBoV1/2009"
FT                   /host="Gorilla gorilla"
FT                   /isolate="GBoV1"
FT                   /mol_type="genomic DNA"
FT                   /country="USA"
FT                   /isolation_source="stool"
FT                   /collection_date="04-Apr-2009"
FT                   /db_xref="taxon:864686"
FT   CDS             14..2428
FT                   /codon_start=1
FT                   /product="non structural protein 1"
FT                   /note="NS1"
FT                   /db_xref="GOA:D9ZK36"
FT                   /db_xref="InterPro:IPR001257"
FT                   /db_xref="InterPro:IPR014015"
FT                   /db_xref="UniProtKB/TrEMBL:D9ZK36"
FT                   /protein_id="ADK34009.1"
FT                   /translation="MAFNPPVIRAFSQPAFTYVFKFPYPLWKEKEYLLHALLAHGTEQA
FT                   MIQLRSCAPHPDEDIIRDDLLISLEDRHFGAILCKAVYMASTTLMSQKRRNMFPRCDII
FT                   VQSEIGEENLHCHIIIGGDGLSKRNAKSSCTQFYGLILAELVQRCKTLLATRPFEPEEA
FT                   DIYHALKKAEREAWGGITGGNMQILQYRDRRGDIHAQTVDPLRFFKNYLLPKNRCLSSY
FT                   SKPEVCTSPENWFILAEKTYSHTLVNGLPLPEHYRKHYHTTLDNEIIPGPQTMAYGGRG
FT                   PWEHLPEVGDQRLAASSVSTTYKPNKKEKLMLNLLEKCSELNLLVYEDLVANCPELLLM
FT                   LEGQPGGARLIEQVLGMHHINVCSNFTALSYLFHLYPVTSLDSNNKALQLLLTQGYNPL
FT                   MVGHALCCVLNKQFGKQNTVCFYGPASTGKTNMAKAIVQGIRLYGCVNHLNKGFVFNDC
FT                   RQRLVVWWEECLMHQDWVEPAKCILGGTECRIDVKHRDSVLLTQTPVIISTNHDIYAVV
FT                   GGNSVSHVHAAPLKERVIQLNFMKQLSQTFGEITATEIAALLHWCFNEYDCTLTGFKTK
FT                   WKLDKIPNSFPLGVLCPNHSQDFTLHENGYCTDCGGYLAHSADDSVYTDCTSQTSREEL
FT                   DPGKLNYINYSTLHLTSKAYITLFTGNLGDTDGEDTKPETPEVGVCAPKKRRISTPTTP
FT                   PNSPASSVSTFTFFDNWYAQPQDEDELREYERQTSLLQKKRQSRERREKTPVADISSQE
FT                   SQPEPNPTQWGEQLGVLPSGTPDQPPIVLHCFEDFRPSDEDEGEYIGEKRQ"
FT   CDS             2169..2828
FT                   /codon_start=1
FT                   /product="nucleoprotein 1"
FT                   /note="NP1"
FT                   /db_xref="GOA:D9ZK37"
FT                   /db_xref="InterPro:IPR021075"
FT                   /db_xref="UniProtKB/TrEMBL:D9ZK37"
FT                   /protein_id="ADK34010.1"
FT                   /translation="MSSGNMKDKHRSYKRKGSPGKDEKKRQWQTSRHRSHSRSPIRHSG
FT                   ENNSGFYRQEHPINHLSSCTASKTSGQVMKTKENTSGKKDSRTNPYTVFSQHRASNPNA
FT                   PGWCGFYWHSTRIARDGTNAIFNEMKQQFQELQIDNKIGWDSTRELLFNQKKTLDQKYR
FT                   NMFWHFRNASDCERCAYWDDVYRRHLANVSSQTESEEITDEEMLSAVETMETDASN"
FT   CDS             2815..4830
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /note="VP1"
FT                   /db_xref="GOA:D9ZK38"
FT                   /db_xref="InterPro:IPR001403"
FT                   /db_xref="InterPro:IPR013607"
FT                   /db_xref="InterPro:IPR016184"
FT                   /db_xref="UniProtKB/TrEMBL:D9ZK38"
FT                   /protein_id="ADK34011.1"
FT                   /translation="MPPIKRQPGGWVLPGYRYLGPFNPLDNGKPINNADRAAQAHDKSY
FT                   SELIKSGKNPYLYFNKADEKFINDLKDDWSIGGIIGSTFFKLKRAVAPALGNKERAQKR
FT                   HFYFANSNKGAKKSKSSEPKPRTSKMSENEIQDQQPSDAVDGQRGGSSAAGSIGGGKGS
FT                   GVGISTGGWVGGSHFADKYVITKNTRQFITSIQNGHLYKTEIINPSNGNGKSQRCVTTP
FT                   WTYFNFNQYSCHFSPQDWQRLTNEYKRFRPKAMLVKIYNLQIKQILSNGADTTYNNDLT
FT                   AGVHIFCDGEHAYPNASHPWDEDVMPDLPYKTWKLFQYGYIPILNELADLDGTTAGGTA
FT                   TEKAILYQMPFFMLENSDHEVLRTGESSEFTFNFDCEWVNNERAYIPPGLMFNPKVPTR
FT                   RVQYIRQNGQTTASTSRIEPYSKPTSWMTGPGFLGGQRVGPATSDTAPYMVCTKPDGVY
FT                   INTGAAGYGSGFDPPSGSLAPTDLEYKLQWYQTPEDTGNNGNIIANPSLSMLRDQLLYR
FT                   GNQTTYNLNADVWMFPNQIWDRYPITREHPIWCKKPRADKNTIIDPFDGSIAMDHPPGT
FT                   IFIKMAKIPVPSSTNADSYLNIYCTGQVSCEIVWEVERYVTKNWRPERRHTALGMSIGG
FT                   TENVSPTYHVDSAGTYIQPTTFDQCMPVKTNINKVL"
FT   CDS             3202..4830
FT                   /codon_start=1
FT                   /product="capsid protein"
FT                   /note="VP2"
FT                   /db_xref="GOA:D9ZK39"
FT                   /db_xref="InterPro:IPR001403"
FT                   /db_xref="InterPro:IPR016184"
FT                   /db_xref="UniProtKB/TrEMBL:D9ZK39"
FT                   /protein_id="ADK34012.1"
FT                   /translation="MSENEIQDQQPSDAVDGQRGGSSAAGSIGGGKGSGVGISTGGWVG
FT                   GSHFADKYVITKNTRQFITSIQNGHLYKTEIINPSNGNGKSQRCVTTPWTYFNFNQYSC
FT                   HFSPQDWQRLTNEYKRFRPKAMLVKIYNLQIKQILSNGADTTYNNDLTAGVHIFCDGEH
FT                   AYPNASHPWDEDVMPDLPYKTWKLFQYGYIPILNELADLDGTTAGGTATEKAILYQMPF
FT                   FMLENSDHEVLRTGESSEFTFNFDCEWVNNERAYIPPGLMFNPKVPTRRVQYIRQNGQT
FT                   TASTSRIEPYSKPTSWMTGPGFLGGQRVGPATSDTAPYMVCTKPDGVYINTGAAGYGSG
FT                   FDPPSGSLAPTDLEYKLQWYQTPEDTGNNGNIIANPSLSMLRDQLLYRGNQTTYNLNAD
FT                   VWMFPNQIWDRYPITREHPIWCKKPRADKNTIIDPFDGSIAMDHPPGTIFIKMAKIPVP
FT                   SSTNADSYLNIYCTGQVSCEIVWEVERYVTKNWRPERRHTALGMSIGGTENVSPTYHVD
FT                   SAGTYIQPTTFDQCMPVKTNINKVL"
XX
SQ   Sequence 4944 BP; 1647 A; 1059 C; 972 G; 1266 T; 0 other;
     tggtgagttc aaaatggctt tcaatcctcc tgtgattaga gctttctctc aacctgcttt        60
     tacttatgtc ttcaagtttc catatccact atggaaagaa aaagaatact tactacacgc       120
     attacttgca cacggaacag aacaagcaat gattcagtta agaagctgtg cacctcatcc       180
     ggatgaagat ataatccgag atgacttact gatttcttta gaggatcgcc attttggtgc       240
     tattctatgc aaagctgtct acatggcaag caccacacta atgtcacaaa aacgacgaaa       300
     tatgtttcct cgctgtgaca ttattgttca atctgaaatt ggggaagaaa accttcactg       360
     ccacatcata atcgggggag acggactgag caaacgaaat gctaaatcct cttgtactca       420
     attctatgga ctaattctag ctgaactagt ccaacgctgc aaaacactct tagctacgcg       480
     tccatttgaa ccagaagaag ctgatatata tcacgctcta aaaaaagcag aacgcgaggc       540
     ttggggtggg attactggag gcaacatgca aatcctacag tacagagatc gcagaggaga       600
     tattcacgct cagacggtgg atcctcttcg cttcttcaaa aactaccttt tacctaaaaa       660
     tagatgtctt tcatcttaca gcaaacctga agtttgtact tctcctgaga actggtttat       720
     tttagctgaa aaaacttaca gtcacactct tgttaacggg ctgccgcttc ctgaacatta       780
     cagaaaacac taccacacaa ccctagataa cgaaatcatt ccagggcctc aaaccatggc       840
     ctatggggga cgtggtccgt gggaacatct tcctgaggta ggagatcagc gtttagctgc       900
     ttcttctgtc agtactacat acaaacctaa caaaaaagaa aagctaatgc taaacttact       960
     tgaaaaatgt agtgaactta atcttctagt atatgaagac ttagttgcta actgtcctga      1020
     acttttactt atgcttgaag gtcagccagg aggtgcacgc cttatcgaac aggtgctagg      1080
     catgcaccat atcaatgttt gctcaaactt tacagctcta agctatttat ttcaccttta      1140
     tcctgttacc tctcttgact ctaataacaa ggcattgcaa ctgttgttga cacaaggcta      1200
     taacccacta atggttgggc acgccttgtg ctgtgtgctg aacaaacagt tcggcaaaca      1260
     aaatactgtt tgcttttacg gccctgcttc cacaggtaaa acaaacatgg caaaggccat      1320
     agtccaagga attagacttt atggctgtgt taatcattta aacaagggat ttgtgtttaa      1380
     tgattgcagg caacgcctag ttgtttggtg ggaggagtgc ctaatgcacc aagactgggt      1440
     ggaaccagca aaatgtattc taggaggaac agagtgtaga attgacgtca aacacagaga      1500
     tagtgtacta ttgactcaaa ctccagtaat aatttccact aaccacgata tctatgcagt      1560
     tgttggtggt aattctgtgt ctcatgttca tgcggctcca ctgaaagaaa gagtgattca      1620
     gctaaacttt atgaaacaac tttcacagac ttttggagag atcactgcta cagaaattgc      1680
     agctctacta cactggtgtt tcaatgagta cgactgtact ctgacaggct ttaaaacaaa      1740
     atggaaatta gataaaattc caaactcatt tcctcttggg gtcctttgtc ctaatcattc      1800
     acaggacttt acacttcacg aaaacggata ctgcactgat tgtggtggtt accttgctca      1860
     tagtgctgac gattctgtgt acactgattg cacaagccaa actagcagag aagaactcga      1920
     cccaggtaag cttaactaca ttaactattc tacattacat cttacttcca aagcttatat      1980
     aactttattt acaggtaacc tgggggatac ggacggagag gacaccaagc cagaaacacc      2040
     ggaagtgggt gtttgtgcac ccaagaagcg acgcataagt actcctacaa ctcctccaaa      2100
     ctcaccagca agttcagtga gtacctttac cttttttgat aattggtacg cacaaccaca      2160
     ggacgaagat gagctcaggg aatatgaaag acaaacatcg ctcttacaaa agaaaaggca      2220
     gtccagggaa agacgagaaa aaacgccagt ggcagacatc tcgtcacagg agtcacagcc      2280
     ggagcccaat ccgacacagt ggggagaaca actcggggtt ttaccgtcag gaacacccga      2340
     tcaaccacct atcgtcttgc actgcttcga agacttcagg ccaagtgatg aagacgaagg      2400
     agaatacatc ggggaaaaaa gacagtagaa ccaatccata cactgtattc agtcaacaca      2460
     gggcttcaaa tcctaacgct ccagggtggt gtgggtttta ctggcattct actcgaattg      2520
     ctagagacgg tactaatgct atttttaatg aaatgaaaca acaattccaa gaacttcaaa      2580
     ttgataataa aattggctgg gacagcacta gggaactttt atttaatcag aaaaaaacac      2640
     tagatcaaaa atacagaaat atgttttggc actttagaaa tgcttcagat tgtgaacgct      2700
     gtgcttattg ggatgatgta taccgtagac acttagctaa tgtttcctct cagacagaat      2760
     cagaagaaat aactgacgag gaaatgcttt ctgctgttga aactatggaa acagatgcct      2820
     ccaattaaaa ggcaacctgg aggctgggta ctacctggtt atagatatct tggtccgttt      2880
     aatcctcttg ataacggtaa acctattaat aacgctgatc gcgctgctca agcacatgat      2940
     aaatcatact ctgaattaat aaaaagtgga aaaaatccat atttatattt caataaagct      3000
     gatgaaaaat tcattaatga tttaaaagac gattggtcta ttggtggaat tattggctca      3060
     acttttttta aactaaaacg cgccgtggct cccgctctgg gtaataaaga gcgagcacaa      3120
     aaaagacact tctactttgc aaactcaaat aaaggtgcta aaaaatccaa atccagtgaa      3180
     cctaaaccaa gaacatcaaa aatgtcagaa aacgaaattc aagaccaaca accatcagac      3240
     gctgttgatg gccaacgcgg aggctcttca gcagctggta gtattggtgg ggggaaaggt      3300
     tccggtgtgg gcatatccac aggagggtgg gttggtggct cacactttgc tgacaaatac      3360
     gtgataacta aaaacaccag acaattcatc acaagcattc aaaatgggca tttatataaa      3420
     acagaaataa taaatccttc gaatggcaat gggaaatcac aacgctgcgt aaccacacct      3480
     tggacttact ttaactttaa ccaatacagt tgtcattttt caccacaaga ctggcaacgc      3540
     ttaacaaatg aatacaaaag attcagacct aaagcaatgc tagtaaaaat ctataattta      3600
     caaataaaac aaattctttc taatggtgct gacactacgt acaacaacga cctaacagct      3660
     ggtgtccaca ttttttgtga cggcgaacac gcatacccaa acgcatctca tccatgggat      3720
     gaagatgtaa tgccggacct tccatataaa acatggaagc tttttcaata tggatacatt      3780
     cctattttaa atgagcttgc cgatcttgat ggaactactg ctggaggaac tgcaacagaa      3840
     aaggcaattt tatatcaaat gccttttttc atgctggaaa acagtgatca tgaagttctg      3900
     agaactggtg agagctcaga attcacattt aactttgatt gtgaatgggt taacaatgaa      3960
     agagcataca ttcctccagg attaatgttt aacccaaaag ttccaacaag acgagttcaa      4020
     tacataagac aaaacggaca aacaactgcc agcactagtc gaatagaacc atactcaaaa      4080
     cctacaagct ggatgacagg accaggtttc ctaggtgggc aaagagtagg accagcaaca      4140
     tcagacactg ctccctacat ggtttgtacc aaacctgatg gagtatacat aaacactgga      4200
     gctgctggat atggatctgg atttgatcct ccaagcggca gccttgctcc tacggaccta      4260
     gaatacaaac ttcaatggta ccaaacacca gaagacacag gaaacaacgg aaacataatt      4320
     gcaaatccat ctttatctat gcttagagac caactcctct acagaggaaa ccaaacaaca      4380
     tacaacctaa atgcagacgt ttggatgttt cctaaccaaa tttgggacag atacccaata      4440
     accagagaac atccaatttg gtgcaaaaaa ccaagagcag acaaaaacac aatcatagat      4500
     ccatttgatg gatctattgc tatggatcac ccaccaggaa ccatttttat caaaatggca      4560
     aaaattccag ttccatcttc aacaaatgca gactcgtact taaacattta ctgcacaggg      4620
     caagttagct gcgaaattgt atgggaagta gaaagatacg taacaaagaa ctggcgtcct      4680
     gaaagaagac acactgcact tggaatgagc attggaggaa cagaaaatgt aagcccaact      4740
     taccatgttg actctgctgg aacatacatc cagccaacaa ctttcgatca atgtatgcct      4800
     gtaaaaacaa acatcaataa agtgttgtaa tcactttagc ctcttttttg cttacgcttg      4860
     taagttccct tccaatggac aagtggatag tgaagggtga ctgtaatccc gagctcatga      4920
     gttcgaggct acagtccgat agca                                             4944
//