ID   EU918736; SV 1; linear; genomic DNA; STD; VRL; 5242 BP.
XX
AC   EU918736;
XX
DT   16-APR-2009 (Rel. 100, Created)
DT   22-APR-2009 (Rel. 100, Last updated, Version 2)
XX
DE   Human bocavirus 3 strain W471, complete genome.
XX
KW   .
XX
OS   Human bocavirus 3
OC   Viruses; Parvoviridae; Parvovirinae; Bocaparvovirus;
OC   Primate bocaparvovirus 1.
XX
RN   [1]
RC   Publication Status: Online-Only
RP   1-5242
RA   Arthur J.L., Higgins G.D., Davidson G.P., Givney R.C., Ratcliff R.M.;
RT   "A Novel Bocavirus Associated with Acute Gastroenteritis in Australian
RT   Children";
RL   PLoS Pathog. 5(4):E1000391-E1000391(2009).
XX
RN   [2]
RP   1-5242
RA   Ratcliff R.M., Arthur J.L.;
RT   ;
RL   Submitted (23-JUL-2008) to the INSDC.
RL   Infectious Diseases Laboratories, Institute of Medical and Veterinary
RL   Science, Frome Road, Adelaide, South Australia 5000, Australia
XX
DR   MD5; c092e905c11a3d06006287ab1ae0d270.
DR   EuropePMC; PMC2663820; 19381259.
DR   EuropePMC; PMC2957997; 20113572.
DR   EuropePMC; PMC3020864; 20844210.
DR   EuropePMC; PMC3078135; 21525999.
DR   EuropePMC; PMC3125170; 21738642.
DR   EuropePMC; PMC3322079; 21888826.
DR   EuropePMC; PMC3382199; 22761854.
DR   EuropePMC; PMC3487788; 23133667.
DR   EuropePMC; PMC3837656; 24209884.
DR   EuropePMC; PMC3838256; 24109231.
DR   EuropePMC; PMC5375684; 28122984.
DR   EuropePMC; PMC5780762; 22531100.
DR   EuropePMC; PMC6360332; 30766894.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..5242
FT                   /organism="Human bocavirus 3"
FT                   /host="Homo sapiens"
FT                   /strain="W471"
FT                   /mol_type="genomic DNA"
FT                   /country="Australia"
FT                   /collection_date="Aug-2001"
FT                   /note="acronym: HBoV3"
FT                   /db_xref="taxon:638313"
FT   CDS             224..2209
FT                   /codon_start=1
FT                   /product="NS1"
FT                   /note="NS1 protein; non-structural protein"
FT                   /db_xref="GOA:C5IY48"
FT                   /db_xref="InterPro:IPR001257"
FT                   /db_xref="InterPro:IPR014015"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/Swiss-Prot:C5IY48"
FT                   /protein_id="ACH81927.1"
FT                   /translation="MAFNPPVIRAFSSPAFTYVFKFPYPSWKEKEWLLHALLAHGTEQA
FT                   MIQLRNCVPHPDEDIIRDDLLLSLEDRHFGAILCKAVYMATTTFMSQKQRNMFPRCDII
FT                   VQSELGETNLHCHIIVGGEGLSKRNAKTSCPQLYGLILGELIQRCKTLLATRPFEPEEA
FT                   EIYHALKRAEREAWGGVTSGNLQILQYRDRRGDLHAQQVDALRFFKNYLLPKNRCITSY
FT                   SRPDVCTSPENWFVLAEKTYCHTLVNGLPLPEHYRKHYHATLDNEVLPGPQTMAFGGRG
FT                   PWEHLPEVGDQRLAASSVSTTYKPNKKEKLMLNLLDKCSELNLLVYEDLVANCPELLLM
FT                   LEGQPGGARLIEQVLGMHHINVCSNFTALSYLFHLYPGTTLSSDNKALQLLLIQGYNPL
FT                   MVGHALCCVLNKQFGKQNTVCFYGPASTGKTNMAKAIVQGIRLYGCVNHLNKGFVFNDC
FT                   RQRLVVWWEECLMHQDWVEPAKCILGGTECRIDVKHRDSVLLTQTPVIISTNHDIYAVV
FT                   GGNSVSHVHAAPLKERVIQLNFMKQLPQTFGEITPEEIAALLQWCFNEYECTLTGFKTK
FT                   WSLDKIPNSFPLGVLCPTHSQDFILHENGYCTDCGGYLAHSADDSVYTDRASDTSKEAI
FT                   DAGKFTFSKHFIYILYTTLKHMFNYR"
FT   CDS             2380..3036
FT                   /codon_start=1
FT                   /product="NP1"
FT                   /note="NP1 protein; non-structural protein"
FT                   /db_xref="GOA:C1IWT1"
FT                   /db_xref="InterPro:IPR021075"
FT                   /db_xref="UniProtKB/Swiss-Prot:C1IWT1"
FT                   /protein_id="ACH81928.1"
FT                   /translation="MSSGNTKDKHRAYKRKGSPGRDERKRPWQPHHRSRSRSPIRRSGE
FT                   TSSGSYRQEHQISHLSSCTASKISDPVTKTKENTSGKRDSRTNPYTVFSQHKASHPDAP
FT                   GWCGFYWHSTRIARNGTNAIFNEMKQQFQQLQLDNKIGWDSARELLFSQKKSLDQQYRN
FT                   MFWHFRNASDCERCNYWDNVYRMHLAHVSSQTESEEITDEEMLSAAESMETDASN"
FT   CDS             3023..5029
FT                   /codon_start=1
FT                   /product="VP1"
FT                   /note="VP1 protein; minor capsid protein"
FT                   /db_xref="GOA:C1IWT2"
FT                   /db_xref="InterPro:IPR001403"
FT                   /db_xref="InterPro:IPR013607"
FT                   /db_xref="InterPro:IPR016184"
FT                   /db_xref="InterPro:IPR036952"
FT                   /db_xref="PDB:5US7"
FT                   /db_xref="UniProtKB/Swiss-Prot:C1IWT2"
FT                   /protein_id="ACH81929.1"
FT                   /translation="MPPIKRQPGGWVLPGYKYLGPFNPLDNGEPVNKADRAAQSHDKSY
FT                   SELIKSGKNPYLYFNKADEKFIDDLKNDWSLGGIIGSSFFKLKRAVAPALGNKERAQKR
FT                   HFYFANSNKGAKKSKNNEPKPSTSKMSENEIQDQQPSEPNDGQRGGGGGATGSVGGGKG
FT                   SGVGISTGGWVGGSYFTDSYVITKNTRQFLVKIQNNHQYKTESIIPSNGGGKSQRCVST
FT                   PWSYFNFNQYSSHFSPQDWQRLTNEYKRFRPKGMHVKIYNLQIKQILSNGADVTYNNDL
FT                   TAGVHIFCDGEHAYPNATHPWDEDVMPELPYQTWYLFQYGYIPTIHELAEMEDSNAVEK
FT                   AIALQIPFFMLENSDHEVLRTGESAEFNFNFDCEWINNERAFIPPGLMFNPLVPTRRAQ
FT                   YIRRNGNTQASTSRVQPYAKPTSWMTGPGLLSAQRVGPAASDTAAWMVGVDPEGANINS
FT                   GRAGVSSGFDPPAGSLRPTDLEYKVQWYQTPAGTNNDGNIISNPPLSMLRDQTLYRGNQ
FT                   TTYNLCSDVWMFPNQIWDRYPVTRENPIWCKQPRSDKHTTIDPFDGSIAMDHPPGTIFI
FT                   KMAKIPVPSNNNADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGADE
FT                   INPTYHVDKNGAYIQPTTWDMCFPVKTNINKVL"
FT   CDS             3410..5029
FT                   /codon_start=1
FT                   /product="VP2"
FT                   /note="VP2 protein; major capsid protein"
FT                   /db_xref="GOA:C1IWT2"
FT                   /db_xref="InterPro:IPR001403"
FT                   /db_xref="InterPro:IPR013607"
FT                   /db_xref="InterPro:IPR016184"
FT                   /db_xref="InterPro:IPR036952"
FT                   /db_xref="PDB:5US7"
FT                   /db_xref="UniProtKB/Swiss-Prot:C1IWT2"
FT                   /protein_id="ACH81930.1"
FT                   /translation="MSENEIQDQQPSEPNDGQRGGGGGATGSVGGGKGSGVGISTGGWV
FT                   GGSYFTDSYVITKNTRQFLVKIQNNHQYKTESIIPSNGGGKSQRCVSTPWSYFNFNQYS
FT                   SHFSPQDWQRLTNEYKRFRPKGMHVKIYNLQIKQILSNGADVTYNNDLTAGVHIFCDGE
FT                   HAYPNATHPWDEDVMPELPYQTWYLFQYGYIPTIHELAEMEDSNAVEKAIALQIPFFML
FT                   ENSDHEVLRTGESAEFNFNFDCEWINNERAFIPPGLMFNPLVPTRRAQYIRRNGNTQAS
FT                   TSRVQPYAKPTSWMTGPGLLSAQRVGPAASDTAAWMVGVDPEGANINSGRAGVSSGFDP
FT                   PAGSLRPTDLEYKVQWYQTPAGTNNDGNIISNPPLSMLRDQTLYRGNQTTYNLCSDVWM
FT                   FPNQIWDRYPVTRENPIWCKQPRSDKHTTIDPFDGSIAMDHPPGTIFIKMAKIPVPSNN
FT                   NADSYLNIYCTGQVSCEIVWEVERYATKNWRPERRHTALGLGIGGADEINPTYHVDKNG
FT                   AYIQPTTWDMCFPVKTNINKVL"
XX
SQ   Sequence 5242 BP; 1701 A; 1129 C; 1081 G; 1331 T; 0 other;
     tcagtgctac aacgtcacat ataaaataat aaatattcac aaggaggagt ggctacgtat        60
     ggggtgatca taaacacgcc caggaagtga cgtatgtcag ccaatcagca tcgagcatat       120
     aacctatata aaccgatgca cttccgcatt tcgtcagact gcatccggtc tccggcgagt       180
     gaacatctct gggaagagct ccacacttgt ggtgagtcaa attatggctt tcaatccacc       240
     tgttattaga gcattctctt cacctgcttt tacttatgtc ttcaaatttc catatccatc       300
     atggaaagaa aaagaatggc ttcttcatgc acttctggct catggtaccg agcaagccat       360
     gatccagctg agaaactgtg ttcctcatcc ggatgaagat ataatccgtg atgacttact       420
     actttctcta gaagatcgcc attttggggc aattctctgc aaggctgtct atatggctac       480
     tactactttt atgtcacaga aacaaagaaa tatgtttcct cgctgtgaca taatagtcca       540
     atctgagctt ggggagacaa acctacactg ccatattata gttgggggag aaggcttaag       600
     caagagaaat gcaaaaacat catgtcctca actatatgga ctgatactag gggaattaat       660
     ccaacgctgc aaaactcttc tggctacgcg tccttttgaa ccggaagagg cagaaattta       720
     tcatgcttta aaacgagctg agcgagaagc ttggggtgga gttactagcg gcaacctaca       780
     aattctccaa tacagagatc gcagaggaga ccttcacgca caacaagtgg atgctcttcg       840
     cttcttcaaa aactacctat tgcctaaaaa tagatgcatt acatcttaca gcagacctga       900
     tgtctgtact tctccagaaa actggtttgt tttagctgaa aaaacttact gtcacactct       960
     tgttaacggg ctgccgcttc cagaacatta cagaaaacac taccacgcaa ccctagataa      1020
     cgaagttcta ccagggcctc agacaatggc ctttggggga cgtggtccgt gggaacatct      1080
     tcctgaggta ggagatcaac gtttagctgc ttcttctgtt agtacaacat ataaaccaaa      1140
     caaaaaagag aaacttatgc ttaacttact agataaatgc agcgaattaa atcttttagt      1200
     ttatgaagac ttagtagcta actgtcctga acttttgctt atgcttgaag gtcaaccagg      1260
     tggggcacgc ttaatagaac aagtcctagg catgcaccat attaatgttt gctctaactt      1320
     tactgctctt agttatctct ttcaccttta ccctggcaca accttatctt cagataacaa      1380
     ggctttgcag ctgttgttga tacaaggtta caacccatta atggttggtc acgccttgtg      1440
     ctgtgtactc aacaagcaat ttggcaaaca aaacactgtt tgcttttatg gaccagcttc      1500
     tactggtaaa acaaacatgg caaaggccat agtccaaggc attagactat atggctgtgt      1560
     taatcattta aacaaagggt ttgtctttaa tgattgcaga caacgcctag ttgtttggtg      1620
     ggaggagtgc ttaatgcacc aggattgggt ggaaccagca aagtgtatct tgggtggaac      1680
     tgagtgtaga attgacgtca aacacagaga tagtgtatta ttgacacaaa ctccagtaat      1740
     tatttccact aaccacgata tctacgcggt tgttggtggt aattctgttt ctcatgttca      1800
     tgcggctcca ttaaaagaaa gagtgattca gctaaatttt atgaaacaac ttcctcaaac      1860
     atttggagag atcactccag aagaaattgc agctctactg caatggtgtt tcaatgagta      1920
     cgaatgtact ctgacaggct ttaaaacaaa atggagccta gataaaattc caaactcatt      1980
     tcctcttggg gtcctttgtc ctactcattc acaggacttc atactccacg aaaacggata      2040
     ctgcactgat tgtggtggtt accttgctca tagcgctgac gattctgtgt acactgatcg      2100
     tgcaagcgac actagcaaag aagccatcga cgcaggtaag tttacgttct ccaagcactt      2160
     tatatatatc ctatacacaa cactaaaaca tatgtttaat tacaggtgac ttgggggata      2220
     cggacggaga ggactccgag tcagaagcat cggaagtggg tgttcgtcca tccaagaagc      2280
     gacgcataac tattcctgca actccaccaa attctcctgg cagctctgtg agtacttctg      2340
     ccttctttga taattggtgc gcacaaccgc gagacgaaga tgagctcagg gaatacgaaa      2400
     gacaagcatc gcgcctacaa aagaaaaggg agtccaggga gagacgagag gaaacgccca      2460
     tggcaacctc atcacaggag tcggagtcgg agcccaatcc gacgcagtgg ggagacaagc      2520
     tcggggtcat accgtcagga acaccagatc agccacctat cgtcttgcac tgcttcgaag      2580
     atctcagacc cagtgacgaa gacgaaggag aatacatcgg gaaagagaga ctctagaact      2640
     aatccataca ctgtattcag ccagcataaa gcctcacatc ctgatgctcc aggatggtgt      2700
     gggttctatt ggcattctac tagaattgct agaaatggta ctaatgcaat ctttaatgaa      2760
     atgaaacagc agttccaaca actgcagcta gacaacaaaa ttggctggga tagtgctaga      2820
     gaattattgt ttagtcagaa aaaatcacta gatcaacaat acagaaatat gttctggcac      2880
     tttagaaatg cttctgattg tgaacgttgt aattactggg acaatgtata ccgtatgcac      2940
     ttagctcatg tttcctctca gacagaatca gaagaaataa ctgacgagga aatgctttct      3000
     gctgctgaaa gtatggaaac agatgcctcc aattaaaagg caacctggag ggtgggtgct      3060
     tcctggttac aaataccttg gtccatttaa tcctcttgat aacggtgaac cagttaataa      3120
     agctgatcgt gctgctcaat ctcatgataa atcatattct gaattaataa aaagtggaaa      3180
     aaatccttac ttatatttca ataaagctga tgaaaaattc attgacgatt tgaaaaacga      3240
     ctggtctctt ggtggcatta ttggctcaag tttctttaaa cttaagcgcg ccgtggctcc      3300
     tgctctaggg aataaagagc gagctcaaaa aagacacttt tattttgcaa actcaaataa      3360
     aggtgctaaa aaatcaaaaa acaacgaacc taaaccaagc acctcaaaaa tgtctgaaaa      3420
     tgaaattcaa gaccaacagc catcagaacc taatgatggc caacgaggag ggggaggagg      3480
     tgcgaccggc agtgtgggag gggggaaagg ttctggtgtg ggtatatcca caggtggatg      3540
     ggtaggaggc agctacttta ctgactccta tgtaataaca aaaaacacca gacaatttct      3600
     ggttaaaatc caaaacaacc atcaatataa aactgaaagt ataattcctt ccaatggagg      3660
     aggaaaatca caaagatgtg tcagcacacc atggtcatac tttaacttta atcaatacag      3720
     cagtcatttc tcaccacagg actggcagcg cctaacaaat gaatacaaaa gattcagacc      3780
     taaaggtatg catgttaaaa tctacaattt acaaataaaa cagattttat caaatggtgc      3840
     tgatgttaca tacaacaacg acctaacagc aggagtacac atcttttgtg atggcgaaca      3900
     tgcatatcca aacgctacac atccatggga cgaagatgta atgccagaac ttccttacca      3960
     aacatggtat ctgtttcaat atggatacat acctaccatt catgaacttg cagaaatgga      4020
     agactccaat gcagtagaaa aagcaattgc tttacagata ccattcttca tgcttgaaaa      4080
     cagcgaccat gaagttctaa gaactggaga aagtgcagaa tttaacttca actttgactg      4140
     tgaatggatt aacaatgaaa gagcattcat tcctccagga ctgatgttta atccattggt      4200
     accaacaaga agagctcaat acatacgaag aaatggaaac actcaagcaa gtacatcacg      4260
     agttcaaccc tatgctaaac ctacaagctg gatgactggg ccaggtttac tcagtgcaca      4320
     acgagtaggt ccagctgctt ctgacacagc tgcatggatg gttggtgtag atccagaagg      4380
     cgcaaacatc aactcaggaa gagcaggagt tagcagtgga tttgatcctc cagctggatc      4440
     actcagacct acagatctag aatacaaagt acaatggtac caaactccag ctggaacaaa      4500
     caacgatgga aacatcattt caaatccacc tttatcaatg cttagagatc aaactctcta      4560
     cagaggaaac caaacaacct acaacttatg ctcagatgta tggatgtttc caaatcaaat      4620
     ttgggacaga tacccagtaa caagagaaaa tcctatttgg tgcaaacaac caagatcaga      4680
     caaacacaca acaattgatc cttttgacgg atcaatagcc atggatcatc caccaggcac      4740
     aattttcatc aaaatggcaa aaattccagt tccttcaaac aacaacgcag actcatactt      4800
     aaacatctac tgcactggac aagtcagctg cgaaattgtc tgggaagtcg aaagatatgc      4860
     aacaaagaac tggagaccag aaagaagaca cacagcactc ggccttggaa ttggaggggc      4920
     agatgaaatc aacccaacat accatgttga caaaaacgga gcatacattc aacctacaac      4980
     atgggacatg tgctttccag ttaaaacaaa catcaataaa gtgttgtaat ctcttaagcc      5040
     tctttattgc ttacgcttgt aagttcctct ccaatggaca agtggaaaga aaagggtgac      5100
     tgtaatcccg agctcatgag ttcgaggcta cagtccgatg gcagtggcgt tgccgtctcg      5160
     aacctagccg ttacaccctt gtgcattgtg ggaggagctg ttttgcttac gcaaccgcga      5220
     aactctatat cttttaatgt gt                                                5242
//