ID   DQ335246; SV 2; linear; genomic DNA; STD; VRL; 4636 BP.
XX
AC   DQ335246;
XX
DT   23-JAN-2006 (Rel. 86, Created)
DT   21-NOV-2006 (Rel. 89, Last updated, Version 3)
XX
DE   Adeno-associated virus-Go.1, complete genome.
XX
KW   .
XX
OS   Adeno-associated virus-Go.1
OC   Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus.
XX
RN   [1]
RP   1-4636
RX   DOI; 10.1016/j.virol.2006.07.024.
RX   PUBMED; 16926042.
RA   Qiu J., Cheng F., Pintel D.;
RT   "Molecular characterization of caprine adeno-associated virus (AAV-Go.1)
RT   reveals striking similarity to human AAV5";
RL   Virology 356(1-2):208-216(2006).
XX
RN   [2]
RP   1-4636
RA   Qiu J., Pintel D.;
RT   ;
RL   Submitted (16-DEC-2005) to the INSDC.
RL   Dept. of Molecular Microbiology and Immunology, University of
RL   Missouri-Columbia, 1201 Rollins Street, Columbia, MO 65211, USA
XX
RN   [3]
RC   Sequence update by submitter
RP   1-4636
RA   Qiu J., Pintel D.;
RT   ;
RL   Submitted (09-JUN-2006) to the INSDC.
RL   Dept. of Molecular Microbiology and Immunology, University of
RL   Missouri-Columbia, 1201 Rollins Street, Columbia, MO 65211, USA
XX
CC   On Jun 9, 2006 this sequence version replaced gi:85070096.
XX FH Key Location/Qualifiers FH FT source 1..4636 FT /organism="Adeno-associated virus-Go.1" FT /mol_type="genomic DNA" FT /db_xref="taxon:296148" FT CDS 347..2179 FT /codon_start=1 FT /product="Rep78" FT /note="nonstructural protein" FT /db_xref="GOA:Q2LD61" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="UniProtKB/TrEMBL:Q2LD61" FT /protein_id="ABC69725.1" FT /translation="MATFYEVIVRVPFDVEEHLPGISDSFVDWVTGQIWELPPESDLNL FT TLIEQPQLTVADRIRRVFLYEWNKFSKQESKFFVQFEKGSEYFHLHTLVETSGISSMVL FT GRYVSQIRAQLVKVVFQGIEPQINDWVAITKVKKGGANKVVDSGYIPAYLLPKVQPELQ FT WAWTNLDEYKLAALNLEERKRLVAQFLAESSQRSQEAASQREFSADPVIKSKTSQKYMA FT LVNWLVEHGITSEKQWIQENQESYLSFNSTGNSRSQIKAALDNATKIMSLTKSAVDYLV FT GSSVPEDISKNRIWQIFEMNGYDPAYAGSILYGWCQRSFNKRNTVWLYGPATTGKTNIA FT EAIAHTVPFYGCVNWTNENFPFNDCVDKMLIWWEEGKMTNKVVESAKAILGGSKVRVDQ FT KCKSSVQIDSTPVIVTSNTNMCVVVDGNSTTFEHQQPLEDRMFKFELTKRLPPDFGKIT FT KQEVKDFFAWAKVNQVPVTHEFKVPRELAGTKGAEKSLKRPLGDVTNTSYKSPEKRARL FT SFVPETPRSSDVTVDPAPLRPLNWNSRYDCKCDHHAQFDNISDKCDECEYLNRGKNGCI FT CHNVTHCQICHGIPPWEKENLSDFGDFDDANKEQ" FT CDS 2195..4375 FT /codon_start=1 FT /product="VP1" FT /note="capsid protein" FT /db_xref="GOA:Q5XXZ6" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="UniProtKB/TrEMBL:Q5XXZ6" FT /protein_id="ABC69726.1" FT /translation="MSFVDHPPDWLEEVGEGLREFLGLEAGPPKPKPNQQHQDQARGLV FT LPGYNYLGPGNGLDRGEPVNRADEVAREHDISYNEQLEAGDNPYLKYNHADAEFQEKLA FT DDTSFGGNLGKAVFQAKKRVLEPFGLVEEGAKTAPTGKRIDDHFPKRKKARTEEDSKPS FT TSSDAEAGPSGSQQLQIPAQPASSLGADTMSAGGGGPLGDNNQGADGVGNASGDWHCDS FT TWMGDRVVTKSTRTWVLPSYNNHQYREIKSGSVDGSNANAYFGYSTPWGYFDFNRFHSH FT WSPRDWQRLINNYWGFRPRSLRVKIFNIQVKEVTVQDSTTTIANNLTSTVQVFTDDDYQ FT LPYVVGNGTEGCLPAFPPQVFTLPQYGYATLNRDNGDNPTERSSFFCLEYFPSKMLRTG FT NNFEFTYSFEEVPFHCSFAPSQNLFKLANPLVDQYLYRFVSTSATGAIQFQKNLAGRYA FT NTYKNWFPGPMGRTQGWNTSSGSSTNRVSVNNFSVSNRMNLEGASYQVNPQPNGMTNTL FT QGSNRYALENTMIFNAQNATPGTTSVYPEDNLLLTSESETQPVNRVAYNTGGQMATNAQ FT 
NATTAPTVGTYNLQEVLPGSVWMERDVYLQGPIWAKIPETGAHFHPSPAMGGFGLKHPP FT PMMLIKNTPVPGNITSFSDVPVSSFITQYSTGQVTVEMEWELKKENSKRWNPEIQYTNN FT YNDPQFVDFAPDGSGEYRTTRAIGTRYLTRPL" XX SQ Sequence 4636 BP; 1164 A; 1337 C; 1237 G; 898 T; 0 other; ctctcccccc tgtcgcgttc gctcgctcgc tggctcgttt gggggggtgg cagctcaaag 60 agctgccaga cgacggccct ctggccgtcg cccccccaaa cgagccagcg agcgagcgaa 120 cgcgacaggg gggagagtgc cacactctca agcaaggagg ttttgtaagt ggtgatgtca 180 tatagttgtc acgcgatagt taatgattaa cagtcaggtg atgtgtgtta tccaatagga 240 tgaaagcgcg cgcatgagtt ctcgcgagac ttccggggta taaaggggtg agtgaacgag 300 cccgccgcca ttctctgctc tgaactgcta gaggaccctc gctgccatgg ctaccttcta 360 cgaagtcatt gttcgcgtcc catttgacgt ggaggaacat ctgcctggaa tttctgacag 420 ctttgtggac tgggtaactg gtcaaatttg ggagctgcct cccgagtcag atttgaattt 480 gactctgatt gagcagcctc agctgacggt tgctgacaga attcgccgcg tgttcctgta 540 cgagtggaac aaattttcca agcaggaatc caaattcttt gtgcagtttg aaaagggatc 600 tgaatatttt catctgcaca cgcttgtgga gacctccggc atctcttcca tggtcctagg 660 ccgctacgtg agtcagattc gcgcccagct ggtgaaagtg gtcttccagg gaatcgagcc 720 acagatcaac gactgggtcg ccatcaccaa ggtaaagaag ggcggagcca ataaggtggt 780 ggattctggg tatattcccg cctacctgct gccgaaggtc caaccggagc ttcagtgggc 840 gtggacaaac ctggacgagt ataaattggc cgccctgaac ctggaggagc gcaaacggct 900 cgtcgcgcag tttctggcag aatcctcgca gcgctcgcag gaggcggctt cgcagcgtga 960 gttctcggct gacccggtca tcaaaagcaa gacttcccag aaatacatgg cgctcgtcaa 1020 ctggctcgtg gagcacggca tcacttccga gaagcagtgg atccaggaga atcaggagag 1080 ctacctctcc ttcaactcca cgggcaactc tcggagccaa atcaaggccg cgctcgacaa 1140 cgcgaccaaa atcatgagtc tgacaaaaag cgcggtggac tacctcgtgg ggagctccgt 1200 tcccgaggac atttcaaaaa acagaatctg gcaaattttt gagatgaacg gctacgaccc 1260 ggcctacgcg ggatccatcc tctacggctg gtgtcagcgc tccttcaaca agaggaacac 1320 cgtctggctc tacggacccg ccacgaccgg caagaccaac atcgcggagg ccatcgccca 1380 cactgtgccc ttttacggct gcgtgaactg gaccaatgaa aactttccct ttaatgactg 1440 tgtggacaaa atgctcattt ggtgggagga gggaaagatg accaacaagg tggttgaatc 1500 cgccaaggcc 
atcctggggg gctccaaggt gcgggtcgat cagaaatgta aatcctctgt 1560 tcaaattgat tctacccccg tcattgtaac ttccaataca aacatgtgtg tggtggtgga 1620 tgggaattcc acgacctttg aacaccagca gccgctggag gaccgcatgt tcaaatttga 1680 actgactaag cggctcccgc cagattttgg caagattact aagcaggaag tcaaagactt 1740 ttttgcttgg gcaaaggtca atcaggtgcc ggtgactcac gagtttaaag ttcccaggga 1800 attggcggga actaaagggg cggagaaatc tctaaaacgc ccactgggtg acgtcaccaa 1860 tactagctat aaaagtccag agaagcgggc ccggctctca tttgttcccg agacgcctcg 1920 cagttcagac gtgactgtcg atcccgctcc tctgcgaccg ctcaattgga attcaaggta 1980 tgattgcaaa tgtgaccatc atgctcaatt tgacaacatt tctgacaaat gtgatgaatg 2040 tgaatatttg aatcggggca aaaatggatg tatctgtcac aatgtaactc actgtcaaat 2100 ttgtcacggg attcccccct gggagaagga aaacttgtca gattttgggg attttgacga 2160 tgccaataaa gaacagtaaa taaagcgagt agtcatgtct tttgttgatc accctccaga 2220 ttggttggaa gaagttggtg aaggtcttcg cgagtttttg ggccttgaag cgggcccacc 2280 gaaaccgaaa cccaatcagc agcatcaaga tcaagcccgt ggtcttgtgc tgcctggtta 2340 taactatctc ggacccggaa acggtctcga tcgaggagag cctgtcaaca gggcagacga 2400 ggtcgcgcga gagcacgaca tctcgtacaa cgagcagctt gaggcgggag acaaccccta 2460 cctcaagtac aaccacgcgg acgccgagtt tcaggagaag ctcgccgacg acacatcctt 2520 cgggggaaac ctcggaaagg cagtctttca ggccaagaaa agggttctcg aaccttttgg 2580 cctggttgaa gagggtgcta agacggcccc taccggaaag cggatagacg accactttcc 2640 aaaaagaaag aaggctcgga ccgaagagga ctccaagcct tccacctcgt cagacgccga 2700 agctggaccc agcggatccc agcagctgca aatcccagca caaccagcct caagtttggg 2760 agctgataca atgtctgcgg gaggtggcgg cccattgggc gacaataacc aaggtgccga 2820 tggagtgggc aatgcctcgg gagattggca ttgcgattcc acgtggatgg gggacagagt 2880 cgtcaccaag tccacccgca cctgggtgct gcccagctac aacaaccacc agtaccgaga 2940 gatcaaaagc ggctccgtcg acggaagcaa cgccaacgcc tactttggat acagcacccc 3000 ctgggggtac tttgacttta accgcttcca cagccactgg agcccccgag actggcaaag 3060 actcatcaac aactattggg gcttcagacc ccggtctctc agagtcaaaa tcttcaacat 3120 ccaagtcaaa gaggtcacgg tgcaggactc caccaccacc atcgccaaca acctcacctc 3180 caccgtccaa gtgtttacgg 
acgacgacta ccaactcccg tacgtcgtcg gcaacgggac 3240 cgagggatgc ctgccggcct tccccccgca ggtctttacg ctgccgcagt acggctacgc 3300 gacgctgaac cgagacaacg gagacaaccc gacagagcgg agcagcttct tttgcctaga 3360 gtactttccc agcaagatgc tgaggacggg caacaacttt gagtttacct acagctttga 3420 agaggtgccc ttccactgca gcttcgcccc gagccagaac ctctttaagc tggccaaccc 3480 gctggtggac cagtacctgt accgcttcgt gagcacctcg gccacgggcg ccatccagtt 3540 ccaaaagaac ctggcgggca gatacgccaa cacctacaaa aactggttcc cggggcccat 3600 gggccgaacc cagggctgga acacgagctc tggcagcagc accaacagag tcagcgtcaa 3660 caacttttcc gtctcaaacc ggatgaacct ggagggggcc agctaccaag tgaaccccca 3720 gcccaacggg atgacaaaca cgctccaagg cagcaaccgc tacgcgctgg aaaacaccat 3780 gatcttcaac gctcaaaacg ccacgccggg aactacctcg gtgtacccag aggacaatct 3840 actgctgacc agcgagagcg agactcagcc cgtcaaccgg gtggcttaca acacgggcgg 3900 tcagatggcc accaacgccc agaacgccac cacggctccc acggtcggga cctacaacct 3960 ccaggaagtg cttcctggca gcgtatggat ggagagggac gtgtacctcc aaggacccat 4020 ctgggccaag atcccagaga cgggggcgca ctttcacccc tctccggcca tgggcggatt 4080 cggactcaaa cacccgccgc ccatgatgct catcaaaaac acgccggtgc ccggcaacat 4140 caccagcttc tcggacgtgc ccgtcagcag cttcatcacc cagtacagca ccgggcaggt 4200 caccgtggag atggaatggg agctcaaaaa ggaaaactcc aagaggtgga acccagagat 4260 ccagtacacc aacaactaca acgaccccca gtttgtggac tttgctccag acggctccgg 4320 cgaatacaga accaccagag ccatcggaac ccgatacctc acccgacccc tttaacccat 4380 tcatgtcgca taccctcaat aaaccgtgta ttcgtgtcag tgaaatactg cctcttgtgg 4440 tcattcaatg aacatcagct tacaacatct acaaaacccc cttgcttgag agtgtggcac 4500 tctcccccct gtcgcgttcg ctcgctcgct ggctcgtttg ggggggtggc agctcaaaga 4560 gctgccagac gacggccctc tggccgtcgc ccccccaaac gagccagcga gcgagcgaac 4620 gcgacagggg ggagag 4636 //