ID AF513851; SV 1; linear; genomic DNA; STD; VRL; 4721 BP. XX AC AF513851; XX DT 04-SEP-2002 (Rel. 73, Created) DT 15-APR-2005 (Rel. 83, Last updated, Version 3) XX DE Adeno-associated virus 7 nonstructural protein and capsid protein genes, DE complete cds. XX KW . XX OS Adeno-associated virus - 7 OC Viruses; Parvoviridae; Parvovirinae; Dependoparvovirus. XX RN [1] RP 1-4721 RX DOI; 10.1073/pnas.182412299. RX PUBMED; 12192090. RA Gao G.P., Alvira M.R., Wang L., Calcedo R., Johnston J., Wilson J.M.; RT "Novel adeno-associated viruses from rhesus monkeys as vectors for human RT gene therapy"; RL Proc. Natl. Acad. Sci. U.S.A. 99(18):11854-11859(2002). XX RN [2] RP 1-4721 RA Alvira M.R.; RT ; RL Submitted (20-MAY-2002) to the INSDC. RL Institute for Human Gene Therapy, University of Pennsylvania, M6.40 Maloney RL Bldg, 36th & Spruce Sts, Philadelphia, PA 19104, USA XX DR MD5; 0ed68532f27563aedc1d10b2119ff03f. DR EuropePMC; PMC129358; 12192090. DR EuropePMC; PMC2293646; 18035387. DR EuropePMC; PMC2663820; 19381259. DR EuropePMC; PMC6279885; 30534580. XX FH Key Location/Qualifiers FH FT source 1..4721 FT /organism="Adeno-associated virus - 7" FT /host="rhesus monkey" FT /mol_type="genomic DNA" FT /proviral FT /db_xref="taxon:202812" FT CDS 334..2205 FT /codon_start=1 FT /product="nonstructural protein" FT /note="similar to AAV2 Rep 78 protein" FT /db_xref="GOA:Q8JQG1" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="InterPro:IPR027417" FT /db_xref="UniProtKB/TrEMBL:Q8JQG1" FT /protein_id="AAN03854.1" FT /translation="MPGFYEIVIKVPSDLDEHLPGISDSFVNWVAEKEWELPPDSDMDL FT NLIEQAPLTVAEKLQRDFLVQWRRVSKAPEALFFVQFEKGESYFHLHVLVETTGVKSMV FT LGRFLSQIREKLVQTIYRGVEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPKTQP FT ELQWAWTNMEEYISACLNLAERKRLVAQHLTHVSQTQEQNKENLNPNSDAPVIRSKTSA FT RYMELVGWLVDRGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMALTKSAP FT DYLVGPSLPADIKTNRIYRILELNGYDPAYAGSVFLGWAQKKFGKRNTIWLFGPATTGK FT TNIAEAIAHAVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKAILGGSKV FT RVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFELTRRLEHDF FT GKVTKQEVKEFFRWASDHVTEVAHEFYVRKGGASKRPAPDDADISEPKRACPSVADPST FT SDAEGAPVDFADRYQNKCSRHAGMIQMLFPCKTCERMNQNFNICFTHGVRDCLECFPGV FT SESQPVVRKKTYRKLCAIHHLLGRAPEIACSACDLVNVDLDDCVSEQ" FT CDS 2222..4435 FT /codon_start=1 FT /product="capsid protein" FT /note="similar to AAV2 VP1 protein" FT /db_xref="GOA:Q8JQG0" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="InterPro:IPR036952" FT /db_xref="UniProtKB/TrEMBL:Q8JQG0" FT /protein_id="AAN03855.1" FT /translation="MAADGYLPDWLEDNLSEGIREWWDLKPGAPKPKANQQKQDNGRGL FT VLPGYKYLGPFNGLDKGEPVNAADAAALEHDKAYDQQLKAGDNPYLRYNHADAEFQERL FT QEDTSFGGNLGRAVFQAKKRVLEPLGLVEEGAKTAPAKKRPVEPSPQRSPDSSTGIGKK FT GQQPARKRLNFGQTGDSESVPDPQPLGEPPAAPSSVGSGTVAAGGGAPMADNNEGADGV FT GNASGNWHCDSTWLGDRVITTSTRTWALPTYNNHLYKQISSETAGSTNDNTYFGYSTPW FT GYFDFNRFHCHFSPRDWQRLINNNWGFRPKKLRFKLFNIQVKEVTTNDGVTTIANNLTS FT TIQVFSDSEYQLPYVLGSAHQGCLPPFPADVFMIPQYGYLTLNNGSQSVGRSSFYCLEY FT FPSQMLRTGNNFEFSYSFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLARTQSNPGGTA FT GNRELQFYQGGPSTMAEQAKNWLPGPCFRQQRVSKTLDQNNNSNFAWTGATKYHLNGRN FT SLVNPGVAMATHKDDEDRFFPSSGVLIFGKTGATNKTTLENVLMTNEEEIRPTNPVATE FT EYGIVSSNLQAANTAAQTQVVNNQGALPGMVWQNRDVYLQGPIWAKIPHTDGNFHPSPL FT MGGFGLKHPPPQILIKNTPVPANPPEVFTPAKFASFITQYSTGQVSVEIEWELQKENSK FT RWNPEIQYTSNFEKQTGVDFAVDSQGVYSEPRPIGTRYLTRNL" XX SQ Sequence 4721 BP; 1108 A; 1405 C; 1301 G; 907 T; 0 other; ttggccactc cctctatgcg cgctcgctcg ctcggtgggg cctgcggacc aaaggtccgc 60 agacggcaga gctctgctct gccggcccca ccgagcgagc gagcgcgcat agagggagtg 120 gccaactcca tcactagggg taccgcgaag cgcctcccac gctgccgcgt cagcgctgac 180 gtaaatcacg tcatagggga gtggtcctgt attagctgtc acgtgagtgc ttttgcgaca 240 ttttgcgaca ccacgtggcc atttgaggta tatatggccg agtgagcgag caggatctcc 300 attttgaccg cgaaatttga acgagcagca gccatgccgg gtttctacga gatcgtgatc 360 aaggtgccga gcgacctgga cgagcacctg ccgggcattt ctgactcgtt tgtgaactgg 420 gtggccgaga aggaatggga gctgcccccg gattctgaca tggatctgaa tctgatcgag 480 caggcacccc tgaccgtggc cgagaagctg cagcgcgact tcctggtcca atggcgccgc 540 gtgagtaagg ccccggaggc cctgttcttt gttcagttcg agaagggcga gagctacttc 600 caccttcacg ttctggtgga gaccacgggg gtcaagtcca tggtgctagg ccgcttcctg 660 agtcagattc gggagaagct ggtccagacc atctaccgcg gggtcgagcc cacgctgccc 720 aactggttcg cggtgaccaa gacgcgtaat ggcgccggcg gggggaacaa ggtggtggac 780 gagtgctaca tccccaacta cctcctgccc aagacccagc ccgagctgca gtgggcgtgg 840 actaacatgg aggagtatat aagcgcgtgt ttgaacctgg ccgaacgcaa acggctcgtg 900 gcgcagcacc tgacccacgt cagccagacg caggagcaga acaaggagaa tctgaacccc 960 aattctgacg cgcccgtgat caggtcaaaa acctccgcgc gctacatgga gctggtcggg 1020 tggctggtgg accggggcat cacctccgag aagcagtgga tccaggagga ccaggcctcg 1080 tacatctcct tcaacgccgc ctccaactcg cggtcccaga tcaaggccgc gctggacaat 1140 gccggcaaga tcatggcgct gaccaaatcc gcgcccgact acctggtggg gccctcgctg 1200 cccgcggaca ttaaaaccaa ccgcatctac cgcatcctgg agctgaacgg gtacgatcct 1260 gcctacgccg gctccgtctt tctcggctgg gcccagaaaa agttcgggaa gcgcaacacc 1320 atctggctgt ttgggcccgc caccaccggc aagaccaaca ttgcggaagc catcgcccac 1380 gccgtgccct tctacggctg cgtcaactgg accaatgaga actttccctt caacgattgc 1440 gtcgacaaga tggtgatctg gtgggaggag ggcaagatga cggccaaggt cgtggagtcc 1500 gccaaggcca ttctcggcgg cagcaaggtg cgcgtggacc aaaagtgcaa gtcgtccgcc 1560 cagatcgacc ccacccccgt gatcgtcacc tccaacacca acatgtgcgc cgtgattgac 1620 gggaacagca ccaccttcga gcaccagcag ccgttgcagg accggatgtt caaatttgaa 1680 ctcacccgcc gtctggagca cgactttggc aaggtgacga agcaggaagt caaagagttc 1740 ttccgctggg ccagtgatca cgtgaccgag gtggcgcatg agttctacgt cagaaagggc 1800 ggagccagca aaagacccgc ccccgatgac gcggatataa gcgagcccaa gcgggcctgc 1860 ccctcagtcg cggatccatc gacgtcagac gcggaaggag ctccggtgga ctttgccgac 1920 aggtaccaaa acaaatgttc tcgtcacgcg ggcatgattc agatgctgtt tccctgcaaa 1980 acgtgcgaga gaatgaatca gaatttcaac atttgcttca cacacggggt cagagactgt 2040 ttagagtgtt tccccggcgt gtcagaatct caaccggtcg tcagaaaaaa gacgtatcgg 2100 aaactctgcg cgattcatca tctgctgggg cgggcgcccg agattgcttg ctcggcctgc 2160 gacctggtca acgtggacct ggacgactgc gtttctgagc aataaatgac ttaaaccagg 2220 tatggctgcc gatggttatc ttccagattg gctcgaggac aacctctctg agggcattcg 2280 cgagtggtgg gacctgaaac ctggagcccc gaaacccaaa gccaaccagc aaaagcagga 2340 caacggccgg ggtctggtgc ttcctggcta caagtacctc ggacccttca acggactcga 2400 caagggggag cccgtcaacg cggcggacgc agcggccctc gagcacgaca aggcctacga 2460 ccagcagctc aaagcgggtg acaatccgta cctgcggtat aaccacgccg acgccgagtt 2520 tcaggagcgt ctgcaagaag atacgtcatt tgggggcaac ctcgggcgag cagtcttcca 2580 ggccaagaag cgggttctcg aacctctcgg tctggttgag gaaggcgcta agacggctcc 2640 tgcaaagaag agaccggtag agccgtcacc tcagcgttcc cccgactcct ccacgggcat 2700 cggcaagaaa ggccagcagc ccgccagaaa gagactcaat ttcggtcaga ctggcgactc 2760 agagtcagtc cccgaccctc aacctctcgg agaacctcca gcagcgccct ctagtgtggg 2820 atctggtaca gtggctgcag gcggtggcgc accaatggca gacaataacg aaggtgccga 2880 cggagtgggt aatgcctcag gaaattggca ttgcgattcc acatggctgg gcgacagagt 2940 cattaccacc agcacccgaa cctgggccct gcccacctac aacaaccacc tctacaagca 3000 aatctccagt gaaactgcag gtagtaccaa cgacaacacc tacttcggct acagcacccc 3060 ctgggggtat tttgacttta acagattcca ctgccacttc tcaccacgtg actggcagcg 3120 actcatcaac aacaactggg gattccggcc caagaagctg cggttcaagc tcttcaacat 3180 ccaggtcaag gaggtcacga cgaatgacgg cgttacgacc atcgctaata accttaccag 3240 cacgattcag gtattctcgg actcggaata ccagctgccg tacgtcctcg gctctgcgca 3300 ccagggctgc ctgcctccgt tcccggcgga cgtcttcatg attcctcagt acggctacct 3360 gactctcaac aatggcagtc agtctgtggg acgttcctcc ttctactgcc tggagtactt 3420 cccctctcag atgctgagaa cgggcaacaa ctttgagttc agctacagct tcgaggacgt 3480 gcctttccac agcagctacg cacacagcca gagcctggac cggctgatga atcccctcat 3540 cgaccagtac ttgtactacc tggccagaac acagagtaac ccaggaggca cagctggcaa 3600 tcgggaactg cagttttacc agggcgggcc ttcaactatg gccgaacaag ccaagaattg 3660 gttacctgga ccttgcttcc ggcaacaaag agtctccaaa acgctggatc aaaacaacaa 3720 cagcaacttt gcttggactg gtgccaccaa atatcacctg aacggcagaa actcgttggt 3780 taatcccggc gtcgccatgg caactcacaa ggacgacgag gaccgctttt tcccatccag 3840 cggagtcctg atttttggaa aaactggagc aactaacaaa actacattgg aaaatgtgtt 3900 aatgacaaat gaagaagaaa ttcgtcctac taatcctgta gccacggaag aatacgggat 3960 agtcagcagc aacttacaag cggctaatac tgcagcccag acacaagttg tcaacaacca 4020 gggagcctta cctggcatgg tctggcagaa ccgggacgtg tacctgcagg gtcccatctg 4080 ggccaagatt cctcacacgg atggcaactt tcacccgtct cctttgatgg gcggctttgg 4140 acttaaacat ccgcctcctc agatcctgat caagaacact cccgttcccg ctaatcctcc 4200 ggaggtgttt actcctgcca agtttgcttc gttcatcaca cagtacagca ccggacaagt 4260 cagcgtggaa atcgagtggg agctgcagaa ggaaaacagc aagcgctgga acccggagat 4320 tcagtacacc tccaactttg aaaagcagac tggtgtggac tttgccgttg acagccaggg 4380 tgtttactct gagcctcgcc ctattggcac tcgttacctc acccgtaatc tgtaattgca 4440 tgttaatcaa taaaccggtt gattcgtttc agttgaactt tggtctcctg tgcttcttat 4500 cttatcggtt tccatagcaa ctggttacac attaactgct tgggtgcgct tcacgataag 4560 aacactgacg tcaccgcggt acccctagtg atggagttgg ccactccctc tatgcgcgct 4620 cgctcgctcg gtggggcctg cggaccaaag gtccgcagac ggcagagctc tgctctgccg 4680 gccccaccga gcgagcgagc gcgcatagag ggagtggcca a 4721 //