ID AF043303; SV 1; linear; genomic DNA; STD; VRL; 4679 BP. XX AC AF043303; XX DT 24-FEB-1998 (Rel. 54, Created) DT 22-MAY-2010 (Rel. 104, Last updated, Version 4) XX DE Adeno-associated virus 2, complete genome. XX KW . XX OS Adeno-associated virus - 2 OC Viruses; ssDNA viruses; Parvoviridae; Parvovirinae; Dependovirus. XX RN [1] RP 1-4679 RX PUBMED; 7996133. RA Ruffing M., Heid H., Kleinschmidt J.A.; RT "Mutations in the carboxy terminus of adeno-associated virus 2 capsid RT proteins affect viral infectivity: lack of an RGD integrin-binding motif"; RL J. Gen. Virol. 75(Pt 12):3385-3392(1994). XX RN [2] RP 1-4679 RA Berns K.I., Bohenzky R.A., Cassinotti P., Colvin D., Donahue B.A., Dull T., RA Horer M., Kleinschmidt J.A., Ruffing M., Snyder R.O., Tratschin J.-D., RA Weitz M.; RT ; RL Submitted (15-JAN-1998) to the INSDC. RL Cell Genesys Inc., 342 Lakeside Dr., Foster City, CA 94404, USA XX DR EPD; EP07161; AAV2_COA3. DR EPD; EP07162; AAV2_VNCA. DR EPD; EP07163; AAV2_19. XX FH Key Location/Qualifiers FH FT source 1..4679 FT /organism="Adeno-associated virus - 2" FT /mol_type="genomic DNA" FT /note="changes relative to the original sequence, GenBank FT Accession Number J01901, have been detected and verified by FT several different laboratories" FT /db_xref="taxon:10804" FT repeat_region 1..145 FT /rpt_type=INVERTED FT /note="inverted terminal repeat" FT misc_feature 42..83 FT /note="flip oriented DNA" FT precursor_RNA 287..4451 FT CDS join(321..1906,2228..2252) FT /codon_start=1 FT /product="Rep 68 protein" FT /db_xref="GOA:O56650" FT /db_xref="HSSP:1M55" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="UniProtKB/TrEMBL:O56650" FT /protein_id="AAC03774.1" FT /translation="MPGFYEIVIKVPSDLDEHLPGISDSFVNWVAEKEWELPPDSDMDL FT NLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMV FT LGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPKTQP FT ELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPVIRSKTSA FT RYMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSLTKTAP FT DYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFGPATTGK FT TNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKAILGGSKV FT RVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFELTRRLDHDF FT GKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEPKRVRESVAQPST FT SDAEASINYADRLARGHSL" FT CDS 321..2186 FT /codon_start=1 FT /product="Rep 78 protein" FT /db_xref="GOA:O56651" FT /db_xref="HSSP:1M55" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="InterPro:IPR014835" FT /db_xref="UniProtKB/TrEMBL:O56651" FT /protein_id="AAC03775.1" FT /translation="MPGFYEIVIKVPSDLDEHLPGISDSFVNWVAEKEWELPPDSDMDL FT NLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMV FT LGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPKTQP FT ELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPVIRSKTSA FT RYMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSLTKTAP FT DYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFGPATTGK FT TNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKAILGGSKV FT RVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFELTRRLDHDF FT GKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEPKRVRESVAQPST FT SDAEASINYADRYQNKCSRHVGMNLMLFPCRQCERMNQNSNICFTHGQKDCLECFPVSE FT SQPVSVVKKAYQKLCYIHHIMGKVPDACTACDLVNVDLDDCIFEQ" FT variation 370 FT /replace="g" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT precursor_RNA 878..4451 FT CDS join(993..1906,2228..2252) FT /codon_start=1 FT /product="Rep 40 protein" FT /db_xref="GOA:Q77XY0" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="UniProtKB/TrEMBL:Q77XY0" FT /protein_id="AAC03776.1" FT /translation="MELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDN FT AGKIMSLTKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRN FT TIWLFGPATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVV FT ESAKAILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMF FT KFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEP FT KRVRESVAQPSTSDAEASINYADRLARGHSL" FT CDS 993..2186 FT /codon_start=1 FT /product="Rep 52 protein" FT /db_xref="GOA:Q77XX9" FT /db_xref="InterPro:IPR001257" FT /db_xref="InterPro:IPR014015" FT /db_xref="UniProtKB/TrEMBL:Q77XX9" FT /protein_id="AAC03777.1" FT /translation="MELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDN FT AGKIMSLTKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRN FT TIWLFGPATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVV FT ESAKAILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMF FT KFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEP FT KRVRESVAQPSTSDAEASINYADRYQNKCSRHVGMNLMLFPCRQCERMNQNSNICFTHG FT QKDCLECFPVSESQPVSVVKKAYQKLCYIHHIMGKVPDACTACDLVNVDLDDCIFEQ" FT precursor_RNA 1853..4451 FT intron 1907..2227 FT CDS 2203..4410 FT /codon_start=1 FT /product="major coat protein VP1" FT /db_xref="GOA:P03135" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="PDB:1LP3" FT /db_xref="UniProtKB/Swiss-Prot:P03135" FT /protein_id="AAC03780.1" FT /translation="MAADGYLPDWLEDTLSEGIRQWWKLKPGPPPPKPAERHKDDSRGL FT VLPGYKYLGPFNGLDKGEPVNEADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERL FT KEDTSFGGNLGRAVFQAKKRVLEPLGLVEEPVKTAPGKKRPVEHSPVEPDSSSGTGKAG FT QQPARKRLNFGQTGDADSVPDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADGVG FT NSSGNWHCDSTWMGDRVITTSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGY FT FDFNRFHCHFSPRDWQRLINNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTV FT QVFTDSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFP FT SQMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQS FT RLQFSQAGASDIRDQSRNWLPGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLV FT NPGPAMASHKDDEEKFFPQSGVLIFGKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQY FT GSVSTNLQRGNRQAATADVNTQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMG FT GFGLKHPPPQILIKNTPVPANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRW FT NPEIQYTSNYNKSVNVDFTVDTNGVYSEPRPIGTRYLTRNL" FT variation 2429 FT /replace="ta" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT CDS 2614..4410 FT /codon_start=1 FT /transl_except=(pos:2614..2616,aa:Met) FT /product="major coat protein VP2" FT /db_xref="GOA:P03135" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="PDB:1LP3" FT /db_xref="UniProtKB/Swiss-Prot:P03135" FT /protein_id="AAC03778.1" FT /translation="MAPGKKRPVEHSPVEPDSSSGTGKAGQQPARKRLNFGQTGDADSV FT PDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVIT FT TSTRTWALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLI FT NNNWGFRPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQ FT GCLPPFPADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDV FT PFHSSYAHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNW FT LPGPCYRQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPGPAMASHKDDEEKFFPQ FT SGVLIFGKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQYGSVSTNLQRGNRQAATADV FT NTQGVLPGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVP FT ANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVNVDFT FT VDTNGVYSEPRPIGTRYLTRNL" FT CDS 2729..3343 FT /codon_start=1 FT /product="assembly activating protein AAP" FT /db_xref="UniProtKB/TrEMBL:D5SGZ8" FT /protein_id="ADH10168.1" FT /translation="METQTQYLTPSLSDSHQQPPLVWELIRWLQAVAHQWQTITRAPTE FT WVIPREIGIAIPHGWATESSPPAPEPGPCPPTTTTSTNKFPANQEPRTTITTLATAPLG FT GILTSTDSTATFHHVTGKDSSTTTGDSDPRDSTSSSLTFKSKRSRRMTVRRRLPITLPA FT RFRCLLTRSTSSRTSSARRIKDASRRSQQTSSWCHSMDTSP" FT CDS 2809..4410 FT /codon_start=1 FT /product="major coat protein VP3" FT /db_xref="GOA:P03135" FT /db_xref="InterPro:IPR001403" FT /db_xref="InterPro:IPR013607" FT /db_xref="InterPro:IPR016184" FT /db_xref="PDB:1LP3" FT /db_xref="UniProtKB/Swiss-Prot:P03135" FT /protein_id="AAC03779.1" FT /translation="MATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVITTSTRTW FT ALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNWGF FT RPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQGCLPPF FT PADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFHSSY FT AHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWLPGPCY FT RQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPGPAMASHKDDEEKFFPQSGVLIF FT GKQGSEKTNVDIEKVMITDEEEIRTTNPVATEQYGSVSTNLQRGNRQAATADVNTQGVL FT PGMVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPSTT FT FSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVNVDFTVDTNGV FT YSEPRPIGTRYLTRNL" FT variation 2877 FT /replace="c" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 3759..3765 FT /replace="g" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 3859 FT /replace="a" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 3898 FT /replace="" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 3900..3902 FT /replace="gaac" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 4232..4233 FT /replace="acg" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 4329..4330 FT /replace="tcg" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 4336 FT /replace="" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 4341 FT /replace="c" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT variation 4347 FT /replace="t" FT /note="compared to sequence of GenBank Accession Number FT J01901" FT repeat_region 4535..4679 FT /rpt_type=INVERTED FT /note="inverted terminal repeat" FT misc_feature 4597..4638 FT /note="flop oriented DNA" XX SQ Sequence 4679 BP; 1198 A; 1262 C; 1255 G; 964 T; 0 other; ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc 60 cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg 120 gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag 180 ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat 240 gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga 300 ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg 360 accttgacga gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg 420 aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga 480 ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc 540 cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc 600 tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg 660 aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg 720 tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc 780 ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac 840 agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga 900 cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc 960 cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca 1020 aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca 1080 atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta 1140 tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt 1200 ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt 1260 ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg 1320 ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct 1380 acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg 1440 tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc 1500 tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga 1560 ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga 1620 ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc 1680 tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa 1740 aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa 1800 gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc 1860 agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat 1920 gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga 1980 atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg 2040 tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc 2100 atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt 2160 tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat 2220 cttccagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa 2280 cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg 2340 cttcctgggt acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac 2400 gaggcagacg ccgcggccct cgagcacgac aaagcctacg accggcagct cgacagcgga 2460 gacaacccgt acctcaagta caaccacgcc gacgcggagt ttcaggagcg ccttaaagaa 2520 gatacgtctt ttgggggcaa cctcggacga gcagtcttcc aggcgaaaaa gagggttctt 2580 gaacctctgg gcctggttga ggaacctgtt aagacggctc cgggaaaaaa gaggccggta 2640 gagcactctc ctgtggagcc agactcctcc tcgggaaccg gaaaggcggg ccagcagcct 2700 gcaagaaaaa gattgaattt tggtcagact ggagacgcag actcagtacc tgacccccag 2760 cctctcggac agccaccagc agccccctct ggtctgggaa ctaatacgat ggctacaggc 2820 agtggcgcac caatggcaga caataacgag ggcgccgacg gagtgggtaa ttcctcggga 2880 aattggcatt gcgattccac atggatgggc gacagagtca tcaccaccag cacccgaacc 2940 tgggccctgc ccacctacaa caaccacctc tacaaacaaa tttccagcca atcaggagcc 3000 tcgaacgaca atcactactt tggctacagc accccttggg ggtattttga cttcaacaga 3060 ttccactgcc acttttcacc acgtgactgg caaagactca tcaacaacaa ctggggattc 3120 cgacccaaga gactcaactt caagctcttt aacattcaag tcaaagaggt cacgcagaat 3180 gacggtacga cgacgattgc caataacctt accagcacgg ttcaggtgtt tactgactcg 3240 gagtaccagc tcccgtacgt cctcggctcg gcgcatcaag gatgcctccc gccgttccca 3300 gcagacgtct tcatggtgcc acagtatgga tacctcaccc tgaacaacgg gagtcaggca 3360 gtaggacgct cttcatttta ctgcctggag tactttcctt ctcagatgct gcgtaccgga 3420 aacaacttta ccttcagcta cacttttgag gacgttcctt tccacagcag ctacgctcac 3480 agccagagtc tggaccgtct catgaatcct ctcatcgacc agtacctgta ttacttgagc 3540 agaacaaaca ctccaagtgg aaccaccacg cagtcaaggc ttcagttttc tcaggccgga 3600 gcgagtgaca ttcgggacca gtctaggaac tggcttcctg gaccctgtta ccgccagcag 3660 cgagtatcaa agacatctgc ggataacaac aacagtgaat actcgtggac tggagctacc 3720 aagtaccacc tcaatggcag agactctctg gtgaatccgg gcccggccat ggcaagccac 3780 aaggacgatg aagaaaagtt ttttcctcag agcggggttc tcatctttgg gaagcaaggc 3840 tcagagaaaa caaatgtgga cattgaaaag gtcatgatta cagacgaaga ggaaatcagg 3900 acaaccaatc ccgtggctac ggagcagtat ggttctgtat ctaccaacct ccagagaggc 3960 aacagacaag cagctaccgc agatgtcaac acacaaggcg ttcttccagg catggtctgg 4020 caggacagag atgtgtacct tcaggggccc atctgggcaa agattccaca cacggacgga 4080 cattttcacc cctctcccct catgggtgga ttcggactta aacaccctcc tccacagatt 4140 ctcatcaaga acaccccggt acctgcgaat ccttcgacca ccttcagtgc ggcaaagttt 4200 gcttccttca tcacacagta ctccacggga caggtcagcg tggagatcga gtgggagctg 4260 cagaaggaaa acagcaaacg ctggaatccc gaaattcagt acacttccaa ctacaacaag 4320 tctgttaatg tggactttac tgtggacact aatggcgtgt attcagagcc tcgccccatt 4380 ggcaccagat acctgactcg taatctgtaa ttgcttgtta atcaataaac cgtttaattc 4440 gtttcagttg aactttggtc tctgcgtatt tctttcttat ctagtttcca tggctacgta 4500 gataagtagc atggcgggtt aatcattaac tacaaggaac ccctagtgat ggagttggcc 4560 actccctctc tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc 4620 ccgggctttg cccgggcggc ctcagtgagc gagcgagcgc gcagagaggg agtggccaa 4679 //