ID   J01901; SV 1; linear; genomic DNA; STD; VRL; 4675 BP.
XX
AC   J01901; M12405; M12468-M12469;
XX
DT   13-JUN-1985 (Rel. 06, Created)
DT   17-APR-2005 (Rel. 83, Last updated, Version 5)
XX
DE   Adeno-associated virus 2, complete genome.
XX
KW   alternative splicing; complete genome; major coat protein.
XX
OS   Adeno-associated virus - 2
OC   Viruses; Parvoviridae; Parvovirinae; Dependoparvovirus.
XX
RN   [1]
RP   4532-4675
RX   DOI; 10.1016/0092-8674(83)90342-2.
RX   PUBMED; 6088052.
RA   Samulski R.J., Srivastava A., Berns K.I., Muzyczka N.;
RT   "Rescue of adeno-associated virus from recombinant plasmids: gene
RT   correction within the terminal repeats of AAV";
RL   Cell 33(1):135-143(1983).
XX
RN   [2]
RP   1-4675
RX   PUBMED; 6300419.
RA   Srivastava A., Lusby E.W., Berns K.I.;
RT   "Nucleotide sequence and organization of the adeno-associated virus 2
RT   genome";
RL   J. Virol. 45(2):555-564(1983).
XX
DR   MD5; 172f635e1cc2a863b1c485effc769cb0.
DR   EPD; EP07161; AAV2_COA3.
DR   EPD; EP07162; AAV2_VNCA.
DR   EPD; EP07163; AAV2_19.
DR   EuropePMC; PMC110121; 9733829.
DR   EuropePMC; PMC115897; 11222696.
DR   EuropePMC; PMC275469; 14576307.
DR   GOA; D5SGZ8.
DR   UniProtKB/Swiss-Prot; D5SGZ8; AAP_AAV2S.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..4675
FT                   /organism="Adeno-associated virus - 2"
FT                   /mol_type="genomic DNA"
FT                   /db_xref="taxon:10804"
FT   repeat_region   1..145
FT                   /note="5' inverted terminal repeat"
FT   misc_feature    42..83
FT                   /note="flip oriented DNA"
FT   mRNA            287..4447
FT                   /note="major coat protein A mRNA (alt.)"
FT   CDS             join(321..1906,2228..2252)
FT                   /codon_start=1
FT                   /note="major coat protein A' (alt.)"
FT                   /db_xref="GOA:P03132"
FT                   /db_xref="InterPro:IPR001257"
FT                   /db_xref="InterPro:IPR014015"
FT                   /db_xref="InterPro:IPR014835"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="PDB:1S9H"
FT                   /db_xref="PDB:1U0J"
FT                   /db_xref="PDB:4ZO0"
FT                   /db_xref="PDB:4ZQ9"
FT                   /db_xref="PDB:5DCX"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03132"
FT                   /protein_id="AAA42372.1"
FT                   /translation="MPGFYEIVIKVPSDLDGHLPGISDSFVNWVAEKEWELPPDSDMDL
FT                   NLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMV
FT                   LGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPKTQP
FT                   ELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPVIRSKTSA
FT                   RYMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSLTKTAP
FT                   DYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFGPATTGK
FT                   TNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKAILGGSKV
FT                   RVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFELTRRLDHDF
FT                   GKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEPKRVRESVAQPST
FT                   SDAEASINYADRLARGHSL"
FT   CDS             321..2186
FT                   /codon_start=1
FT                   /note="major coat protein A"
FT                   /db_xref="GOA:Q89268"
FT                   /db_xref="InterPro:IPR001257"
FT                   /db_xref="InterPro:IPR014015"
FT                   /db_xref="InterPro:IPR014835"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="PDB:5BYG"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q89268"
FT                   /protein_id="AAA42374.1"
FT                   /translation="MPGFYEIVIKVPSDLDGHLPGISDSFVNWVAEKEWELPPDSDMDL
FT                   NLIEQAPLTVAEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMV
FT                   LGRFLSQIREKLIQRIYRGIEPTLPNWFAVTKTRNGAGGGNKVVDECYIPNYLLPKTQP
FT                   ELQWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPVIRSKTSA
FT                   RYMELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDNAGKIMSLTKTAP
FT                   DYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRNTIWLFGPATTGK
FT                   TNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVVESAKAILGGSKV
FT                   RVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMFKFELTRRLDHDF
FT                   GKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEPKRVRESVAQPST
FT                   SDAEASINYADRYQNKCSRHVGMNLMLFPCRQCERMNQNSNICFTHGQKDCLECFPVSE
FT                   SQPVSVVKKAYQKLCYIHHIMGKVPDACTACDLVNVDLDDCIFEQ"
FT   mRNA            873..4447
FT                   /note="major coat protein A mRNA (alt.)"
FT   CDS             join(993..1906,2228..2252)
FT                   /codon_start=1
FT                   /note="major coat protein Aa (alt.)"
FT                   /db_xref="GOA:Q89269"
FT                   /db_xref="InterPro:IPR001257"
FT                   /db_xref="InterPro:IPR014015"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q89269"
FT                   /protein_id="AAA42373.1"
FT                   /translation="MELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDN
FT                   AGKIMSLTKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRN
FT                   TIWLFGPATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVV
FT                   ESAKAILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMF
FT                   KFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEP
FT                   KRVRESVAQPSTSDAEASINYADRLARGHSL"
FT   CDS             993..2186
FT                   /codon_start=1
FT                   /note="major coat protein A'' (alt.)"
FT                   /db_xref="GOA:Q89270"
FT                   /db_xref="InterPro:IPR001257"
FT                   /db_xref="InterPro:IPR014015"
FT                   /db_xref="InterPro:IPR027417"
FT                   /db_xref="UniProtKB/Swiss-Prot:Q89270"
FT                   /protein_id="AAA42375.1"
FT                   /translation="MELVGWLVDKGITSEKQWIQEDQASYISFNAASNSRSQIKAALDN
FT                   AGKIMSLTKTAPDYLVGQQPVEDISSNRIYKILELNGYDPQYAASVFLGWATKKFGKRN
FT                   TIWLFGPATTGKTNIAEAIAHTVPFYGCVNWTNENFPFNDCVDKMVIWWEEGKMTAKVV
FT                   ESAKAILGGSKVRVDQKCKSSAQIDPTPVIVTSNTNMCAVIDGNSTTFEHQQPLQDRMF
FT                   KFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRPAPSDADISEP
FT                   KRVRESVAQPSTSDAEASINYADRYQNKCSRHVGMNLMLFPCRQCERMNQNSNICFTHG
FT                   QKDCLECFPVSESQPVSVVKKAYQKLCYIHHIMGKVPDACTACDLVNVDLDDCIFEQ"
FT   mRNA            1853..4447
FT                   /note="major coat protein B mRNA (alt.)"
FT   intron          1907..2227
FT                   /note="major coat protein A intron"
FT   CDS             2810..4324
FT                   /codon_start=1
FT                   /note="major coat protein B"
FT                   /db_xref="GOA:P03135"
FT                   /db_xref="InterPro:IPR001403"
FT                   /db_xref="InterPro:IPR013607"
FT                   /db_xref="InterPro:IPR016184"
FT                   /db_xref="InterPro:IPR036952"
FT                   /db_xref="PDB:1LP3"
FT                   /db_xref="PDB:3J1S"
FT                   /db_xref="PDB:3J4P"
FT                   /db_xref="PDB:5IPI"
FT                   /db_xref="PDB:5IPK"
FT                   /db_xref="PDB:6CBE"
FT                   /db_xref="PDB:6E9D"
FT                   /db_xref="PDB:6IH9"
FT                   /db_xref="PDB:6IHB"
FT                   /db_xref="PDB:6NZ0"
FT                   /db_xref="UniProtKB/Swiss-Prot:P03135"
FT                   /protein_id="AAA42376.1"
FT                   /translation="MATGSGAPMADNNEGADGVGNSSGNWHCDSTWMGDRVITTSTRTW
FT                   ALPTYNNHLYKQISSQSGASNDNHYFGYSTPWGYFDFNRFHCHFSPRDWQRLINNNWGF
FT                   RPKRLNFKLFNIQVKEVTQNDGTTTIANNLTSTVQVFTDSEYQLPYVLGSAHQGCLPPF
FT                   PADVFMVPQYGYLTLNNGSQAVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFHSSY
FT                   AHSQSLDRLMNPLIDQYLYYLSRTNTPSGTTTQSRLQFSQAGASDIRDQSRNWLPGPCY
FT                   RQQRVSKTSADNNNSEYSWTGATKYHLNGRDSLVNPAMASHKDDEEKFFPQSGVLIFGK
FT                   QGSEKTNVNIEKVMITDEEEIGTTNPVATEQYGSVSTNLQRGNRQAATADVNTQGVLPG
FT                   MVWQDRDVYLQGPIWAKIPHTDGHFHPSPLMGGFGLKHPPPQILIKNTPVPANPSTTFS
FT                   AAKFASFITQYSTGHGQRGDRVGAAEGKQQTLESRNSVHFQLQQVC"
FT   repeat_region   4531..4675
FT                   /note="3' inverted terminal repeat"
FT   misc_feature    4592..4634
FT                   /note="flop oriented DNA"
XX
SQ   Sequence 4675 BP; 1198 A; 1262 C; 1251 G; 964 T; 0 other;
     ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc        60
     cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg       120
     gccaactcca tcactagggg ttcctggagg ggtggagtcg tgacgtgaat tacgtcatag       180
     ggttagggag gtcctgtatt agaggtcacg tgagtgtttt gcgacatttt gcgacaccat       240
     gtggtcacgc tgggtattta agcccgagtg agcacgcagg gtctccattt tgaagcggga       300
     ggtttgaacg cgcagccgcc atgccggggt tttacgagat tgtgattaag gtccccagcg       360
     accttgacgg gcatctgccc ggcatttctg acagctttgt gaactgggtg gccgagaagg       420
     aatgggagtt gccgccagat tctgacatgg atctgaatct gattgagcag gcacccctga       480
     ccgtggccga gaagctgcag cgcgactttc tgacggaatg gcgccgtgtg agtaaggccc       540
     cggaggccct tttctttgtg caatttgaga agggagagag ctacttccac atgcacgtgc       600
     tcgtggaaac caccggggtg aaatccatgg ttttgggacg tttcctgagt cagattcgcg       660
     aaaaactgat tcagagaatt taccgcggga tcgagccgac tttgccaaac tggttcgcgg       720
     tcacaaagac cagaaatggc gccggaggcg ggaacaaggt ggtggatgag tgctacatcc       780
     ccaattactt gctccccaaa acccagcctg agctccagtg ggcgtggact aatatggaac       840
     agtatttaag cgcctgtttg aatctcacgg agcgtaaacg gttggtggcg cagcatctga       900
     cgcacgtgtc gcagacgcag gagcagaaca aagagaatca gaatcccaat tctgatgcgc       960
     cggtgatcag atcaaaaact tcagccaggt acatggagct ggtcgggtgg ctcgtggaca      1020
     aggggattac ctcggagaag cagtggatcc aggaggacca ggcctcatac atctccttca      1080
     atgcggcctc caactcgcgg tcccaaatca aggctgcctt ggacaatgcg ggaaagatta      1140
     tgagcctgac taaaaccgcc cccgactacc tggtgggcca gcagcccgtg gaggacattt      1200
     ccagcaatcg gatttataaa attttggaac taaacgggta cgatccccaa tatgcggctt      1260
     ccgtctttct gggatgggcc acgaaaaagt tcggcaagag gaacaccatc tggctgtttg      1320
     ggcctgcaac taccgggaag accaacatcg cggaggccat agcccacact gtgcccttct      1380
     acgggtgcgt aaactggacc aatgagaact ttcccttcaa cgactgtgtc gacaagatgg      1440
     tgatctggtg ggaggagggg aagatgaccg ccaaggtcgt ggagtcggcc aaagccattc      1500
     tcggaggaag caaggtgcgc gtggaccaga aatgcaagtc ctcggcccag atagacccga      1560
     ctcccgtgat cgtcacctcc aacaccaaca tgtgcgccgt gattgacggg aactcaacga      1620
     ccttcgaaca ccagcagccg ttgcaagacc ggatgttcaa atttgaactc acccgccgtc      1680
     tggatcatga ctttgggaag gtcaccaagc aggaagtcaa agactttttc cggtgggcaa      1740
     aggatcacgt ggttgaggtg gagcatgaat tctacgtcaa aaagggtgga gccaagaaaa      1800
     gacccgcccc cagtgacgca gatataagtg agcccaaacg ggtgcgcgag tcagttgcgc      1860
     agccatcgac gtcagacgcg gaagcttcga tcaactacgc agacaggtac caaaacaaat      1920
     gttctcgtca cgtgggcatg aatctgatgc tgtttccctg cagacaatgc gagagaatga      1980
     atcagaattc aaatatctgc ttcactcacg gacagaaaga ctgtttagag tgctttcccg      2040
     tgtcagaatc tcaacccgtt tctgtcgtca aaaaggcgta tcagaaactg tgctacattc      2100
     atcatatcat gggaaaggtg ccagacgctt gcactgcctg cgatctggtc aatgtggatt      2160
     tggatgactg catctttgaa caataaatga tttaaatcag gtatggctgc cgatggttat      2220
     cttccagatt ggctcgagga cactctctct gaaggaataa gacagtggtg gaagctcaaa      2280
     cctggcccac caccaccaaa gcccgcagag cggcataagg acgacagcag gggtcttgtg      2340
     cttcctgggt acaagtacct cggacccttc aacggactcg acaagggaga gccggtcaac      2400
     gaggcagacg ccgcggccct cgagcacgta caaagcctac gaccggcagc tcgacagcgg      2460
     agacaacccg tacctcaagt acaaccacgc cgacgcggag tttcaggagc gccttaaaga      2520
     agatacgtct tttgggggca acctcggacg agcagtcttc caggcgaaaa agagggttct      2580
     tgaacctctg ggcctggttg aggaacctgt taagacggct ccgggaaaaa agaggccggt      2640
     agagcactct cctgtggagc cagactcctc ctcgggaacc ggaaaggcgg gccagcagcc      2700
     tgcaagaaaa agattgaatt ttggtcagac tggagacgca gactcagtac ctgaccccca      2760
     gcctctcgga cagccaccag cagccccctc tggtctggga actaatacga tggctacagg      2820
     cagtggcgca ccaatggcag acaataacga gggcgccgac ggagtgggta attcctccgg      2880
     aaattggcat tgcgattcca catggatggg cgacagagtc atcaccacca gcacccgaac      2940
     ctgggccctg cccacctaca acaaccacct ctacaaacaa atttccagcc aatcaggagc      3000
     ctcgaacgac aatcactact ttggctacag caccccttgg gggtattttg acttcaacag      3060
     attccactgc cacttttcac cacgtgactg gcaaagactc atcaacaaca actggggatt      3120
     ccgacccaag agactcaact tcaagctctt taacattcaa gtcaaagagg tcacgcagaa      3180
     tgacggtacg acgacgattg ccaataacct taccagcacg gttcaggtgt ttactgactc      3240
     ggagtaccag ctcccgtacg tcctcggctc ggcgcatcaa ggatgcctcc cgccgttccc      3300
     agcagacgtc ttcatggtgc cacagtatgg atacctcacc ctgaacaacg ggagtcaggc      3360
     agtaggacgc tcttcatttt actgcctgga gtactttcct tctcagatgc tgcgtaccgg      3420
     aaacaacttt accttcagct acacttttga ggacgttcct ttccacagca gctacgctca      3480
     cagccagagt ctggaccgtc tcatgaatcc tctcatcgac cagtacctgt attacttgag      3540
     cagaacaaac actccaagtg gaaccaccac gcagtcaagg cttcagtttt ctcaggccgg      3600
     agcgagtgac attcgggacc agtctaggaa ctggcttcct ggaccctgtt accgccagca      3660
     gcgagtatca aagacatctg cggataacaa caacagtgaa tactcgtgga ctggagctac      3720
     caagtaccac ctcaatggca gagactctct ggtgaatccg gccatggcaa gccacaagga      3780
     cgatgaagaa aagttttttc ctcagagcgg ggttctcatc tttgggaagc aaggctcaga      3840
     gaaaacaaat gtgaacattg aaaaggtcat gattacagac gaagaggaaa tcggaacaac      3900
     caatcccgtg gctacggagc agtatggttc tgtatctacc aacctccaga gaggcaacag      3960
     acaagcagct accgcagatg tcaacacaca aggcgttctt ccaggcatgg tctggcagga      4020
     cagagatgtg taccttcagg ggcccatctg ggcaaagatt ccacacacgg acggacattt      4080
     tcacccctct cccctcatgg gtggattcgg acttaaacac cctcctccac agattctcat      4140
     caagaacacc ccggtacctg cgaatccttc gaccaccttc agtgcggcaa agtttgcttc      4200
     cttcatcaca cagtactcca cgggacacgg tcagcgtgga gatcgagtgg gagctgcaga      4260
     aggaaaacag caaacgctgg aatcccgaaa ttcagtacac ttccaactac aacaagtctg      4320
     ttaatcgtgg acttaccgtg gatactaatg gcgtgtattc agagcctcgc cccattggca      4380
     ccagatacct gactcgtaat ctgtaattgc ttgttaatca ataaaccgtt taattcgttt      4440
     cagttgaact ttggtctctg cgtatttctt tcttatctag tttccatggc tacgtagata      4500
     agtagcatgg cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc      4560
     cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg      4620
     gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag agagggagtg gccaa           4675
//