ID DQ644558; SV 1; circular; genomic DNA; STD; VRL; 2556 BP. XX AC DQ644558; XX DT 14-MAY-2008 (Rel. 95, Created) DT 03-JUN-2009 (Rel. 101, Last updated, Version 3) XX DE Merremia mosaic virus isolate PR4-H6 segment DNA-A, complete sequence. XX KW . XX OS Merremia mosaic virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2556 RA Idris A., Brown J.; RT "Molecular and biological characterization of Merremia mosaic virus: a RT bipartite begomovirus from Puerto Rico"; RL Unpublished. XX RN [2] RP 1-2556 RA Idris A., Brown J.; RT ; RL Submitted (18-MAY-2006) to the INSDC. RL Plant Sciences, University of Arizona, Tucson, AZ 85721, USA XX DR MD5; ac9ba353d083818ca4afca39f7107546. XX FH Key Location/Qualifiers FH FT source 1..2556 FT /organism="Merremia mosaic virus" FT /segment="DNA-A" FT /isolate="PR4-H6" FT /mol_type="genomic DNA" FT /country="Puerto Rico" FT /db_xref="taxon:77813" FT CDS 141..887 FT /codon_start=1 FT /product="coat protein" FT /note="AV1" FT /db_xref="GOA:I6LGT9" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:I6LGT9" FT /protein_id="ABG90896.1" FT /translation="MVKRDAPWRLMAGTTKVSRNANFSPRGGMGPKAAAWVNRPMYRKP FT RIYRTLRGPDVPKGCEGPCKVQSFEQRHDISHVGKVICISDVTRGNGITHRVGKRFCVK FT SVYILGKIWMDENIKLKNHTNSVMFWLIRDRRPYGTPMDFGQVFNMYDNEPSTATVKND FT LRDRFQVMHRFYAKVTGGQYASNEQALVRRFWKVNNYVVYNHQEAGKYENHTENALLLY FT MACTHASNPVYATLKIRSYFYDSISN" FT CDS complement(884..1282) FT /codon_start=1 FT /product="AC3" FT /db_xref="GOA:Q59DM9" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:Q59DM9" FT /protein_id="ABG90899.1" FT /translation="MDSRTGEPITAHQAMNGVFIWEVPNPLYFKIIQVEEPIYTTTRIY FT TIQIRFNYNLRKALSLHKAYLNFQIWTTSVQASGTTYLNRFKDLVLMYLDQLGVVSLNN FT VIRAVRFATDKPYVNCVLERHSIKFNLY" FT CDS complement(1029..1460) FT /codon_start=1 FT /product="AC2" FT /db_xref="GOA:Q59DM8" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:Q59DM8" FT /protein_id="ABG90898.1" FT /translation="MKTHLSERGRSTMQNSSSSTPPSIKAQHRRAKRSKSVIRRRRLDL FT DCGCSIYVHINCRNHGFTHRGTHHCSSSNEWRFYLGGSKSPLFQDNPSGGANIHHNQNI FT HHPNTVQLQPEEGVESTQSIPELPNLDDISSSFWDDIFK" FT CDS complement(1345..2421) FT /codon_start=1 FT /product="AC1" FT /note="replication-associated protein" FT /db_xref="GOA:I6LGU0" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:I6LGU0" FT /protein_id="ABG90897.1" FT /translation="MPRKGSFSIKAKNYFLTYPICSLAKEEALSQIKALHTPVNKKFIK FT ICRELHDNGEPHLHVLIQFEGKYNCTNNRFFDLVSPTRSAHFHPNIQGAKSSSDVKSYI FT DKDGDTIEWGQFQIDGRSARGGQQSSNDTYAKALNAASAEEALQIIKEEQPQHFFLQHH FT NLVANATRIFQKSPEPWVPPFQLSSFTNVPDEMQEWADNYFGRGAAARADRPISIIIEG FT DSRTGKTMWARSLGKHNYLSGHLDFNGRVYSNDVEYNVIDDISPNYLKLKHWKELIGAQ FT KDWQSNCKYGKPVQIKGGIPSIVLCNPGEGASYKDFLDKDENASLRAWTIHNAKFIFLN FT SPLYQSTAQESEEIQICH" XX SQ Sequence 2556 BP; 647 A; 525 C; 587 G; 797 T; 0 other; accggatggc cgcccgccgc gcccccctgg gcccacatat taaagccgtc caatcacaaa 60 gcgtcctgga agtctaattg tttaaaataa gcctataaat acattggagt ccgtctatac 120 cccaccaact ttaatttaaa atggttaaga gggacgcccc atggcgttta atggcgggga 180 ccactaaagt tagtcgcaac gccaatttct cgccacgtgg aggtatgggc cctaaggccg 240 ctgcttgggt taacaggccc atgtacagga agcccagaat ttatcgcact ttgagagggc 300 ctgatgttcc taaaggttgt gaaggcccat gtaaggtaca gtctttcgag cagcgtcatg 360 atatttctca tgttggtaag gtaatctgta tatccgatgt aactcgtggt aacggtatta 420 ctcaccgtgt tggcaagcgt ttttgtgtga agtctgtgta tattctaggt aaaatatgga 480 tggatgagaa cataaagctg aagaaccaca cgaacagcgt catgttttgg ttgattcgtg 540 acaggagacc ctatggtacc cctatggatt ttggtcaggt gtttaacatg tatgacaatg 600 agccgagtac tgctaccgtc aagaacgatc ttcgcgatcg atttcaagtc atgcataggt 660 tctatgccaa agtaactggt ggtcagtatg ccagtaacga gcaggcattg gttcggcgat 720 tttggaaggt taataactac gtcgtgtata accatcagga agcaggaaaa tacgagaatc 780 acacggagaa tgctctgtta ttgtatatgg catgtactca tgcttctaat cctgtgtatg 840 ctaccttgaa aattcgtagt tatttttatg actccatttc gaattaataa agattaaatt 900 ttattgaatg tctttcgagc acacaattta catatggttt atccgttgcg aaacgaacag 960 ctctaatgac attgttaagc gaaacaacac ctaattgatc taaatacatt aaaactaaat 1020 ctttaaatct atttaaatat gtcgtcccag aagcttgaac tgatgtcgtc cagatttgga 1080 agttcaggta tgctttgtgt agactcaacg ccttcctcag gttgtagttg aaccgtattt 1140 ggatggtgta tattctggtt gtggtgtata ttggctcctc cacttggatt atcttgaaat 1200 agaggggatt tggaacctcc cagataaaaa cgccattcat tgcttgatga gcagtgatgg 1260 gttcccctgt gcgtgaatcc atggtttctg cagttgatgt gtacgtaaat tgaacagcca 1320 cagtccaggt ctaaccttct ccgtctaatg acagatttgg atctcttcgc tctcctgtgc 1380 tgtgctttga tagagggggg agttgaggaa gatgaatttt gcattgtgga tcgtccacgc 1440 tctgagagat gcgttttcat ctttatcgag gaagtcttta tagctagccc cctctcctgg 1500 attgcacagc acgattgagg gtattcctcc tttaatttga actggcttcc cgtatttaca 1560 gttggactgc cagtcctttt gggcccctat caattctttc caatgcttta atttcaaata 1620 attagggctt atgtcatcaa tgacgttata ttcgacgtca ttcgaataga ctctgccatt 1680 aaagtcaaga tgtccactaa gataattatg tttacctaat gaacgggccc acattgtttt 1740 tccagttcga ctatctcctt cgatgatgat acttatcggt ctatctgccc gcgcagcggc 1800 acccctccca aagtagttgt ctgcccattc ttgcatctca tctggaacgt tcgtgaagga 1860 ggagagttga aacggaggaa cccatggttc tggagacttc tgaaatattc ttgttgcgtt 1920 cgcaacgaga ttgtgatgtt gaagaaagaa gtgttgtggt tgttcctcct ttattatttg 1980 cagtgcttcc tctgcagaag ctgcgtttaa cgcctttgcg tatgtatcgt tagaagactg 2040 ctgacctcct ctagcagatc ttccgtcgat ctggaattgt ccccattcaa ttgtatctcc 2100 gtccttgtcg atgtaggact tgacatcgga gctggattta gctccctgaa tgttcggatg 2160 gaaatgtgct gacctggttg gggataccaa atcgaagaat ctgttattcg tgcagttgta 2220 ttttccttcg aactggataa gcacatgaag atgaggttcc ccattatcgt gaagctctct 2280 acagatcttg atgaattttt tgtttacagg ggtgtgcaga gctttgattt gggacagtgc 2340 ttcttctttg gctaatgaac atatagggta tgtgaggaaa tagtttttgg cttttattga 2400 gaatgaaccc ttccgtggca tttttgtaat aagggatgtt cccccaattg ctccgctctc 2460 aaaactctat atgaatcggg ggaactgggg gtacatttat actagaactc tcattaaagg 2520 gatttgcaac acgtggcggc catccgctat aatatt 2556 //