ID GQ338768; SV 1; circular; genomic DNA; STD; VRL; 2746 BP. XX AC GQ338768; XX DT 29-JUL-2009 (Rel. 101, Created) DT 29-JUL-2009 (Rel. 101, Last updated, Version 1) XX DE Tomato leaf curl Vietnam virus isolate Haiduong 122 segment DNA A, complete DE sequence. XX KW . XX OS Tomato leaf curl Vietnam virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2746 RA Thanh T.N., Mukheerjee S.M., Vinh D.N.; RT ; RL Submitted (29-JUN-2009) to the INSDC. RL Plant Molecular Biology, International Centre For Genetic Engineering and RL Biotechnology, Aruna Asaf Ali Marg, New Delhi, Delhi 110067, India XX DR MD5; 59a4b2d6c1f58cb32d2f555d7c54cea3. XX FH Key Location/Qualifiers FH FT source 1..2746 FT /organism="Tomato leaf curl Vietnam virus" FT /segment="DNA A" FT /host="tomato" FT /isolate="Haiduong 122" FT /mol_type="genomic DNA" FT /country="Viet Nam:Binhgiang, Haiduong" FT /collection_date="12-Dec-2007" FT /db_xref="taxon:208177" FT gene 129..479 FT /gene="AV2" FT CDS 129..479 FT /codon_start=1 FT /gene="AV2" FT /product="pre-coat protein" FT /db_xref="GOA:C7F8J6" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:C7F8J6" FT /protein_id="ACT79077.1" FT /translation="MWDPLLNEFPETVHGFRCMLAIKYLQLVENTYSPDTLGYDLIRDL FT ILVIRARDYVEASRRYSHFHSRIQGASPAELRQPVCQPCCCPHCPRHKQKEVMGESAHV FT PQAQDVQNVQKP" FT gene 289..1062 FT /gene="AV1" FT CDS 289..1062 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:C7F8J7" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:C7F8J7" FT /protein_id="ACT79078.1" FT /translation="MSKRPADIVISTPASKVRRRLNFDSPYANRAVAPTVLVTNKRRSW FT VNRPMYRKPKMYRMYKSPDVPRGCEGPCKVQSYEQRHDIAHVGKVICVSDVTRGNGLTH FT RVGKRFCIKSVYVLGKIWMDENIKTKNHTNTVMFFLVRDRRPFGTPQDFGQVFNMYDNE FT PSTATVKNDNRDRFQVLRRFQATVTGGQYASKEQAIVRKFMKVNNHVTYNHQEAAKYDN FT HTENALLLYMACTHASNPVYATLKIRIYFYDSVQN" FT gene complement(1059..1463) FT /gene="AC3" FT CDS complement(1059..1463) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer protein" FT /db_xref="GOA:C7F8J8" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:C7F8J8" FT /protein_id="ACT79079.1" FT /translation="MDSRTGEPITATQAENGVFPWEITNPLYFKITELQNRPFLSNNDI FT VTVRIQFNYNLRKALGMHQCFLDFRIWTNSHLQTGRFLRVFRTQVLKFLNNLGVISLNN FT VIRAVNYVLWDALTQTKYVNSSHVIKFNLY" FT gene complement(1204..1611) FT /gene="AC2" FT CDS complement(1204..1611) FT /codon_start=1 FT /gene="AC2" FT /product="transcription activator protein" FT /note="suppressor" FT /db_xref="GOA:C7F8J9" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:C7F8J9" FT /protein_id="ACT79080.1" FT /translation="MQSSSASPSHSTQVPIKVQHRIAKKKITRRRRIDLPCGCSYFSAI FT NCANHGFTHRGTHHCNSGREWRFSLGDNKSPVFQDNGTTKQTLPVEQRHSDGQDPIQLQ FT PEESTGDAPMFSGLPDMDELTPSDWSFLKSI" FT gene complement(1511..2599) FT /gene="AC1" FT CDS complement(1511..2599) FT /codon_start=1 FT /gene="AC1" FT /product="Rep protein" FT /db_xref="GOA:C7F8K0" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:C7F8K0" FT /protein_id="ACT79081.1" FT /translation="MPPPKKFLINAKNYFLTYPHCSLSKEEALSQFLALTTPTNKLFIR FT ICRELHEDGSPHLHVLIQFEGKFKCQNNRFFDLVSPTRSAHFHPNIQSAKSSTDVKAYM FT DKDGDVLDHGVFQIDGRSARGGCQSANDAYAEAINSGSKATALNILREKAPKDFVLQFH FT NLNSNLDRIFTPPMEVYVSPFSSSSFDRVPEELEEWAAENVVSSAARPLRPISLVIEGD FT SRTGKTMWARSLGPHNYLCGHIDLSPKVYSNDAWYNVIDDVDPHYLKHFKEFMGAQRDW FT QSNTKYGKPVQIKGGIPAIFLCNPGPNSSYKEYLDEEKNNALKSWAIKNAVFVSITEPL FT YSSTYQSPAQDSQEENNQETED" FT gene complement(2149..2442) FT /gene="AC4" FT CDS complement(2149..2442) FT /codon_start=1 FT /gene="AC4" FT /product="AC4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:C6KJG0" FT /protein_id="ACT79082.1" FT /translation="MGLLTCMSSSSSKENSNAKTTDSSISYPQPGQHISIRTFRALKAQ FT QMSKPTWTKTETCLIMEFSRSMEDRLEEVASLPTTHMPRQSIQGPKLRPSIY" XX SQ Sequence 2746 BP; 736 A; 564 C; 608 G; 838 T; 0 other; accggatggt cgcgattttt ttcagtggtc cctccactat tatttgtcgg ccaatacaga 60 cgctccctca gagcttattt atgtaacggt cccctataaa acttggtccc caagtactca 120 ttccaaacat gtgggatccg cttttaaacg agtttccaga aaccgttcac ggttttaggt 180 gtatgctagc gattaaatat ttgcaattag tagaaaatac atattctccc gatacattag 240 ggtacgattt aatacgtgat ttaatcttag tcattcgtgc cagggattat gtcgaagcgt 300 cccgcagata tagtcatttc cactcccgca tccaaggtgc gtcgccggct gaacttcgac 360 agcccgtatg ccaaccgtgc tgttgccccc actgtcctcg tcacaaacaa aaggaggtca 420 tgggtgaatc ggcccatgta ccgcaagccc aagatgtaca gaatgtacaa aagccctgat 480 gttccacgag gatgtgaagg cccatgtaaa gtccagtctt atgaacaacg tcatgatata 540 gcccatgtag ggaaggtaat ttgtgtgtct gatgttacac gtggtaacgg gttgactcat 600 cgtgttggta agaggttctg tattaagtca gtttatgttt tgggtaagat ctggatggat 660 gagaacatta agacgaagaa tcatactaat acggtcatgt tctttttagt acgtgataga 720 agaccttttg gaactcccca agattttggt caggtgttta acatgtatga taacgagcca 780 agtactgcca cggtgaagaa cgataacaga gatcgttttc aagttcttcg tcgatttcag 840 gcaactgtta ctggtggtca gtatgcaagc aaggagcaag caatagttag gaaatttatg 900 aaggtgaaca accatgtgac gtataatcat caagaggctg ctaaatatga taaccacaca 960 gagaatgctc tgttattgta tatggcatgt actcatgcta gtaacccagt gtatgctacg 1020 ttgaaaatca ggatctattt ctatgattct gttcagaatt aataaagatt aaattttatt 1080 acatgagaac tgtttacata ttttgtttgc gttaatgcgt cccataatac ataattgact 1140 gctctaatca cattattcaa actaattaca cccaaattat tgagaaattt caaaacttgt 1200 gtcctaaata ctcttaagaa acgaccagtc tgaaggtgtg agttcgtcca tatccggaag 1260 tccagaaaac attggtgcat ccccagtgct ttcctcaggt tgtagttgaa ttggatcctg 1320 accgtcacta tgtcgttgtt cgacaggaag ggtctgtttt gtagttccgt tatcttgaaa 1380 tacaggggat ttgttatctc ccagggaaaa acgccattct ctgcctgagt tgcagtgatg 1440 ggttcccctg tgcgtgaatc catggttggc gcagttaatt gcgctgaaat aagaacaacc 1500 acagggaagg tcaatcctcc gtctcctggt tattttcttc ttggctatcc tgtgctggac 1560 tttgataggt acttgagtag agtggctcgg tgatgctgac gaagactgca ttctttattg 1620 cccacgactt taatgcatta ttcttttcct cgtctaggta ttctttatag cttgagttgg 1680 gccctggatt gcacaggaag atagctggaa tgccaccttt aatttgaacc ggttttccgt 1740 actttgtgtt gctttgccag tccctttggg cccccataaa ttctttgaaa tgctttaggt 1800 agtgcggatc aacgtcatca attacgttat accaggcatc attactgtac acctttggac 1860 tcaaatcaat atgaccacac agataattgt gtggacccaa tgatcgagcc cacatcgtct 1920 tccccgtcct gctatcaccc tctatgacta aacttattgg tcgtaatggc cgcgcagcgg 1980 aactgacgac gttttctgca gcccactctt cgagttcctc tgggactcga tcaaacgaag 2040 aagaagaaaa aggagaaacg taaacctcca ttggaggagt aaaaatccta tctaaattac 2100 tatttaaatt atgaaactgt aaaacaaaat ctttgggagc tttctccctt aatatattga 2160 gggccgtagc tttggaccct gaattgattg cctcggcata tgcgtcgttg gcagactggc 2220 aacctcctct agccgatctt ccatcgatct ggaaaactcc atgatcaagc acgtctccgt 2280 ctttgtccat gtaggctttg acatctgttg agcttttagc gctctgaatg ttcggatgga 2340 aatgtgctga cctggttggg gatacgagat cgaagaatct gttgttttgg catttgaatt 2400 ttccttcgaa ctggatgagg acatgcaggt gaggagaccc atcttcgtgt agttccctgc 2460 agattcgaat gaataattta ttggttggtg tcgttagggc taaaaattgg gaaagtgcct 2520 cttctttgct taatgagcaa tgtgggtatg tgaggaaata attcttggca tttattagaa 2580 attttttggg cggaggcatg ttgacttggt caatcgggtc ctctcaaact tagctatgca 2640 atcggggaat gggtccttat ttatatgtga ggacctaaat ggcacaatcg taaataatca 2700 tatgaatttc aaattcaaat tccaaagcgg ccatccgtat aatatt 2746 //