ID AY514632; SV 1; linear; genomic DNA; STD; VRL; 2747 BP. XX AC AY514632; XX DT 25-JAN-2004 (Rel. 78, Created) DT 24-MAR-2005 (Rel. 83, Last updated, Version 2) XX DE Tomato yellow leaf curl Thailand virus strain Sakon Nakhon segment A, DE complete sequence. XX KW . XX OS Tomato yellow leaf curl Thailand virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2747 RX DOI; 10.1016/j.virusres.2004.10.001. RX PUBMED; 15826907. RA Sawangjit S., Chatchawankanphanich O., Chiemsombat P., Attathom T., RA Dale J., Attathom S.; RT "Molecular characterization of tomato-infecting begomoviruses in Thailand"; RL Virus Res. 109(1):1-8(2005). XX RN [2] RP 1-2747 RA Sawangjit S., Chatchawankanphanich O., Chiemsombat P., Attathom T., RA Dale J.L., Attathom S.; RT ; RL Submitted (30-DEC-2003) to the INSDC. RL Plant Pathology, Kasetsart University, Malaiman, Kamphaengsaen, Nakhon RL Pathom 73140, Thailand XX DR MD5; d4f9b84ee803cc1021a3f7e241a392c6. DR EuropePMC; PMC2224568; 17977971. XX FH Key Location/Qualifiers FH FT source 1..2747 FT /organism="Tomato yellow leaf curl Thailand virus" FT /segment="A" FT /strain="Sakon Nakhon" FT /mol_type="genomic DNA" FT /country="Thailand" FT /db_xref="taxon:85752" FT gene 275..631 FT /gene="AV2" FT CDS 275..631 FT /codon_start=1 FT /gene="AV2" FT /product="precoat protein" FT /db_xref="GOA:Q6R4C9" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:Q6R4C9" FT /protein_id="AAR98596.1" FT /translation="MWDPLLNEFPENVHGFRCMLAVKYLQAVEKTYSPDTLGFDLIRDL FT ISVIRAKNYVEASSRYNHFHARLEGTSPSELRQPICEPCCCPHCPRHKSKIMDEQAHEQ FT KAQDVQDVQKSRCS" FT gene 435..1205 FT /gene="AV1" FT CDS 435..1205 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:Q6R4C8" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:Q6R4C8" FT /protein_id="AAR98595.1" FT /translation="MSKRPADIIISTPASKVRRRLNFDSPYVSRAAAPTVRVTKARSWT FT NRPMNRKPKMYRMYRSPDVPRGCEGPCKVQSFDAKNDIGHMGKVLCLSDVTRGIGLTHR FT VGKRFCVKSLYFVGKIWMDENIKVKNHTNTVLFWIVRDRRPTGTPYDFQQVFNVYDNEP FT STATVKNDQRDRFQVIRRFQATVTGGQYAAKEQAIIRKFYRVNNYVVYNHQEAGKYENH FT TENALLLYMACTHASNPVYATLKVRSYFYDSVTN" FT gene complement(1202..1606) FT /gene="AC3" FT CDS complement(1202..1606) FT /codon_start=1 FT /gene="AC3" FT /product="REn protein" FT /db_xref="GOA:Q6R4C7" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:Q6R4C7" FT /protein_id="AAR98599.1" FT /translation="MDSRTGELLTATQSESGVYIWTVKNPLYFKITRHLESPFQRNHDI FT ITLQIQFNYNLRKALGIHKCFLVCKIWTHLHPQTSRFLRVFKYQCIKYLDRLGVISINN FT VIRAISHILYNVLEGTICVIEKHDIKFNIY" FT gene complement(1347..1751) FT /gene="AC2" FT CDS complement(1347..1751) FT /codon_start=1 FT /gene="AC2" FT /product="TrAP protein" FT /db_xref="GOA:Q6R4C6" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:Q6R4C6" FT /protein_id="AAR98598.1" FT /translation="MRSSSPSKAHSTQVPIKVQHRIAKRTTRRRRVDLPCGCSYFVAIG FT CHNNGFTHRGTTHCNSIREWRIYLDGQKSPVFQDNQAPREPIPEEPRHNHVTNPVQLQP FT EESVGDTQMFSSLQNLDPFTSSDLAFLKSI" FT gene complement(1654..2739) FT /gene="AC1" FT CDS complement(1654..2739) FT /codon_start=1 FT /gene="AC1" FT /product="Rep protein" FT /db_xref="GOA:Q6R4C5" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:Q6R4C5" FT /protein_id="AAR98597.1" FT /translation="MAPPNKFRINAKHYFLTYPHCSLTKEEALSQISALSTPTNKLFIR FT ICRELHEDGSPHLHVLIQFEGKFKCQNNRFFDLTSPSRPTHFHPNIQGAKSSTDVKAYM FT EKDGDVLDHGVFQIDGRSARGGCQSANDAYAEAINSGSKAAALNILKEKAPKDFVLQFH FT NLNSNLDRIFAPPIEVFVCPFLSSSFDQVPEELEEWVSENVSGAAARPWRPKSIVIEGD FT SRTGKTMWARSLGPHNYLCGHLDLSPKAYNNDAWFNVIDDVDPHYLKHFKEFMGAQGDW FT QSNTKYGKPVQIKGGIPTIFLCNPGPNSSYKEYLEEEKNSALRNWAIKNAIFVTLQGPL FT YSSTYQGATPNSQEDNQTTES" FT gene complement(2289..2582) FT /gene="AC4" FT CDS complement(2289..2582) FT /codon_start=1 FT /gene="AC4" FT /product="AC4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:Q6R4C4" FT /protein_id="AAR98600.1" FT /translation="MGLLTCMSSSSSKANSSAKTTDSSISPPQAGQHISIRTFRELKAQ FT QMLKHTWKKTETCLIMEFSKSMEDQLEEVANLPTTHMPRQSIQGPKQRPSIY" XX SQ Sequence 2747 BP; 734 A; 564 C; 608 G; 841 T; 0 other; ggtcaatcgg tgtctctcaa acttggctat gcaatcggtg tctggtgtct tatttatacc 60 tagacaccaa atgggattat tggtatttag tacatgaaat tcaaaattca aaatccaatc 120 gtgggcatcc gtataatatt accggatggc cgcgattttt tttaaagggg gccccttgat 180 gtgacgtctc atccaataag aacgctccct caaagcttaa ttatttatgg tcccctatat 240 aagacttaat ccccaagttt cggcgaaatt caaaatgtgg gatccactcc taaacgaatt 300 tcctgaaaac gtccacggtt tccgttgtat gctagccgtg aagtatctgc aagcggtcga 360 gaagacctat tcccctgata ccctagggtt tgatctcatc cgtgatctca tcagtgtaat 420 tcgtgcgaag aactatgtcg aagcgtccag cagatataat catttccacg cccgcctcga 480 aggtacgtcg ccgtctgaac ttcgacagcc catatgtgag ccgtgctgct gcccccactg 540 tccgcgtcac aaaagcaaga tcatggacga acaggcccat gaacagaaag cccaagatgt 600 acaggatgta cagaagtcca gatgttccta gaggatgtga aggcccatgt aaggttcaat 660 cgtttgatgc taagaacgat attggtcaca tgggcaaggt cttatgtttg tccgacgtta 720 cccgtggtat tgggcttacc catcgagttg gcaagcgttt ctgtgtcaag tcgctttatt 780 ttgtcgggaa gatctggatg gatgaaaata ttaaggttaa gaatcacact aacaccgttt 840 tattctggat agttagggat cggcgtccta ctggaacgcc ttatgatttt cagcaggtct 900 ttaatgtata tgataatgaa cctagcactg ctactgtgaa aaacgatcag cgtgatcgtt 960 tccaggttat aaggaggttc caggcaactg ttactggtgg acaatatgca gctaaggagc 1020 aggcgattat tagaaagttt tatcgtgtta acaattatgt agtttacaat caccaggaag 1080 ctgggaagta cgagaaccat actgaaaatg ctctgttgtt gtatatggca tgtactcatg 1140 cctctaatcc tgtgtatgct actttgaaag tcaggagtta tttctacgac tcagtgacga 1200 attaataaat attaaatttt atatcatgtt tttcaattac acaaattgtt ccttctaata 1260 cattgtacaa tatatgagat attgccctaa ttacattgtt tatactaatc acgcctaatc 1320 tatctaaata tttaatacat tgatatttaa atactcttaa gaaacgcgag gtctgaggat 1380 gtaaatgggt ccagattttg cagactagaa aacatttgtg tatccccaac gctttcctca 1440 ggttgtaatt gaactggatt tgtaacgtga ttatgtcgtg gttcctctgg aatgggctct 1500 ctaggtgcct ggttatcttg aaatacaggg gatttttgac cgtccagata tatacgccac 1560 tctctgattg agttgcagtg agtagttccc cggtgcgtga atccattatt gtgacagcct 1620 atggcgacga agtacgaaca tccacaaggt agatcaactc tccgtcgtct ggttgtcctc 1680 ttggctattc ggtgttgcac cttgataggt acttgagtag agtgggcctt ggagggtgac 1740 gaagatcgca ttctttatag cccagtttct aagtgcggag ttcttttcct cttccaagta 1800 ctctttatag ctggagtttg gtccaggatt gcagaggaag atagtgggaa ttccgccttt 1860 aatttgaact ggcttgccgt atttggtgtt gctttgccag tccccttggg cccccatgaa 1920 ctctttaaag tgttttagat aatgcggatc aacgtcatcg atgacgttaa accacgcatc 1980 attattatac gcttttggac ttaaatctaa atggccacac agataattat gtggtcccaa 2040 tgatctagcc cacattgtct tacccgtacg actatcacct tctatcacaa tactcttggg 2100 tctccaaggc cgcgcagcgg caccactcac attctcagaa acccactctt caagttcttc 2160 tggaacttga tcgaatgaag aagaaagaaa aggacaaaca aacacctcta taggaggtgc 2220 aaaaatcctg tctaaattac tatttaaatt atgaaattgt aaaacaaaat ctttaggagc 2280 cttctccttt aatatattga gggccgctgc tttggaccct gaattgattg cctcggcata 2340 tgcgtcgttg gcagattggc aacctcctct agctgatctt ccatcgattt ggaaaactcc 2400 atgatcaagc acgtctccgt ctttttccat gtatgcttta acatctgttg agcttttagc 2460 tccctgaatg ttcggatgga aatgtgttgg cctgcttggg gaggtgagat cgaagaatct 2520 gttgttttgg cacttgaatt tgccttcgaa ctggatgagg acatgcaggt gaggagaccc 2580 atcttcgtgt agttccctgc aaatacgtat gaataattta ttagtcgggg tagataatgc 2640 tgagatttgg gaaagtgcct cttctttagt aagagagcag tgtgggtatg tgagaaaata 2700 atgcttggca tttattctga atttattagg aggagccatg ttgactt 2747 //