ID   AY514631; SV 1; linear; genomic DNA; STD; VRL; 2744 BP.
XX
AC   AY514631;
XX
DT   25-JAN-2004 (Rel. 78, Created)
DT   24-MAR-2005 (Rel. 83, Last updated, Version 2)
XX
DE   Tomato yellow leaf curl Thailand virus strain Nong Khai segment A, complete
DE   sequence.
XX
KW   .
XX
OS   Tomato yellow leaf curl Thailand virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2744
RX   DOI; 10.1016/j.virusres.2004.10.001.
RX   PUBMED; 15826907.
RA   Sawangjit S., Chatchawankanphanich O., Chiemsombat P., Attathom T.,
RA   Dale J., Attathom S.;
RT   "Molecular characterization of tomato-infecting begomoviruses in Thailand";
RL   Virus Res. 109(1):1-8(2005).
XX
RN   [2]
RP   1-2744
RA   Sawangjit S., Chatchawankanphanich O., Chiemsombat P., Attathom T.,
RA   Dale J.L., Attathom S.;
RT   ;
RL   Submitted (30-DEC-2003) to the INSDC.
RL   Plant Pathology, Kasetsart University, Malaiman, Kamphaengsaen, Nakhon
RL   Pathom 73140, Thailand
XX
DR   MD5; 90081fc4d7df35be2b8467bc0299cc76.
DR   EuropePMC; PMC2224568; 17977971.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2744
FT                   /organism="Tomato yellow leaf curl Thailand virus"
FT                   /segment="A"
FT                   /strain="Nong Khai"
FT                   /mol_type="genomic DNA"
FT                   /country="Thailand"
FT                   /db_xref="taxon:85752"
FT   gene            275..613
FT                   /gene="AV2"
FT   CDS             275..613
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="precoat protein"
FT                   /db_xref="GOA:Q6R4D5"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:Q6R4D5"
FT                   /protein_id="AAR98590.1"
FT                   /translation="MWDPLLNEFPENVHGFRCMLAVKYLQAVEKTYSPDTLGFDLIRDL
FT                   IGVIRAKNYVEASSRYSHFHARLESTSPSELRQPIQQSCCCPHCPRHKRADMEEPTCIQ
FT                   KAQVLQNV"
FT   gene            435..1205
FT                   /gene="AV1"
FT   CDS             435..1205
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:Q6R4D4"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="InterPro:IPR029053"
FT                   /db_xref="UniProtKB/TrEMBL:Q6R4D4"
FT                   /protein_id="AAR98589.1"
FT                   /translation="MSKRPADILISTPVSKVRRRLNFDSPYNSRAAVPTVRVTKGQIWK
FT                   NRPAYRKPRFYRMYRSPDVPKGCEGPCKVQSFDAKNDIGHMGKVICLSDVTRGIGLTHR
FT                   VGKRFCVKSLYFVGKIWMDENIKVKNHTNTVLFWIVRDRRPTGTPNDFQQVFNVYDNEP
FT                   STATVKNDQRDRFQVIRRFQATVTGGQYAAKEQAIIRKFYRVNNYVVYNHQEAGKYENH
FT                   TENALLLYMACTHASNPVYATLKVRSYFYDSVTN"
FT   gene            complement(1202..1606)
FT                   /gene="AC3"
FT   CDS             complement(1202..1606)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="REn protein"
FT                   /db_xref="GOA:Q6R4D3"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:Q6R4D3"
FT                   /protein_id="AAR98593.1"
FT                   /translation="MDLRTGELLTATQLESGVYIWTVKNPLYFKITKHLESPFQRNHDI
FT                   ITLQIQFNHNLRKALGIHKCFLVFKIWTHLHPQASRFLRVFKYQCIKYLDRLGVISINN
FT                   VIRAISHVLYNVLEGTIDVIEEHDIKFNIY"
FT   gene            complement(1347..1751)
FT                   /gene="AC2"
FT   CDS             complement(1347..1751)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="TrAP protein"
FT                   /db_xref="GOA:Q6R4D2"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q6R4D2"
FT                   /protein_id="AAR98592.1"
FT                   /translation="MRSSSPSKAHSTQVPIKVQHRIAKRATRRRRVDLPCGCSYFVAIG
FT                   CHNNGFTHRGTTHCNSIREWRVYLDGQKSPIFQDNQAPREPIPEEPRHNHVTNPVQPQL
FT                   EESVGDTQMFSSLQNLDSFTSSGLAFLKSI"
FT   gene            complement(1654..2739)
FT                   /gene="AC1"
FT   CDS             complement(1654..2739)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="Rep protein"
FT                   /db_xref="GOA:Q6R4D1"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:Q6R4D1"
FT                   /protein_id="AAR98591.1"
FT                   /translation="MAPPNKFRINAKNYFLTYPHCSLTKEEALSQMHALETPTTKLFIR
FT                   ICRELHEDGTPHLHVLIQFEGKFQCKNQRFFDLTSPTRSAHFHPSIQGAKSSTDVKTYM
FT                   EKDGDVLDHGIFQIDGRSARGGCQSANDAYAEAINSGSKASALTILKEKAPKDFVLQFH
FT                   NLNSNLDRIFTPPMEEYISPFSSSSFNQVPEELKEWACNNVVSAAARPLRPMGIVIEGD
FT                   SRTGKTMWARSLGPHNYLCGHLDLSPKVYNNNVWYNVIDDVDPHYLKHFKEFMGAQRDW
FT                   QSNTKYGKPVQIKGGIPTISLCNPGPNSSYKEYLEEEKNSALRNWAIRNAIFVTLKGPL
FT                   YSGSNQGATPNSQEGNQTTES"
FT   gene            complement(2289..2588)
FT                   /gene="AC4"
FT   CDS             complement(2289..2588)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="AC4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:Q6R4D0"
FT                   /protein_id="AAR98594.1"
FT                   /translation="MKMGPLTCMSSSNSKENSNVKIKDSSISRPQPGQHISIRAFRELK
FT                   AQQMLKHTWKKTETYLIMEFSRSMEDRLEEVANLPTTHMPRQSIQGPKLRPSLY"
XX
SQ   Sequence 2744 BP; 733 A; 545 C; 606 G; 860 T; 0 other;
     ggtcaatcgg tgtctctcaa acttggctat gcaatcggtg tctggggtct tatttatacc        60
     tggacaccaa atggcataat tgtaatttag taaatgtgat tcaaaattca aaatccaaaa       120
     gcggccatcc gtttaatatt accggatggc cgcgattttt tttaaagtgg tccccttgat       180
     gtgatgtttc atccaattaa aacgctcagc caaagcttaa ttatttatgg tcccctattt       240
     aagacttagt caccaagttt cggcgaaatt caaaatgtgg gatccactcc taaacgaatt       300
     tccagaaaac gtccacggtt tccgttgtat gttagcggtg aagtatctgc aagcggtcga       360
     gaagacgtat tcacctgata ccctagggtt tgatctcatc cgtgatctta tcggtgtaat       420
     tcgtgcgaag aactatgtcg aagcgtccag cagatattct catttccacg cccgtctcga       480
     aagtacgtcg ccgtctgaac ttcgacagcc catacaacag tcgtgctgct gtccccactg       540
     tccgcgtcac aaaagggcag atatggaaga accgacctgc atacagaaag cccaggttct       600
     acagaatgta tagaagtcct gatgtcccta agggatgtga gggtccatgt aaagtgcaat       660
     ctttcgatgc gaagaacgat attggtcata tgggcaaggt aatctgtctg tctgacgtta       720
     cccgtggtat tgggcttact catcgagttg gcaagcgttt ctgtgtcaag tcactttatt       780
     ttgtcgggaa gatctggatg gatgaaaata ttaaggttaa gaatcacact aacaccgttt       840
     tattttggat agttagggat cggcgtccta ctggaacgcc taatgatttt cagcaggtct       900
     ttaatgtata tgataatgaa cccagcactg ctactgtaaa gaacgaccag cgtgatcgtt       960
     tccaggttat aaggaggttc caggcaacgg tgactggtgg acaatatgca gctaaggagc      1020
     aggcgattat tagaaagttt tatcgtgtta ataattatgt agtttacaat caccaggaag      1080
     ctgggaagta cgagaaccat actgaaaatg ctttgttgtt gtatatggca tgtactcatg      1140
     cctctaatcc tgtgtatgct actttgaaag tcaggagtta tttctatgac tcagtgacga      1200
     attaataaat attaaatttt atatcgtgtt cttcaattac atcaattgtt ccttctaata      1260
     cattgtacaa tacatgagat attgccctaa ttacattatt tatactaatc acgcctaatc      1320
     tatctaaata tttaatacat tgatatttaa atactcttaa gaaacgcgag gcctgaggat      1380
     gtaaatgagt ccagattttg aagactagaa aacatttgtg tatccccaac gctttcctca      1440
     agttgtggtt gaactggatt tgtaacgtga ttatgtcgtg gttcctctgg aatgggctct      1500
     ctaggtgctt ggttatcttg aaatataggg gatttttgac cgtccagata tacacgccac      1560
     tctctaattg agttgcagtg agtagttccc cggtgcgtaa atccattatt gtgacatcct      1620
     attgcgacga agtacgaaca tccacaaggt agatcaactc tccgtcgtct ggttgccctc      1680
     ttggctattc ggtgttgcac cttgattgga acctgagtag agtgggcctt tgagggtgac      1740
     gaagatcgca tttcttatag cccagtttct aagtgcggag ttcttttcct cttccaagta      1800
     ctctttataa ctggagttgg gtccaggatt gcagagagag atagtgggaa ttccgccttt      1860
     aatttgaact ggctttccgt actttgtgtt tgattgccag tccctttggg cccccatgaa      1920
     ttctttaaag tgttttagat agtgcggatc gacgtcatcg atgacgttgt accacacatt      1980
     attattgtac acttttggac ttaaatctaa atggccacac agataattat gtggtcccaa      2040
     tgacctagcc cacatcgtct tccccgttct gctatcaccc tcaattacta tacccatggg      2100
     tctcaatggc cgcgcagcgg cactgacaac attattacaa gcccattctt taagttcttc      2160
     tggaacttga ttaaaagaag aagaagaaaa tggagaaata tattcctcca ttggaggagt      2220
     aaaaatccta tctaaattag aatttaaatt atgaaattgc aaaacaaaat ctttaggggc      2280
     cttttccttc agtatagtga gggccgaagc tttggaccct gaattgattg cctcggcata      2340
     tgcgtcgttg gcagattggc aacctcctct agccgatctt ccatcgatct ggaaaattcc      2400
     atgatcaagt acgtctccgt ctttttccat gtatgtttta acatctgttg agcttttagc      2460
     tccctgaatg ctcggatgga aatgtgctga cctggttggg gacgtgagat cgaagaatct      2520
     ttgattttta cattggaatt ttccttcgaa ttggatgagg acatgcaggt gaggggtccc      2580
     atcttcatgg agttccctgc agattctgat gaataattta gtagtgggtg tttctagtgc      2640
     gtgcatttgg gaaagtgctt cctctttagt gagagaacaa tgtgggtatg tgaggaaata      2700
     gttcttggca tttattctga atttattagg aggagccatt gact                       2744
//