ID   GU385879; SV 2; linear; genomic DNA; STD; VRL; 2736 BP.
XX
AC   GU385879;
XX
DT   01-MAR-2010 (Rel. 104, Created)
DT   08-MAY-2018 (Rel. 136, Last updated, Version 5)
XX
DE   Cotton leaf curl Kokhran virus segment DNA-A, complete sequence.
XX
KW   .
XX
OS   Cotton leaf curl Kokhran virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2736
RX   DOI; 10.1007/s11262-010-0482-7.
RX   PUBMED; 20405195.
RA   Kumar J., Kumar A., Roy J.K., Tuli R., Khan J.A.;
RT   "Identification and molecular characterization of begomovirus and
RT   associated satellite DNA molecules infecting Cyamopsis tetragonoloba";
RL   Virus Genes 41(1):118-125(2010).
XX
RN   [2]
RP   1-2736
RA   Kumar J., Kumar A., Khan J.A.;
RT   "Molecular characterization of Cotton leaf curl virus infecting Cyamopsis
RT   tetragonolobus";
RL   Unpublished.
XX
RN   [3]
RP   1-2736
RA   Kumar J., Kumar A., Khan J.A.;
RT   ;
RL   Submitted (04-JAN-2010) to the INSDC.
RL   Molecular Virology Lab, National Botanical Research Institute, Rana Pratap
RL   Marg, Lucknow, Uttar Pradesh 226001, India
XX
RN   [4]
RC   Sequence update by submitter
RP   1-2736
RA   Kumar J., Kumar A., Khan J.A.;
RT   ;
RL   Submitted (08-MAR-2010) to the INSDC.
RL   Molecular Virology Lab, National Botanical Research Institute, Rana Pratap
RL   Marg, Lucknow, Uttar Pradesh 226001, India
XX
DR   MD5; d926be7e862b1cdf8593a34ac05f0a5b.
DR   EuropePMC; PMC4054350; 24719407.
XX
CC   On Mar 9, 2010 this sequence version replaced GU385879.1.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2736
FT                   /organism="Cotton leaf curl Kokhran virus"
FT                   /segment="DNA-A"
FT                   /host="Cyamopsis tetragonoloba"
FT                   /isolate="Lucknow"
FT                   /mol_type="genomic DNA"
FT                   /country="India"
FT                   /isolation_source="leaf"
FT                   /clone="IN:Lko:10"
FT                   /db_xref="taxon:222464"
FT   gene            131..487
FT                   /gene="AV2"
FT   CDS             131..487
FT                   /codon_start=1
FT                   /gene="AV2"
FT                   /product="pre-coat protein"
FT                   /db_xref="GOA:D3Y1N2"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:D3Y1N2"
FT                   /protein_id="ADD62429.1"
FT                   /translation="MWDPLLNEFPYTVHGFRCMLSVKYLQLLSQDYSPDTLGYELIRDL
FT                   ISVIRARNYVEATSRYNHFHARFEGTSPSQLRQPICEPCCCPHCPRHQSKSMGEQAHEQ
FT                   KAQDVQDVQKSRCS"
FT   gene            291..1061
FT                   /gene="AV1"
FT   CDS             291..1061
FT                   /codon_start=1
FT                   /gene="AV1"
FT                   /product="coat protein"
FT                   /note="involved in encapsidation"
FT                   /db_xref="GOA:D3Y1N1"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:D3Y1N1"
FT                   /protein_id="ADC96619.1"
FT                   /translation="MSKRPADIIISTPASKVRRRLNFDSPYVSRAAAPIVRVTKAKAWA
FT                   NRPMNRKPRMYRMYRSPDVPRGCEGPCKVQSFESRHDIQHIGKVMCVSDVTRGTGLTHR
FT                   VGKRSCVKSVYVLGKIWMDENIKTKNHTNSVMFFLVRDRRPVDKPQDFGWVFNMFDNEP
FT                   STATVKNVHRDRYQVLRKWYATVTGGQYASKEQALVKKSIRVNNYVVYNQQEAGKYENH
FT                   SENALMLYMACTHASNPVYATLKIRIYFYDSVTN"
FT   gene            complement(1058..1459)
FT                   /gene="AC3"
FT   CDS             complement(1058..1459)
FT                   /codon_start=1
FT                   /gene="AC3"
FT                   /product="replication enhancer protein"
FT                   /db_xref="UniProtKB/TrEMBL:D3Y1N3"
FT                   /protein_id="ADD62430.1"
FT                   /translation="MDSRTREPITLQPCRMSHNLEVPIPLFQNHQPRQPSIHDEYGYTH
FT                   DQDPVQSQPEESVGDTQMFHSFPNLDDLTASDWSFLKGLYDSSASIFNLLRNCQYYIMC
FT                   LSSWSCIMECITEHVVYVDQSSSIKFNIY"
FT   gene            complement(1153..1599)
FT                   /gene="AC2"
FT   CDS             complement(1153..1599)
FT                   /codon_start=1
FT                   /gene="AC2"
FT                   /product="transcription activator protein"
FT                   /db_xref="GOA:D3Y1N4"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:D3Y1N4"
FT                   /protein_id="ADD62431.1"
FT                   /translation="MRSSSLSKDHCTQLSIKVQHREARDATGERIRSCSPCYSSSVITA
FT                   TPWIHAQGNPSLCSLAGCRIIWRFPSLYFRIISHVNRPFTTNMDILTIRIQFNHNLRKA
FT                   LGIHKCFIAFRIWMTSQPPTGRFLRVFTTQVLQYLIYLGIVSII"
FT   gene            complement(1505..2581)
FT                   /gene="AC1"
FT   CDS             complement(1505..2581)
FT                   /codon_start=1
FT                   /gene="AC1"
FT                   /product="Rep protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:D3Y1N5"
FT                   /protein_id="ADD62432.1"
FT                   /translation="MPPKRKKYKPKTISSLIHSAHSLKRKHFPKFKPSTHPRIKNTSNS
FT                   AESYTKCGALISMCSSSSRANSSARITDSSTWYPQPGQHISIRTFRELNQAQMSGLHRQ
FT                   GRGHSRVGRVSDRWKISKRRTADSPRRLRRSTLRRQSVRGSYSHLGTRSLRFCTTISLF
FT                   KCKSRSNLSGATGSLYFSFFSFFFRSSSRRTCSVGCRERRQCRCAAQLTNKFSDCGCQS
FT                   DGEDDVGQIIRSTYYLCGHLDLSPRVYSNDAWFNVIDDVDPHYLKHFKEFMGAQKDWQS
FT                   NTKYGKPVQIKGGIPTIFLCNQGPNSSYKEFLDEEKNSALKNWALKNAIFITLEGPLYS
FT                   AFHQSTAQGSERRNRREN"
FT   gene            complement(2091..2429)
FT                   /gene="AC4"
FT   CDS             complement(2091..2429)
FT                   /codon_start=1
FT                   /gene="AC4"
FT                   /product="transcriptional regulator protein"
FT                   /db_xref="GOA:D3Y1N6"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="UniProtKB/TrEMBL:D3Y1N6"
FT                   /protein_id="ADD62433.1"
FT                   /translation="MWSPHLHVLIQFEGKFVCTNNRFFDLVSPTRSAHFHPNIQGAKSS
FT                   SDVRPTSTRTGTLSSGESFRSMEDQQEEDSRQPTTLTPQHFTQAVSQRLLQSFRNSLLK
FT                   ILYYNFII"
XX
SQ   Sequence 2736 BP; 742 A; 559 C; 641 G; 794 T; 0 other;
     accggatggc cgcgcgattt ttttgtgggc cctaccatta actcttgtcg gccaatcata        60
     tgactccctc aaagctaaat aacgctcccg cacactataa gtacttgcgc actaagtttc       120
     aaattcaaac atgtgggatc cactattaaa cgaattcccc tatacggttc acgggtttcg       180
     gtgtatgctt tctgtgaaat atttgcaact tttgtcgcag gattattcac cggatacgct       240
     tgggtacgag ttaatacggg atttgatttc agtaataagg gccaggaatt atgtcgaagc       300
     gaccagcaga tataatcatt tccacgcccg cttcgaaggt acgtcgccgt ctcaacttcg       360
     acagcccata tgtgagccgt gctgctgccc ccattgtccg cgtcaccaaa gcaaaagcat       420
     gggcgaacag gcccatgaac agaaagccca ggatgtacag gatgtacaga agtccagatg       480
     ttcctagagg atgtgaaggt ccatgtaagg ttcagtcgtt tgagtccaga catgatattc       540
     agcatatagg taaagtcatg tgtgttagtg atgttactcg tggtactggg cttacccata       600
     gagtgggtaa gagatcttgt gtgaagtctg tgtatgtttt gggtaagata tggatggatg       660
     agaacattaa gacgaagaat cacacgaata gtgtgatgtt tttcttggtt agagatcgta       720
     gaccagttga taaacctcaa gattttggat gggtgtttaa catgtttgat aatgagccca       780
     gtacggcgac tgtgaagaat gttcatcgtg ataggtatca agttctgcgc aaatggtatg       840
     caactgtcac cggtggacaa tacgcttcaa aggaacaagc tctcgtgaag aagtctatta       900
     gggttaataa ttatgttgtg tataaccagc aggaagctgg caagtatgag aatcattctg       960
     agaatgcttt aatgttgtat atggcgtgta ctcacgcctc taacccagtg tatgctacct      1020
     tgaagatacg gatctacttc tatgattccg tgacaaatta ataaatattg aattttattg      1080
     aagatgattg gtctacatat acaacatgct ctgtaataca ttccataata catgaccaac      1140
     tgctcaaaca cattatataa tactgacaat tcctaagtaa attaaatatt gaagcacttg      1200
     agtcgtaaag acccttaaga aacgaccagt cggaggctgt gaggtcatcc agattcggaa      1260
     agctatgaaa catttgtgta tccccaacgc tttcctcagg ttgtgattga actggatcct      1320
     gatcgtgagt atatccatat tcgtcgtgaa tggacggttg acgtggctga tgattctgaa      1380
     ataaagggat gggaacctcc agattatgcg acatcctgca aggctgcaaa gtgatgggtt      1440
     cccttgtgcg tgaatccatg gtgtggcagt gatgacagat gacgaataac acggtgaaca      1500
     agatctaatt ctctctcctg ttgcgtctct cgcttccctg tgctgtactt tgatggaaag      1560
     ctgagtacag tggtccttcg agagtgatga agatcgcatt ctttaaagcc caatttttta      1620
     gtgcagaatt cttctcttca tccaaaaact ctttatagct tgaattgggt ccttgattgc      1680
     agaggaagat agtgggaatt ccgcctttaa tttgaactgg cttcccgtat tttgtatttg      1740
     attgccagtc cttttgggcc cccatgaact ccttaaagtg ctttaggtaa tgcgggtcga      1800
     cgtcatcaat gacgttaaac caggcgtcat tactgtatac ccttgggctc agatctagat      1860
     gtccacacag ataatatgtg gacctaatga tctggcccac atcgtcttcc ccgtccgact      1920
     gacacccaca atcactaaac ttattggtca attgggccgc gcagcggcac tgacgacgtt      1980
     ctcggcagcc cacactacaa gttcttctgg aacttgatcg aaagaagaag gagaaaaagg      2040
     agaaatatag ggagccggtg gctcctgaaa gattcgatct agatttgcat ttaaataatg      2100
     aaattgtagt acaaaatctt aaggagcgag ttcctaaatg actgtaagag cctctgactg      2160
     actgcctgcg taaagtgctg cggcgtaagc gtcgtgggct gtctgctgtc ctcctcttgc      2220
     tgatcttcca tcgatctgaa actctcccca ctcgagagtg tccccgtcct tgtcgatgta      2280
     ggcctgacat ctgagcttga tttagctccc tgaatgttcg gatggaaatg tgctgacctg      2340
     gttggggata ccaggtcgaa gaatctgtta ttcgtgcaga cgaatttgcc ctcgaactgg      2400
     atgagcacat ggagatgagg gctccacatt tcgtgtaact ctctgcagag tttgatgtat      2460
     tttttattcg agggtgtgtt gatggcttga atttgggaaa gtgcttcctc tttagtgagt      2520
     gagcactgtg gataagtgat gaaatagttt ttggcttgta ctttttccgc tttggaggca      2580
     tgttgactaa aattgatcac cgattgaccg ctcttgcaac tctccccggt atatcggtga      2640
     tcaatatata gtgatcacca aatggcataa tggtaataaa aaaactttaa tttgaaattc      2700
     aaaccaaaag gctaaagcgg ccatccgtat aatatt                                2736
//