ID HM037920; SV 1; circular; genomic DNA; STD; VRL; 2753 BP. XX AC HM037920; XX DT 27-APR-2012 (Rel. 112, Created) DT 27-APR-2012 (Rel. 112, Last updated, Version 1) XX DE Cotton leaf curl virus isolate Sirsa-UC segment DNA-A, complete sequence. XX KW . XX OS Cotton leaf curl virus OC Viruses; Geminiviridae; Begomovirus; unclassified Begomovirus. XX RN [1] RP 1-2753 RA Chakrabarty P.K., Sable S.V., Kalbande B.B., Chavhan R.L., Monga D., RA Koundal V., Kumar D., Pappu H.R.; RT ; RL Submitted (27-MAR-2010) to the INSDC. RL Division of Crop Improvement, Central Institute for Cotton Research, Post RL Bag No. 2, Shankar Nagar, P.O., Nagpur, Maharashtra 440010, India XX DR MD5; 8d98fb82429f8ceb3541da083676aa28. XX FH Key Location/Qualifiers FH FT source 1..2753 FT /organism="Cotton leaf curl virus" FT /segment="DNA-A" FT /host="Gossypium hirsutum" FT /isolate="Sirsa-UC" FT /mol_type="genomic DNA" FT /country="India" FT /collected_by="Dilip Monga and P.K. Chakrabarty" FT /collection_date="Aug-2009" FT /db_xref="taxon:53010" FT gene complement(75..806) FT /gene="AC5" FT CDS complement(75..806) FT /codon_start=1 FT /gene="AC5" FT /product="ac5 protein" FT /db_xref="InterPro:IPR006892" FT /db_xref="InterPro:IPR013671" FT /db_xref="UniProtKB/TrEMBL:I1SZD1" FT /protein_id="AEI52849.1" FT /translation="MNILHSRRTGLIIKHIKYLSKILRFINRSTISNQEKHHTIRVILR FT LNVLIHPYLTQHINRLDTKSLTNSMGQPSTTSNITNTHYFTYMLNIMSGLKRLNLTWTF FT TSSRNIWTSVHPVHPGLSVHGPVRPCFCFGDADNGGSSTARIWAVEVQTAAYLRSGRGN FT DDICWSLRHNYWPLLPGLNPLSNRNPMYPESKFSLLTANISLLTYTENRERCQGTRLTV FT DPTFFKRILSNEVLIISGRII" FT gene 131..487 FT /gene="AV2" FT CDS 131..487 FT /codon_start=1 FT /gene="AV2" FT /product="precoat/movement protein" FT /db_xref="GOA:I1SZC5" FT /db_xref="InterPro:IPR002511" FT /db_xref="InterPro:IPR005159" FT /db_xref="UniProtKB/TrEMBL:I1SZC5" FT /protein_id="AEI52843.1" FT /translation="MWDPLLNEFPDTVHGFRCMLAVKYLQLVEKTYSPDTLGYDLIRDL FT ILVIRASNYVEATSRYRHFHARFEGTPPSELRQPICEPCCCPHCPRHQSKSMGEQAHEQ FT KAQDVQDVQKSRCS" FT gene 291..1061 FT /gene="AV1" FT CDS 291..1061 FT /codon_start=1 FT /gene="AV1" FT /product="coat protein" FT /db_xref="GOA:I1SZC6" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="UniProtKB/TrEMBL:I1SZC6" FT /protein_id="AEI52844.1" FT /translation="MSKRPADIVISTPASKVRRRLNFDSPYASRAAAPIVRVTKAKAWA FT NRPMNRKPRMYRMYRSPDVPRGCEGPCKVQSFESRHDIQHIGKVMCISDVTRGTGLTHR FT VGKRFCVKSVYVLGKIWMDENIKTKNHTNSVMFFLVRDRRPVDKPQDFGEVFNMFDNEP FT STATVKNVHRDRYQVLRKWYATVTGGQYASKEQALVKKFVRVNNYVVYNQQEAGKYENH FT TENALMLYMACTHASNPVYATLKIRIYFYDSVKN" FT gene complement(1064..1468) FT /gene="AC3" FT CDS complement(1064..1468) FT /codon_start=1 FT /gene="AC3" FT /product="replication enhancer protein" FT /note="ren" FT /db_xref="GOA:I1SZC7" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:I1SZC7" FT /protein_id="AEI52845.1" FT /translation="MDSRTGEPITAAQAGNGAYIWEVPNPLYFKIISHVNRPFTTNMDI FT LTIRIQFNYNTRKALELHKCFLTFRIWTTLQPQTGLFLRVFKTQVLKYLNNLGVISINL FT VIKAVEHVLYNIIQQTMYVDQYSEIKFKLH" FT gene complement(1161..1613) FT /gene="AC2" FT CDS complement(1161..1613) FT /codon_start=1 FT /gene="AC2" FT /product="transcription activator protein" FT /db_xref="GOA:Q1KSG4" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:Q1KSG4" FT /protein_id="AEI52846.1" FT /translation="MRSSSHLIDPCTQVPIKVQHREAKRRNRRRRVDLECGCSYYLSIN FT CHNHGFTHRGTHHCSSSREWRIYLGGSKSPLFQDHQPRQPSIHDEYGHTHDQDPVQLQH FT SESSGTAQVFSNIPNLDDLTASDWSFLKGIQNPSPQISEQSRCNFN" FT gene complement(1510..2598) FT /gene="AC1" FT CDS complement(1510..2598) FT /codon_start=1 FT /gene="AC1" FT /product="replication initiator protein" FT /db_xref="GOA:I1SZC9" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:I1SZC9" FT /protein_id="AEI52847.1" FT /translation="MPPKRNGFYSKNYFITYPKCSLTKEEALSQLLNIQTPTSKKYIRI FT CRELHEDGTPHLHVLIQFEGKFKCQNMRFFDLVSPSRSAHFHPNIQGAKSSSDVKSYIE FT KDGDILDWGQFQIDGRSARGGQQTANDAYAAALNAGSKSEALRVIKELAPKDFVLQFHN FT LNANLDRIFQEPPAPYVSPFSSSSFDQVPEELEVWADENVVSAAARANRPISLVIEGDS FT RTGKTMWARSLGPHNYLCGHLDLSPRVYSNDAWFNVIDDVDPHYLKHFKEFMGAQKDWQ FT SNTKYGKPVQIKGGIPTIFLCNPGPNSSYKEFLDEEKNSALKNWALKNAIFITLDRPMY FT SGTNQSTAQGSEEAQQEEESRS" FT gene complement(2142..2444) FT /gene="AC4" FT CDS complement(2142..2444) FT /codon_start=1 FT /gene="AC4" FT /product="ac4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:I1SZD0" FT /protein_id="AEI52848.1" FT /translation="MGLLTCMFSSSSKGNSSARICDSSTWSPQAGQHISIRTYRELNPA FT PTSNPTSRRTGTFSTGGNFRSTEGQQEEGNRQPMTLTPQHLTREVSRRLLESLRN" XX SQ Sequence 2753 BP; 743 A; 543 C; 632 G; 835 T; 0 other; accggatggc cgcgcgattt ttttgtgggc cttaccatta acacttgtcg gccaatcata 60 tgacgcgctc aaagctaaat aattctcccg cttattataa gtacttcgtt gctaagtatg 120 cgtttgaaaa atgtgggatc cactgttaaa cgagttccct gacaccgttc acggttttcg 180 gtgtatgtta gcagtgaaat atttgcagtt agtagagaaa acttactctc cggatacatt 240 gggttacgat ttgataaggg atttaatcct ggtaataagg gccagtaatt atgtcgaagc 300 gaccagcaga tatcgtcatt tccacgcccg cttcgaaggt acgccgccgt ctgaacttcg 360 acagcccata tgcgagccgt gctgctgccc ccattgtccg cgtcaccaaa gcaaaagcat 420 gggcgaacag gcccatgaac agaaagccca ggatgtacag gatgtacaga agtccagatg 480 ttcctagagg atgtgaaggt ccatgtaagg ttcagtcgtt tgagtccaga catgatattc 540 agcatatagg taaagtaatg tgtattagtg atgttactcg tggtactggg ctgacccata 600 gagttggtaa gagattttgt gtcaagtctg tttatgtgtt gggtaagata tggatggatg 660 agaacattaa gacgaagaat cacacgaata gtgtgatgtt tttcttggtt agagatcgta 720 gacctgttga taaacctcaa gattttggag aggtatttaa tatgtttgat aatgagccca 780 gtacggcgac tgtgaagaat gttcatcgtg ataggtatca agttctgcgc aaatggtatg 840 caactgtcac cggtggacaa tacgcttcaa aggaacaggc tttggtcaag aagtttgtca 900 gagttaacaa ttatgttgtt tacaatcaac aggaagcagg aaaatacgag aatcatacgg 960 aaaatgcgtt aatgctttat atggcttgta ctcacgctag caaccctgtt tatgctacgt 1020 tgaagattag gatatatttt tatgactctg taaagaattg atattaatga agtttgaatt 1080 ttatttctga atattgatct acatacatag tttgttggat tatattgtac aatacatgtt 1140 ctacagcttt aataactaaa ttaattgaaa ttacaccgag attgttcaga tatttgagga 1200 cttgggtttt gaataccctt aagaaaagac cagtctgagg ctgtaaggtc gtccagattc 1260 ggaatgttag aaaacacttg tgcagttcca gagctttccg agtgttgtag ttgaactgga 1320 tcctgatcgt gagtatgtcc atattcgtcg tgaatggacg gttgacgtgg ctgatgatct 1380 tgaaataaag gggatttgga acctcccaga tatatgcgcc attccctgct tgagctgcag 1440 tgatgggttc ccctgtgcgt gaatccatgg ttatggcagt tgattgacag ataataagaa 1500 cacccgcatt caagatctac tctcctcctc ctgttgcgcc tcttcgcttc cctgtgctgt 1560 actttgattg gtacctgagt acatgggtct atcaagtgtg atgaagatcg cattctttaa 1620 agcccaattt tttagtgcag aattcttctc ttcatccaaa aactctttat agcttgaatt 1680 gggtcctgga ttgcagagga agatagtggg aattccgcct ttaatttgaa ctggcttccc 1740 gtattttgta tttgattgcc agtccttttg ggcccccatg aactccttaa agtgctttag 1800 gtaatgcggg tcgacgtcat caatgacgtt aaaccaggcg tcattactgt atacccttgg 1860 gctcagatct agatgtccac acagataatt atgtggacct aatgatctgg cccacatcgt 1920 cttccccgtc ctactgtcac cctcaatcac taaacttatt ggtctattgg cccgcgcagc 1980 ggcactgacg acgttctcgt cagcccacac ttcaagttct tctggaactt gatcgaaaga 2040 agaagaggaa aaaggagaaa cataaggagc tggtggctcc tgaaagattc tgtctagatt 2100 tgcatttaaa ttatgaaatt gcagtacaaa atccttagga gctagttcct taatgactct 2160 aagagcctcc gacttacttc ccgcgttaag tgctgcggcg taagcgtcat tggctgtctg 2220 ttgccctcct cttgctgacc ttccgtcgat ctgaaattgc ccccagtcga gaatgtcccc 2280 gtccttctcg atgtaggatt tgacgtcgga gctggattta gctccctgta tgttcggatg 2340 gaaatgtgct gacctgcttg gggagaccaa gtcgaagaat cgcatattct ggcacttgaa 2400 tttcccttcg aactggatga gaacatgcaa gtgaggagtc ccatcttcgt gaagctctct 2460 gcagattcta atatattttt ttgaagttgg ggtttgtata tttaataatt gggaaagtgc 2520 ttcctctttg gtgagagaac atttgggata agtgatgaaa tagtttttgg aataaaaacc 2580 gttccgcttt ggaggcatgt tgactaaaat tgatcaccga ttgaccgctc ttgcaactct 2640 ccctggtata tcggtgatca atatatagtg atcaccaaat ggcataatgg taataaaaaa 2700 actttaattt gaaattcaaa ccaaaaggct aaagcggcca tccgtttaat att 2753 //