ID   FJ469626; SV 1; circular; genomic DNA; STD; VRL; 2771 BP.
XX
AC   FJ469626;
XX
DT   29-DEC-2008 (Rel. 98, Created)
DT   25-FEB-2009 (Rel. 99, Last updated, Version 2)
XX
DE   Cotton leaf curl Gezira virus-[okra:Niger] clone NG1FL segment A, complete
DE   sequence.
XX
KW   .
XX
OS   Cotton leaf curl Gezira virus-[okra:Niger]
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2771
RX   DOI; 10.1007/s00705-008-0304-6.
RX   PUBMED; 19156351.
RA   Shih S.L., Kumar S., Tsai W.S., Lee L.M., Green S.K.;
RT   "Complete nucleotide sequences of okra isolates of Cotton leaf curl Gezira
RT   virus and their associated DNA-beta from Niger";
RL   Arch. Virol. 154(2):369-372(2009).
XX
RN   [2]
RP   1-2771
RA   Shih S.L., Kumar S., Lee L.M., Green S.K.;
RT   ;
RL   Submitted (17-NOV-2008) to the INSDC.
RL   Virology, AVRDC-The World Vegetable Center, PO Box 42, Shanhua, Taiwan,
RL   Republic of China
XX
DR   MD5; 5e643ff836d01aa2589033bbda6c963e.
DR   EuropePMC; PMC2839976; 20178575.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2771
FT                   /organism="Cotton leaf curl Gezira virus-[okra:Niger]"
FT                   /segment="A"
FT                   /mol_type="genomic DNA"
FT                   /country="Niger:Sadore (45km from Niamey)"
FT                   /isolation_source="Okra"
FT                   /collection_date="2007"
FT                   /clone="NG1FL"
FT                   /db_xref="taxon:502871"
FT   gene            161..529
FT                   /gene="V2"
FT   CDS             161..529
FT                   /codon_start=1
FT                   /gene="V2"
FT                   /product="pre-coat protein"
FT                   /db_xref="GOA:B1PIY0"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:B1PIY0"
FT                   /protein_id="ACK77810.1"
FT                   /translation="MWDPLLNDFPESVHGFRCMLAVKYLQAVRESYDPSTLGYDLLSDL
FT                   IGVVRRTNYVEATSRYHHFHSRLESASPSELRQPRVVLCTCPHCPRHKQTSGLDQQAHV
FT                   QKTEDVQNVSQPRCPKGM"
FT   gene            complement(294..851)
FT                   /gene="C5"
FT   CDS             complement(294..851)
FT                   /codon_start=1
FT                   /gene="C5"
FT                   /product="C5 protein"
FT                   /db_xref="InterPro:IPR006892"
FT                   /db_xref="InterPro:IPR013671"
FT                   /db_xref="UniProtKB/TrEMBL:B8Y3K2"
FT                   /protein_id="ACK77807.1"
FT                   /translation="MITEIVLDRSSARFVVKHVENLTEIQRGIPIRSSITDKEKHDIVG
FT                   VVLLLNVVIHPYLTKDINGFDSETLSSTMGKTNPLSNIGHTSDNTRMLDIITLFIRLDL
FT                   TGTFTSLWDIWAAIHSVHPRFSVRGPVGPGPTFVCDEDSGGTCRGQPWAVEVQTATHFR
FT                   GGSGNDDICWSLRHSWYDEQLR"
FT   gene            321..1097
FT                   /gene="V1"
FT   CDS             321..1097
FT                   /codon_start=1
FT                   /gene="V1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:B8Y3K3"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="UniProtKB/TrEMBL:B8Y3K3"
FT                   /protein_id="ACK77806.1"
FT                   /translation="MSKRPADIIISTPASKVRRRLNFDSPGLSSARAPTVLVTNKRRAW
FT                   TNRPTYRKPRMYRMYRSPDVPKGCEGPCKVQSYEQRDDVKHTGIVRCVSDVTKGVGLTH
FT                   RTGKRFTIKSIYILGKVWMDDNIKKQNHTNNVMFFLVRDRRPYGNSPLDFGQVFNMFDN
FT                   EPSTATVKNDLRDHFQVLRKFTATVIGGPSGMKEQALVRRFFRINSQIVYNHQEAGKFE
FT                   NHTENAILLYMACTHASNPVYATLKIRIYFYDSVSN"
FT   gene            complement(1094..1495)
FT                   /gene="C3"
FT   CDS             complement(1094..1495)
FT                   /codon_start=1
FT                   /gene="C3"
FT                   /product="replication enhancement protein"
FT                   /db_xref="GOA:B1PIY3"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:B1PIY3"
FT                   /protein_id="ACK77809.1"
FT                   /translation="MDSRTGELITAHQSENGVLIWTINNPLYFKTIKEFRLTHGNQTMI
FT                   EMQIRFNYNLRKELGIHKCFMNFRVWTISRPPTGLFLNVFRKQIMKYLYRLGVISINNV
FT                   IRAVNHVLYNVLQTTVASEFTHNIQIKLY"
FT   gene            complement(1239..1643)
FT                   /gene="C2"
FT   CDS             complement(1239..1643)
FT                   /codon_start=1
FT                   /gene="C2"
FT                   /product="transcriptional activation protein"
FT                   /db_xref="GOA:B1PIX7"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:B1PIX7"
FT                   /protein_id="ACK77808.1"
FT                   /translation="MRPSSPSQIRCTQVPIKVQHREAKKRAIRRRRIDIPCGCTVYVAF
FT                   TCRDNGFTHRGTHHCASEREWRTYLDNQQSPVFQNHQGVSSNARESNNDRDADKIQLQP
FT                   QEGVGDSQMFHELPGLDDLTPSDWSFLKRI"
FT   gene            complement(1543..2631)
FT                   /gene="C1"
FT   CDS             complement(1543..2631)
FT                   /codon_start=1
FT                   /gene="C1"
FT                   /product="replication-associated protein"
FT                   /db_xref="GOA:B1PIX8"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:B1PIX8"
FT                   /protein_id="ACK77805.1"
FT                   /translation="MAPTKKFRINSKNYFLTFPKCSLSKEEALEQLLNLNTPTNKKYIK
FT                   ICRELHEDGQPHLHVLLQFQGKYNCQNKRFFDLVSPNRSAHFHPNIQGAKSSSDVKSYI
FT                   DKDGDTLEWGEFQIDGRSARGGQQTANDAYAAALNAGSKAEALRVIRELAPKDFVLQFH
FT                   NLNSNLERIFEEPPAPYVSPFLSSSFDQVPEELEEWAAENVVEAAARPDRPISIVIEGD
FT                   SRTGKTVWARSLGPHNYLCGHLDLSPKVFSNDAWYNVIDDVDPHYLKHFKEFMGAQKDW
FT                   QSNTKYGKPVQIKGGIPTIFLCNPGPNSSYKEYLDEEKNAHLKSWALKNATFITLSNPL
FT                   YSGTNQSPASGGQEESNQETQD"
FT   gene            complement(2181..2474)
FT                   /gene="C4"
FT   CDS             complement(2181..2474)
FT                   /codon_start=1
FT                   /gene="C4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:B8Y3K7"
FT                   /protein_id="ACK77811.1"
FT                   /translation="MASLICMCFSSSKGSTTAKIRDSSTWSPQIGQHISIQTFRELNPA
FT                   PTSSPTSTRMETHSNGENSRSTEDLPEEANRQPMTLTPQRLTQEVRQRLLGL"
XX
SQ   Sequence 2771 BP; 695 A; 626 C; 617 G; 833 T; 0 other;
     accggtgggc gcgaaaaaaa aagtggaccc cgccccacgt gaacatgtcg cgcgagtggt        60
     gaacatgtcg cgcgatgctg tccaatcaga acgcgcgctc tacgcattat aatttgaaat       120
     ttgaaatata tacttgctcc ccaagttgtc cgccataact atgtgggatc cgttattgaa       180
     cgacttccct gaatccgttc acggtttccg ttgtatgctc gccgtgaaat atttgcaggc       240
     tgttcgtgag tcgtacgatc cttccaccct tggttacgat cttctaagcg atctcatcgg       300
     agttgttcgt cgtaccaact atgtcgaagc gaccagcaga tatcatcatt tccactcccg       360
     cctcgaaagt gcgtcgccgt ctgaacttcg acagcccagg gttgtcctct gcacgtgccc       420
     ccactgtcct cgtcacaaac aaacgtcggg cctggaccaa caggcccacg tacagaaaac       480
     cgaggatgta cagaatgtat cgcagcccag atgtcccaaa gggatgtgaa ggtccctgta       540
     aggtccagtc ttatgaacag cgtgatgatg tcaagcatac gggtattgtc cgatgtgtgt       600
     ccgatgttac taagggggtt ggtcttaccc atcgtactgg aaagcgtttc actatcaaat       660
     ccatttatat ccttggtaag gtatggatgg atgacaacat taagaagcag aaccacacca       720
     acaatgtcat gtttttcctt gtccgtgata gacgaccgta tgggaattcc ccgttggatt       780
     tcggtcaagt tttcaacatg tttgacaacg aacctagcac tgctacggtc aagaacgatc       840
     tccgtgatca ttttcaggta ttgaggaagt ttactgcaac tgttattggt ggtccttcag       900
     gaatgaaaga acaagccctt gttcgtcgtt tttttaggat taatagtcag attgtgtaca       960
     accatcaaga ggctggtaaa ttcgagaatc atacggagaa tgctatattg ttgtatatgg      1020
     catgtactca tgcctcaaac cctgtgtatg ctacgttaaa gatacggatc tacttctacg      1080
     attcggtatc taattaatat aattttattt gaatattgtg cgtgaattca ctggccacag      1140
     ttgtttgcaa tacattatac aaaacatgat tgactgccct aattacattg ttaattgaaa      1200
     ttacgcctaa tctatacaaa tatttcataa tctgcttcct aaatacgttt aagaaaagac      1260
     cagtcggagg gcgtgagatc gtccagaccc ggaagttcat gaaacatttg tgaatcccca      1320
     actccttcct gaggttgtag ttgaatctta tctgcatctc tatcattgtt tgattcccgt      1380
     gcgttagacg aaactccttg atggttttga aatacagggg attgttgatt gtccagataa      1440
     gtacgccatt ctcgctctga tgcgcagtga tgagttcccc tgtgcgtgaa tccattgtct      1500
     ctgcacgtga aggctacgta taccgtgcaa ccacaaggga tgtcaatcct gcgtctcctg      1560
     attgctctct tcttggcctc ccgatgctgg actttgattg gtacctgagt acagcggatt      1620
     tgagagggtg atgaaggtcg cattctttaa tgcccaggat ttgagatgcg cgtttttctc      1680
     ctcgtctaaa tactctttat atgaggaatt gggacctgga ttgcagagga agattgttgg      1740
     gatgcctcct ttaatttgaa ctggtttccc gtattttgtg ttgctttgcc agtccttttg      1800
     tgcccccatg aattccttaa agtgtttcag ataatgcggg tcaacatcat cgatgacgtt      1860
     ataccatgcg tcattactga acacctttgg gcttagatca agatggccgc acagataatt      1920
     gtgaggccca agacttctgg cccatacggt cttccctgtt ctgctatccc cttcaatcac      1980
     tatactaatc ggtctatctg gccgcgcagc ggcctcaacg acgttctccg ccgcccattc      2040
     ttcaagttct tctggaactt ggtcgaaaga agaagacaaa aaaggagaaa cataaggagc      2100
     tggaggctcc tcaaaaatcc tttctaaatt actatttaaa ttatgaaatt gtaaaacaaa      2160
     atctttggga gctaactccc ttataaccct aagagcctct gccttacttc ctgcgttaag      2220
     cgctgcggcg taagcgtcat tggctgtctg ttggcctcct ctggcagatc ttccgtcgat      2280
     ctggaattct ccccattcga gtgtgtctcc atccttgtcg atgtaggact tgacgtcgga      2340
     gctggattta gctccctgaa tgtttggatg gaaatgtgct gacctatttg gggagaccag      2400
     gtcgaagaat ctcttatttt ggcagttgta cttcccttgg aactggagaa gcacatgcag      2460
     atgaggctgg ccatcttcgt gaagctctct gcaaatcttg atgtattttt tatttgttgg      2520
     ggtgtttagg tttagaagtt gctctagtgc ttcttcttta gaaagagaac attttgggaa      2580
     agtgaggaaa tagtttttag aatttattct gaatttcttt gtaggagcca tattgtcaac      2640
     tagcaccgat tgaccactcc agatacttct ccctagtgaa ttggggttct atatatagtg      2700
     agaccccaaa tggcattatg gtaataatcc caaaaaattt gaacccccat agcgcccacc      2760
     gttctaatat t                                                           2771
//