ID   FR751142; SV 1; circular; genomic DNA; STD; VRL; 2763 BP.
XX
AC   FR751142;
XX
DT   22-JUN-2011 (Rel. 109, Created)
DT   22-JUN-2011 (Rel. 109, Last updated, Version 1)
XX
DE   Cotton leaf curl Gezira virus DNA-A, complete sequence, clone NT1
XX
KW   complete genome.
XX
OS   Cotton leaf curl Gezira virus
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2763
RA   Tahir M.N.;
RT   ;
RL   Submitted (25-DEC-2010) to the INSDC.
RL   Tahir M.N., Agricultural Biotechnology Division, National Institute for
RL   Biotechnology & Genetic Engineering, NIBGE, Jhang Road, Faisalabad, Punjab,
RL   38000, PAKISTAN.
XX
RN   [2]
RX   DOI; 10.1371/journal.pone.0020366.
RX   PUBMED; 21637815.
RA   Tahir M.N., Amin I., Briddon R.W., Mansoor S.;
RT   "The merging of two dynasties--identification of an African cotton leaf
RT   curl disease-associated begomovirus with cotton in Pakistan";
RL   PLoS One 6(5):e20366-e20366(2011).
XX
DR   MD5; 1f5b83203e9b96963faf313206581d92.
DR   EuropePMC; PMC3102712; 21637815.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2763
FT                   /organism="Cotton leaf curl Gezira virus"
FT                   /segment="DNA-A"
FT                   /host="Gossypium hirsutum"
FT                   /mol_type="genomic DNA"
FT                   /country="Pakistan:Sindh, Hala"
FT                   /collection_date="2005"
FT                   /clone="NT1"
FT                   /tissue_type="Leaf"
FT                   /db_xref="taxon:222459"
FT   CDS             162..530
FT                   /gene="V2"
FT                   /product="precoat protein"
FT                   /db_xref="GOA:F8K9T6"
FT                   /db_xref="InterPro:IPR002511"
FT                   /db_xref="InterPro:IPR005159"
FT                   /db_xref="UniProtKB/TrEMBL:F8K9T6"
FT                   /protein_id="CBY85306.1"
FT                   /translation="MWDPLLNDFPESVHGFRCMLAVKYLQAVRESYDPSTLGYDLLSDL
FT                   IGVVRRTNYVEATSRYHHFHSRLESASPSELRQSRVILCTCPHCPRHKQTSGVVQQTQL
FT                   QEAQNLQTVSEPRCPKGV"
FT   CDS             322..1098
FT                   /gene="V1"
FT                   /product="coat protein"
FT                   /db_xref="GOA:F8K9T7"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR000650"
FT                   /db_xref="InterPro:IPR029053"
FT                   /db_xref="UniProtKB/TrEMBL:F8K9T7"
FT                   /protein_id="CBY85307.1"
FT                   /translation="MSKRPADIIISTPASKVRRRLNFDSPGLSSARAPTVLVTNKRRAW
FT                   SNRPNYRKPRIYRLYRSPDVPKGCEGPCKVQSYEQRDDVKHTGIVRCVSDVTKGTGLTH
FT                   RTGKRFTIKSIYILGKVWMDENIKKQNHTNNVMFFLVRDRRPYGNSPMDFGQVFNMFDN
FT                   EPSTATVKNDYRDRFQVMRKFSATVTGGPSGMKEQALVRRFFKINSQIVYNQQEAAKYE
FT                   NHTENALLLYMACTHASNPVYATLKIRIYFYDSVSN"
FT   CDS             complement(550..1032)
FT                   /gene="C5"
FT                   /product="hypothetical protein"
FT                   /db_xref="GOA:F8K9T8"
FT                   /db_xref="InterPro:IPR006892"
FT                   /db_xref="UniProtKB/TrEMBL:F8K9T8"
FT                   /protein_id="CBY85308.1"
FT                   /translation="MSTCHIQQQSILSMVLILRSFLLVINNLTVNLKKPTNKSLFLHPR
FT                   RTTSYSSTKLTHHLETISIIILYSSSTWLIIKHVKNLTKIHRGIPIRSSITNKEKHDIV
FT                   GVILLLNVLIHPYLTKNINGFDCETLSSTMSKPSSLGHIRNTTNNTSMLHIITLFI"
FT   CDS             complement(1095..1496)
FT                   /gene="C3"
FT                   /product="replication enhancer protein"
FT                   /db_xref="GOA:F8K9W0"
FT                   /db_xref="InterPro:IPR000657"
FT                   /db_xref="UniProtKB/TrEMBL:F8K9W0"
FT                   /protein_id="CBY85309.1"
FT                   /translation="MDSRTGELITAHQTENGVLIWTINNPLYFKTIKEIPLTHGNQTMV
FT                   EMQIRFNYNLRKELGIHKCFMNFRVWTISRPPTGLFLNVFRKQIMKYLYRIGVISINNV
FT                   IRAVNHVLYDVLQTTVESEFTHNIQIKLY"
FT   CDS             complement(1240..1644)
FT                   /gene="C2"
FT                   /product="transcriptional activator protein"
FT                   /db_xref="GOA:Q77PZ9"
FT                   /db_xref="InterPro:IPR000942"
FT                   /db_xref="UniProtKB/TrEMBL:Q77PZ9"
FT                   /protein_id="CBY85310.1"
FT                   /translation="MRPSSPSQIRCTQVPIKVQHREAKKRAIRRRRIDIPCGCTVYVAF
FT                   TCRDNGFTHRGTHHCASDREWRTYLDNQQSPVFQNHKGDSSNARESNNGRDADKIQLQP
FT                   QEGIGDSQMFHELQGLDDLTPSDWSFLKRI"
FT   CDS             complement(1544..2632)
FT                   /codon_start=1
FT                   /gene="C1"
FT                   /product="Rep protein"
FT                   /db_xref="GOA:F8K9U8"
FT                   /db_xref="InterPro:IPR001191"
FT                   /db_xref="InterPro:IPR001301"
FT                   /db_xref="InterPro:IPR022690"
FT                   /db_xref="InterPro:IPR022692"
FT                   /db_xref="UniProtKB/TrEMBL:F8K9U8"
FT                   /protein_id="CBY85311.1"
FT                   /translation="MAPPHRFQIYAKNYFLTFPKCSLTKEEALEQIQQISTASNKKYIK
FT                   ICRELHEDGQPHLHVLLQFEGKFKCQNQRLFDLVSPNRSTHFHPNIQGAKSSSDVKSYI
FT                   DKDGDTLEWGEFQIDGRSARGGQQTANDAYAAALNAGSKAEALRVIRELAPKDFVLQFH
FT                   NLNSNLERIFQEPPAPYVSPFLSSSFDQVPEELEEWAAENVVEAAARPSRPISIVIEGE
FT                   SRTGKTVWARSLGPHNYLCGHLDLSPKVFSNDAWYNVIDDVDPHYLKHFKEFMGAQKDW
FT                   QSNTKYGKPVQIKGGIPTIFLCNPGPNSSYKEYLDEDKNAHLKSWALKNATFITLSNPL
FT                   YSGTNQSSASGGQEESNQETQD"
FT   CDS             complement(2182..2475)
FT                   /gene="C4"
FT                   /product="C4 protein"
FT                   /db_xref="InterPro:IPR002488"
FT                   /db_xref="UniProtKB/TrEMBL:F8K9W4"
FT                   /protein_id="CBY85312.1"
FT                   /translation="MANLIYMCFSSSKGSSSAKIRDSSTWSPQIGQHISIQTFRELNPA
FT                   PTSSPTSTRMETHWNGENSRSTVDLQEEDNRPPMTLTPQRLTQEVRQRLLGL"
XX
SQ   Sequence 2763 BP; 689 A; 599 C; 619 G; 856 T; 0 other;
     accggtgggc gcgaaaaaaa aagtggtccc cgccccacgt gaacatgtcg cgcgagtgct        60
     gtacatgtcg cgcgatgctg tccaatcaga actcgcgctc tacgcattat aatttgaaat       120
     ttgaaatata aacttgctcc ctaagtttgt taggcataac tatgtgggat ccgttattga       180
     acgacttccc tgaatccgtt cacggtttcc gttgtatgct agccgtgaaa tatttgcagg       240
     ctgttcgaga gtcgtatgat ccttccactc ttgggtacga tcttcttagc gatctaatcg       300
     gagttgttcg ccgtaccaac tatgtcgaag cgaccagcag atatcatcat ttccactccc       360
     gcctcgaaag tgcgtcgccg tctgaacttc gacagtcccg ggttatcctc tgcacgtgcc       420
     cccactgtcc tcgtcacaaa caaacgtcgg gcgtggtcca acagacccaa ttacaggaag       480
     cccagaattt acagactgta tcggagccca gatgtcccaa aggggtgtga aggtccatgt       540
     aaggtccagt catatgaaca gcgtgatgat gtgaagcata ctggtattgt tcgttgtgtt       600
     tctgatgtga ccaagggaac tgggcttact catcgtactg gaaagcgttt cacaatcaaa       660
     tccatttata ttcttggtaa ggtatggatg gatgagaaca ttaagaagca gaatcacacc       720
     aacaatgtca tgtttttcct tgttcgtgat agaagaccgt atgggaattc cccgatggat       780
     tttggtcaag tttttaacat gtttgataat gagccaagta ctgctactgt aaagaatgat       840
     tatcgagatc gtttccaggt gatgcgtaag tttagtgcta ctgtaactgg tggtccttct       900
     gggatgaagg aacaggctct tgttcgtagg ttttttaaga ttaacagtca gattgtttat       960
     aaccaacagg aagctgcgaa gtatgagaac catactgaga atgctttgtt gttgtatatg      1020
     gcatgtactc atgcttctaa tcctgtgtat gctacgttaa agatacggat ctacttctac      1080
     gattcggtat ctaattaata taattttatt tgaatattgt gggtgaattc actttcgacg      1140
     gttgtttgca atacatcgta caaaacatga ttgactgccc taattacatt attaattgaa      1200
     attacgccta ttctatacaa atatttcata atctgcttcc taaatacgtt taagaaaaga      1260
     ccagtcggag ggcgtgagat cgtccagacc ctgaagttca tgaaacattt gtgaatcccc      1320
     aattccttcc tgaggttgta gttgaatctt atctgcatct ctaccattgt ttgattcccg      1380
     tgcgttagag gaatctcctt tatggttttg aaatacaggg gattgttgat tgtccagata      1440
     agtacgccat tctctgtctg atgcgcagtg atgagttccc ctgtgcgtga atccattgtc      1500
     tctgcacgtg aaggctacgt atactgtgca accacaaggg atgtcaatcc tgcgtctcct      1560
     gattgctctc ttcttggcct cccgatgctg aactttgatt ggtacctgag tacagcggat      1620
     ttgagagggt gatgaaggtc gcattcttta atgcccagga tttgagatgc gcgttcttgt      1680
     cctcgtctaa atactcttta tacgaggaat tgggacctgg attgcagagg aagattgttg      1740
     ggatgccccc tttaatttga actggtttcc cgtattttgt gttgctttgc cagtcctttt      1800
     gtgcacccat gaactcctta aagtgtttca gataatgcgg gtcaacatca tctatgacgt      1860
     tataccatgc gtcattactg aacacttttg ggcttagatc aagatggccg cacagataat      1920
     tgtgaggccc aagacttctg gcccatacgg tctttcctgt tctgctttcc ccttcaatca      1980
     ctatactaat cggtctactt ggccgcgcag cggcctcaac gacgttctcc gccgcccatt      2040
     cttcaagttc ttctggaact tggtcgaacg aagaagacaa aaaaggagaa acataaggag      2100
     ctggaggctc ctgaaaaatc ctttctaaat tactatttaa attatgaaat tgtaaaacaa      2160
     aatctttggg agctaactcc cttataaccc taagagcctc tgccttactt cctgcgttaa      2220
     gcgctgcggc gtaagcgtca ttggcggtct gttgtcctcc tcttgcagat ctaccgtcga      2280
     tctggaattc tccccattcc agtgtgtctc catccttgtc gatgtaggac ttgacgtcgg      2340
     agctggattt agctccctga atgtttggat ggaaatgtgt tgacctattt ggggagacca      2400
     ggtcgaagag tctctgattt tggcacttga acttcccttc gaactggaga agcacatgta      2460
     gatgaggttg gccatcctcg tgaagctctc tgcagatctt gatatatttc ttgtttgaag      2520
     ctgtgcttat ttgctgaatt tgctctaggg cttcttcttt ggttagagaa cattttggga      2580
     aagttaggaa ataattcttg gcatatattt ggaatctgtg gggaggagcc attgacttcg      2640
     tcaatcggta ctcagatgct tctctccaat atatcggtac tcaatatata gtgagtacca      2700
     aatggcattt tggtaaaaat cccaataatt ttgaaccccc atagcgccca ccgttctaat      2760
     att                                                                    2763
//