ID M10070; SV 1; circular; genomic DNA; STD; VRL; 2646 BP. XX AC M10070; XX DT 02-JUL-1986 (Rel. 09, Created) DT 17-APR-2005 (Rel. 83, Last updated, Version 9) XX DE Bean golden yellow mosaic virus-[Puerto Rico] DNA1, complete sequence. XX KW . XX OS Bean golden yellow mosaic virus-[Puerto Rico] OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2646 RX DOI; 10.1073/pnas.82.11.3572. RX PUBMED; 16593562. RA Howarth A.J., Caton J., Bossert M., Goodman R.M.; RT "Nucleotide sequence of bean golden mosaic virus and a model for gene RT regulation in geminiviruses"; RL Proc. Natl. Acad. Sci. U.S.A. 82(11):3572-3576(1985). XX DR MD5; 016ece6a929ba061e3b08592281abed7. DR EuropePMC; PMC5408669; 28358318. DR InterPro; IPR002488; Gemini_C4. DR UniProtKB/Swiss-Prot; P0C6G0; AC4_BGYMV. XX CC Draft entry and sequence in computer readable form kindly provided CC by A.J.Howarth (26-SEP-1985). CC Bean golden mosaic virus consists of two circular ss-DNA molecules, CC DNA 1 and DNA 2. The sense of the strand below is identical to CC that of the viral ss-DNA. CC The 'common regions', positions 1-205, are identical in DNA 1 and CC DNA 2. There is no sequence homology in the regions flanking these CC areas. A repeat is located at nucleotides 18-37 and 40-59 and an CC inverted repeat at 149-184. This inverted repeat may form a stable CC stem-loop structure with a 12 bp stem and a 12 bp loop. 'at' rich CC regions are found at positions 220-240 and 320-340. REFERENCE 1 CC suggests that late genes are expressed in a clockwise and early CC genes in a counterclockwise direction. Four ORFs (see FEATURES) on CC DNA 1 compare favorably with the ORFs of tomato golden mosaic virus CC and cassava latent virus with respect to length, position and CC partial homology. REFERENCE 1 noted additional ORFs also. XX FH Key Location/Qualifiers FH FT source 1..2646 FT /organism="Bean golden yellow mosaic virus-[Puerto Rico]" FT /segment="DNA1" FT /mol_type="genomic DNA" FT /note="ICTV7 type organism" FT /db_xref="taxon:222448" FT CDS complement(join(1591..2646,1..6)) FT /codon_start=1 FT /product="40.2 kDa protein" FT /db_xref="GOA:P0CK39" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/Swiss-Prot:P0CK39" FT /protein_id="AAA46318.1" FT /translation="MPPPQRFRVQSKNYFLTYPRCTIPKEEALSQLQKIHTTTNKKFIK FT VCEERHDNGEPHLHALIQFEGKFICTNKRLFDLVSTTRSAHFHPNIQGAKSSSDVKEYI FT DKDGVTIEWGQFQVDGRSARGGQQSANDSYAKALNADSIESALTILKEEQPKDYVLQNH FT NIRSNLERIFFKVPEPWVPPFPLSSFVNIPVVMQDWVDDYFGRGSAARPERPISIIVEG FT DSRTGKTMWARALGPHNYLSGHLDFNSLVYSNSVEYNVIDDITPNYLKLKDWKELIGEQ FT KDWQSNCKYGKPVQIKGGIPSIVLCNPGEGSSYKDFLNKEEKPALHNWTIHNAIFVTLT FT APLYQSTAQDCQT" FT repeat_region 1..205 FT /note="common region repeat" FT CDS 399..1124 FT /codon_start=1 FT /product="putative coat protein" FT /note="27.7 kDa protein" FT /db_xref="GOA:P0CK33" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/Swiss-Prot:P0CK33" FT /protein_id="AAA46319.1" FT /translation="MAGTSKVSRSGNYSPSGGMGSKSNKANAWVNRPMYRKPRIYRMYK FT SPDVPKGCEGPCKVQSYEQRHDISHVGKVMCISDITRGNGITHRVGKRFCVKSVYILGK FT IWMDENIMLKNHTNSVIFWLVRDRRPYGTPMDFGQVFNMFDNEPSTATVKNDFRDRYQV FT MHRFNAKVSGGQYASNDQALVRRFWKVNNHVVYNHQEAGKYENHTENALLLYMACTHAS FT NPVYATLKIRIYVYDSITN" FT CDS complement(1121..1519) FT /codon_start=1 FT /product="15.6 kDa protein" FT /db_xref="GOA:P0CK35" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/Swiss-Prot:P0CK35" FT /protein_id="AAA46320.1" FT /translation="MDSRTGENITAHQAENSVFIWEVPNPLYFKIMRVEDPAYTRTRIY FT HIQIRFNHNLRKALDLHKAFLNFQVWTTSIQASGTTYLNRFRLLVLLYLHRLGVIGINN FT VIRAVQFATNKSYVNTVLENHDIKYKFY" FT CDS complement(1266..1784) FT /codon_start=1 FT /product="19.6 kDa protein" FT /db_xref="GOA:P0CK37" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/Swiss-Prot:P0CK37" FT /protein_id="AAA46321.1" FT /translation="MESRFKLKEEYHQSCCAIQVRVPVIKTSSTKRKNQLYTTGPFIMR FT SSSPSQPPSIKAQHRIAKHKAIRRRRIDLNCGCSIFYHIKCADHGFTHRGEHHCASGRE FT FRFYLGGTKSPLFQDHAGGRSSIHTDKDIPHPNQVQSQPQESTGSPQSIPELPSLDDID FT SSFWDDIFK" XX SQ Sequence 2646 BP; 709 A; 517 C; 587 G; 833 T; 0 other; tggcatattt gtaaatatgc gagtgtctcc aaatgagttt gcgagtgtct ccaattgagg 60 ctcctcaaac tctcgctatg caattggaga ctggagtaca atatatacta gaaccctcaa 120 tctcttgaat tatcacatcc atacacgtgg cggccatccg atataatatt accggatggc 180 cgcccgcgcc cctttatatc cgtactgcta cacgtggtgc tttaatttaa attaaagatg 240 tctatttttg actgaccaat gcttttgcat gtgagaagct tagatatttg tgtaaaactt 300 ggcgactaag ttttaccttc gtttataaat ttaaattaaa tgtatgccca ttccacgtgt 360 aagtccagaa tgcctaagcg tgatgcgccg tggctcatat ggcgggaacc tccaaggttt 420 cccgttctgg caattattct ccaagtggtg gaatgggctc aaaatccaac aaggccaatg 480 catgggtcaa caggcccatg tatagaaagc caaggatata tcggatgtac aaaagcccag 540 atgtgccaaa gggatgtgaa ggaccttgca aggtccaatc atatgaacaa cgccatgata 600 tatctcatgt tggtaaggtt atgtgtatat ccgatatcac acgtggtaat ggtattactc 660 atcgtgttgg taaacgtttt tgtgtgaagt ctgtgtatat tttaggtaag atatggatgg 720 atgaaaacat catgcttaag aaccatacca atagtgtcat tttttggttg gttcgtgacc 780 gtagaccata tggaacccct atggattttg gtcaagtttt taacatgttt gacaatgaac 840 ctagtactgc tacggtcaag aacgattttc gtgatcgtta tcaagttatg cataggttca 900 atgcaaaggt ttctggtggt caatatgcaa gcaacgatca agccttggta aggcgttttt 960 ggaaggtgaa caaccatgtc gtctataacc accaggaagc aggaaaatac gagaatcata 1020 cggagaatgc gttattgttg tatatggcat gtacacatgc ctctaatcct gtatatgcga 1080 cattgaaaat tcggatctat gtctatgatt cgataaccaa ttaataaaat ttatatttta 1140 tatcatgatt ctcaagtaca gtatttacat atgatttgtt tgttgcgaac tgaacagctc 1200 taatgacatt gtttattcct attacgccta acctatgtaa atacaataaa actaagagtc 1260 taaatctatt taaatatgtc gtcccagaag cttgaatcga tgtcgtccag acttggaagt 1320 tcaggaatgc tttgtggaga tccagtgctt tcctgaggtt gtgattgaac ctgatttgga 1380 tgtggtatat ccttgtccgt gtgtatgctg gatcttccac ccgcatgatc ttgaaataaa 1440 ggggatttgg tacctcccaa ataaaaacgg aattctctgc ctgatgcgca gtgatgttct 1500 cccctgtgcg tgaatccatg atctgcgcac ttgatatggt aaaatatgga acagccgcag 1560 ttcaagtcaa tgcgtcgtcg acgaatggct ttatgtttgg caatcctgtg ctgtgctttg 1620 atagaggggg gctgtgaggg tgacgaagat cgcattatga atggtccagt tgtgtaaagc 1680 tggtttttcc tctttgttga ggaagtcttt ataactggaa ccctcacctg gattgcacag 1740 cacgattgat ggtattcctc ctttaatttg aaccggcttt ccatatttac agttggattg 1800 ccagtccttt tgttccccaa ttagctcttt ccagtccttt aacttcaaat aattcggggt 1860 tatgtcatca atgacgttgt attccactga gttcgaatag acaagtgaat taaagtccaa 1920 atgaccgctc aaataattat gtgggcctaa tgcacgagcc cacattgtct ttccagttcg 1980 tgaatcacct tcgacgatga tactaatagg tctttctggc cgcgcagcgg aaccccttcc 2040 gaaatagtcg tcaacccagt cttgcataac aaccggaata ttgacgaatg atgacaacgg 2100 aaatggagga acccatggtt ccggcacttt gaagaagatc cgttcgagat tagaacggat 2160 gttgtgattt tgaaggacgt aatctttcgg ttgttcttcc ttcaatattg tcaaggcaga 2220 ttcaattgaa tctgcgttta atgcctttgc gtatgagtcg ttggcagact gctgacctcc 2280 tcttgcagat ctgccgtcga cttggaattg tccccattcg attgtgactc catctttgtc 2340 gatgtattct ttgacgtcgg aacttgattt agctccctga atgttcggat ggaaatgtgc 2400 tgacctggtt gtggatacca ggtcgaacaa tcttttattt gtgcagatga atttaccttc 2460 gaactgaata agcgcatgaa gatggggttc accattatcg tgacgttcct cacagacctt 2520 gatgaatttc ttattcgtcg ttgtatgaat cttttgaagt tgcgaaagag cttcttcttt 2580 cggtatagtg caacgaggat aagtgaggaa atagtttttg gactgaactc taaatctttg 2640 aggtgg 2646 //