ID   M10080; SV 1; circular; genomic DNA; STD; VRL; 2587 BP.
XX
AC   M10080;
XX
DT   02-JUL-1986 (Rel. 09, Created)
DT   29-MAY-2003 (Rel. 75, Last updated, Version 6)
XX
DE   Bean golden yellow mosaic virus-[Puerto Rico] DNA2, complete sequence.
XX
KW   .
XX
OS   Bean golden yellow mosaic virus-[Puerto Rico]
OC   Viruses; Geminiviridae; Begomovirus.
XX
RN   [1]
RP   1-2587
RX   DOI; 10.1073/pnas.82.11.3572.
RX   PUBMED; 16593562.
RA   Howarth A.J., Caton J., Bossert M., Goodman R.M.;
RT   "Nucleotide sequence of bean golden mosaic virus and a model for gene
RT   regulation in geminiviruses";
RL   Proc. Natl. Acad. Sci. U.S.A. 82(11):3572-3576(1985).
XX
DR   MD5; 72c2692b83f69699460dd2606555dee2.
XX
CC   Draft entry and sequence in computer readable form kindly provided
CC   by A.J.Howarth (26-SEP-1985).
CC   Bean golden mosaic virus consists of two circular ss-DNA molecules,
CC   DNA 1 and DNA 2. The sense of the strand below is the same as that
CC   of the viral ss-DNA.
CC   The 'common regions', positions 1-205, are identical in DNA 1 and
CC   DNA 2. There is no sequence homology in the regions flanking these
CC   areas. A repeat is located at nucleotides 18-37 and 40-59 and an
CC   inverted repeat at 149-184. This inverted repeat may form a stable
CC   stem-loop structure with a 12 bp stem and a 12 bp loop. REFERENCE 1
CC   suggests that late genes are expressed in a clockwise and early
CC   genes in a counterclockwise direction. Two ORFs (see FEATURES) on
CC   DNA 2 compare favorably with the ORFs of tomato golden mosaic virus
CC   and cassava latent virus with respect to length, position and
CC   partial homology. REFERENCE 1 identified additional ORFs as well.
XX
FH   Key             Location/Qualifiers
FH
FT   source          1..2587
FT                   /organism="Bean golden yellow mosaic virus-[Puerto Rico]"
FT                   /segment="DNA2"
FT                   /mol_type="genomic DNA"
FT                   /note="ICTV7 type organism"
FT                   /db_xref="taxon:222448"
FT   repeat_region   1..205
FT                   /note="common region repeat"
FT   CDS             510..1280
FT                   /codon_start=1
FT                   /product="putative insect transmission product"
FT                   /note="29.7 kDa protein"
FT                   /db_xref="GOA:P0CK41"
FT                   /db_xref="InterPro:IPR000263"
FT                   /db_xref="InterPro:IPR001530"
FT                   /db_xref="UniProtKB/Swiss-Prot:P0CK41"
FT                   /protein_id="AAA46322.1"
FT                   /translation="MYASKYKRGSSNYQRRGYSRYQGFRRTAIVTRHDGKRRQHQSNKS
FT                   NEDPKMLVQCIRENQFGPDFVMSHNTAISTFINYPQLGKIEPNRCRSYIKLKRLRFKGT
FT                   VKIERMHTDVNMDGLSPKIEGVFSIVIVVDRKPHLSPSGCLHTFDELFGARINSHGNLA
FT                   VMPSLKDRFYIRHLLKRVLSVDKDTTMIDVEGSTLLSNKRYNMWSTFNDFDHDSCNGVY
FT                   ANIAKNALLVYYCWMSDIMSKASTFVSYDLDYVG"
FT   CDS             complement(1314..2195)
FT                   /codon_start=1
FT                   /product="33.1 kDa protein"
FT                   /db_xref="GOA:P0CK43"
FT                   /db_xref="InterPro:IPR000211"
FT                   /db_xref="UniProtKB/Swiss-Prot:P0CK43"
FT                   /protein_id="AAA46323.1"
FT                   /translation="MDSQLANPPNAFNYIESQRDEYQLSHDLTEIILQFPSTASQLSAR
FT                   FSRSCMKIDHCVIEYRQQVPINATGSVVVEIHDKRMTDNESLQASWTFPIRCNIDLHYF
FT                   SSSFFSLKDPIPWKLYYRVSDTNVHQRTHFAKFKGKLKLSTAKHSVDIPFRAPTVKILS
FT                   KQFSNKDIDFSHVDYGKWERKLIRSASLSKYGLQGPIELKPGESWASKSTIGVSHLDAD
FT                   SDLDSAIHPYKNLNRLGSSVLDPGDSASIIGAQRAQSNITLSIAQLNEIVRSTVNECIN
FT                   NNCIHVQPKSLK"
XX
SQ   Sequence 2587 BP; 761 A; 441 C; 511 G; 874 T; 0 other;
     tggcatattt gtaaatatgc gagtgtctcc aaatgagttt gcgagtgtct ccaattgagg        60
     ctcctcaaac tctcgctatg caattggaga ctggagtaca atatatacta gaaccctcaa       120
     tctcttgaat tatcacatcc atacacgtgg cggccatccg atataatatt accggatggc       180
     cgcccgcgcc cctttatatc cgtacgccgc atttcgattc cagaaatgcc cttcccacgc       240
     tttatacacg gtattctata ccgttggata aagcttattg cactctatca ttgaattttg       300
     aattattgtc attgcttttg tgtttcgata tttatggatg ggtctgacac attactgaca       360
     ttgtacaatt cgaccaatca aaattcacga tagaagtcta gttaaatcgt ttattaatat       420
     gagatcctta taagtaaaca tatttgataa agaaaaaacg atcatcgttt ttaatgtttg       480
     tcttgttatt ttaggcaaac gatttaacaa tgtatgcgtc taaatataaa cgtggttcgt       540
     ctaactatca acgacgaggt tattcacgtt atcaaggttt tcgacgaacg gcaattgtta       600
     cacgacacga tggtaagcgt cgacaacatc aatctaataa gtctaatgaa gatcctaaga       660
     tgttagtgca atgtatacgt gaaaatcagt ttggtcctga ttttgttatg tcccataata       720
     ctgcaatatc gacgttcatt aattatcctc aacttggtaa gatcgaacct aatcgatgta       780
     ggtcatatat caagttgaaa cgcttgcgtt tcaaagggac ggttaagatt gaacgcatgc       840
     acactgatgt taacatggat ggattatctc ctaagattga aggcgtattt agtattgtta       900
     ttgtggttga tcggaaacca catttaagtc ctagtggatg tctccataca tttgacgaat       960
     tgtttggagc aaggataaat agccatggaa acctagctgt gatgccttct ttgaaagata      1020
     gattttacat acggcatttg ttgaagcgtg ttttatctgt ggacaaagac acgactatga      1080
     tagacgttga gggttctact ttgttgtcta ataagcgtta taatatgtgg tctacattta      1140
     atgattttga tcatgactca tgtaatggtg tttatgctaa tattgctaag aacgctttat      1200
     tagtctatta ttgctggatg tcagatatta tgtctaaggc atccacattt gtatcatatg      1260
     atcttgatta tgttggttga acgtcgatct aacacaagaa taaatatacg tatttatttc      1320
     aaagactttg gttgaacatg aatacaattg ttgttgatac attcatttac cgttgatcgc      1380
     acaatttcgt ttaattgtgc aattgacagt gttatgtttg attgggctct ctgagccccg      1440
     ataattgacg ctgaatctcc cgggtccaaa acactgcttc ccagtctgtt taaattcttg      1500
     tatggatgta ttgcgctatc taagtccgag tccgcatcta aatgagatac accgattgta      1560
     ctttttgagg cccatgactc acccggcttt aattcgattg ggccttgtag tccatacttt      1620
     gataatgatg cggatcgtat caatttcctt tcccacttcc cgtagtcgac atgtgagaag      1680
     tcgatgtcct tattggaaaa ctgctttgat aatattttta ctgttggtgc ccggaagggg      1740
     atatctaccg aatgtttagc cgttgataat ttcagttttc ccttgaattt tgcgaaatgt      1800
     gtcctttgat gaacgttggt gtctgaaact ctgtaataga gtttccatgg tatcgggtct      1860
     tttaacgaga aaaatgacga agagaagtag tggagatcta tgttgcatct tatgggaaac      1920
     gtccatgatg cttgtaatga ttcattgtct gtcatccttt tgtcgtgaat ctccactaca      1980
     accgatcctg tcgcgtttat aggcacctgt tgcctatact caattacaca atgatcgatc      2040
     ttcatacaac tgcgactgaa tcgtgctgat agttgagacg ctgtggaagg aaattgaagg      2100
     attatctcag ttagatcatg agacaattga tattcgtctc tctgagactc tatgtaatta      2160
     aatgcatttg gaggatttgc taactgagaa tccatttaag aataaatggc cgcgcagcgg      2220
     aattgctcag agagataagg tccactcaac taatggatag ctaacaaata agaactatat      2280
     ttgtgaaaca ctcatacaac tgatatactg acaaggagaa tgtgaaatag attatagtga      2340
     gatagagaag cattgttgta acttatatag acagttgtat gcgattaaag ctctttttga      2400
     aaaagaacat attatttaag taatgatcat gaaatatata tcttatatat atatattgta      2460
     tgaattggac atgattattg ttatactaaa tgtcgtacga tgatcagaag ttaatatgga      2520
     ttatattgtt ttttatgaat tatttgaagg ttaataagca aaacgaagcc gttttgaact      2580
     tattcgt                                                                2587
//