ID K02029; SV 1; circular; genomic DNA; STD; VRL; 2588 BP. XX AC K02029; XX DT 18-NOV-1986 (Rel. 10, Created) DT 29-MAY-2003 (Rel. 75, Last updated, Version 5) XX DE Tomato golden mosaic virus-Yellow vein component A, complete sequence. XX KW . XX OS Tomato golden mosaic virus-Yellow vein OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2588 RX PUBMED; 16453557. RA Hamilton W.D.O., Stein V.E., Coutts R.H.A., Buck K.W.; RT "Complete nucleotide sequence of the infectious cloned DNA components of RT tomato golden mosaic virus: potential coding regions and regulatory RT sequences"; RL EMBO J. 3(9):2197-2205(1984). XX DR MD5; 7c210c7297af6be221d46666f7f6ea60. DR EuropePMC; PMC1079959; 15784145. DR EuropePMC; PMC110531; 9811744. DR EuropePMC; PMC2816314; 20104645. DR EuropePMC; PMC2902183; 17532021. DR EuropePMC; PMC387707; 15078963. DR EuropePMC; PMC5904752; 29530481. DR GOA; P03562. DR GOA; P03563. DR GOA; P03567. DR GOA; P0C2W9. DR InterPro; IPR000657; Gemini_AL3. DR InterPro; IPR000942; Gemini_AL2. DR InterPro; IPR001191; Gemini_AL1_REP. DR InterPro; IPR001301; Gemini_AL1_CLV. DR InterPro; IPR002488; Gemini_C4. DR InterPro; IPR022690; Gemini_AL1_REP_cat-dom. DR InterPro; IPR022692; Gemini_AL1_REP_central. DR UniProtKB/Swiss-Prot; P03562; TRAP_TGMVY. DR UniProtKB/Swiss-Prot; P03563; REN_TGMVY. DR UniProtKB/Swiss-Prot; P03567; REP_TGMVY. DR UniProtKB/Swiss-Prot; P0C2W9; AC4_TGMVY. XX CC Geminiviruses are characterised by twin isometric virions, major CC capsid polypeptides of about 28 kd, and ss-DNA genomes. The genomes CC of cassava latent virus (CLV) and tomato golden mosaic virus (TGMV) CC consist of two circular components, while that of maize streak CC virus (MSV) consists of a single circle. EMBO J. 3, 2197-2205 CC (1984) identifies the following additional open reading frames on CC the complementary strand that would code for proteins with >10 kd: CC AL1 -- 13-1543 (passing through origin) CC AL2 -- 1601-1212 CC AL3 -- 1465-1067 CC The sequence at 1-235 is highly homologous to an equivalent region CC on component B; it doesn't appear to code for protein and has the CC potential to form a stable hairpin. An analogous region is found in CC CSV. CC The virion-sense (+) strand is shown below. XX FH Key Location/Qualifiers FH FT source 1..2588 FT /organism="Tomato golden mosaic virus-Yellow vein" FT /segment="component A" FT /mol_type="genomic DNA" FT /db_xref="taxon:223341" FT CDS 327..1070 FT /codon_start=1 FT /product="coat protein" FT /note="AR1" FT /db_xref="GOA:P03560" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/Swiss-Prot:P03560" FT /protein_id="AAA46582.1" FT /translation="MPKRDAPWRLMAGTSKVSRSANYSPRGSLPKRDAWVNRPMYRKPR FT IYRSLRGPDVPKGCEGPCKVQSYEQRHDISLVGKVMCISDVTRGNGITHRVGKRFCVKS FT VYILGKIWMDENIKLKNHTNSVMFWLVRDRRPYGTPMDFGQVFNMFDNEPSTATVKNDL FT RDRFQVIHRFHAKVTGGQYASNEQALVRRFWKVNNNVVYNHQEAGKYENHTENALLLYM FT ACTHASNPVYATLKIRIYFYDSITN" XX SQ Sequence 2588 BP; 672 A; 513 C; 605 G; 798 T; 0 other; gatgcgatgg catttttgta attaagaggc ttactaccaa ttgaggaggg gctccaaaag 60 ttatatgaat tggtagtaag gtagctctta tatattagaa gttcctaagg ggcacgtggc 120 ggccatccgt ttaatattac cggatggccg cgcgatcgtc acccgacccg cttccgcaaa 180 ttacgccgca ttgtcgtcta agtggtcccg catatgtgaa gggccaatca tatttggccc 240 tgaaatctaa gatattttta aagacttgtg gttaagttgt taaagttata taaaacgaca 300 tgcgtttcgt ggatctttaa ttcaaaatgc ctaagcggga tgccccatgg cgtttaatgg 360 cggggacctc aaaggtttcc cgctctgcta attattctcc tcgaggaagt ttgcctaagc 420 gtgatgcttg ggttaacagg cccatgtaca ggaagcccag gatatatcga tcactaagag 480 gccccgatgt tcctaaagga tgtgaagggc cttgtaaagt ccagtcatac gagcagcgtc 540 atgatatttc cctagttggg aaggtcatgt gtatatctga tgtgacacgt ggtaacggta 600 ttacccaccg tgttggtaag cgtttctgcg ttaagtctgt atatatcttg ggcaagatat 660 ggatggatga gaacatcaag ttgaagaatc acacgaacag tgtcatgttc tggttggtta 720 gggatcggag accttatggc actcctatgg atttcggaca agtgttcaac atgttcgata 780 atgagccaag tactgcaacg gtaaagaacg acctacggga tcgtttccaa gtgatccaca 840 ggtttcacgc caaggttact ggtggtcaat atgccagcaa cgagcaggct ctggttagga 900 gattctggaa ggtcaataac aatgtcgtct acaaccacca ggaggcaggg aaatatgaga 960 atcatactga gaacgccctg ttattgtata tggcatgtac tcatgcctct aaccctgtgt 1020 atgcgacgtt gaaaattcga atctattttt atgattcgat aacaaattaa taaaatttat 1080 attttattga atgattttcg agtacatgcg ttatatatga tctgtctgtt gcgaaacgaa 1140 cagctctaat aacattgtta atacatataa cgcctaactg ttcaaggtac aacatcacta 1200 agtatttaaa tctatttaaa taagttctcc cagaagctgt cgtcgatgtc gtccatactt 1260 ggaagttgag aaatgccttg tggagatcca atgctctcct caggttgtgg ttgaacctga 1320 tttgtaagtg gtatatcctg gtgttggtgt agaggggatc ctctacgctg attatcttga 1380 aatagagggg atttgttatc tcccagatat agacgccatt ctctgcttga ggcacagtga 1440 taggttcccc tgtgcgtgaa tccattgttt ctgcagtcga tgtgaatgta tatggaacag 1500 ccacagttca ggtcaattcg tcgccttcta atagctcttc gtttagctgc tctgtgttga 1560 gctttgatag aggggggagt tgaggaagac gaatttcgca ttatggaaag tccagttctt 1620 tagtggagtg ttttcctctt tgtcgaggaa aactttatag ctagcaccct ctccaggatt 1680 gcacagcacg attgacggga tacctccttt aatttgaact ggctttccgt atttacagtt 1740 agtctgccaa tctctttggg ccccaatgag ttctttccaa tgtttcaact ttagatattg 1800 cggtgtgaca tcatcgatga cgttatactc aaccttgttt gagtaaaccc tagaattgag 1860 atccaaatgc ccgctcaaat aattatgtgg gcctagtgaa cgagcccaca tagtctttcc 1920 cgtccgacta tcgccctcga tgataatact aataggtctc tccggccgcg cagcggaact 1980 ctttccaaaa taattttcag cccattgtct catctcgtct ggcacgttag taaatgatga 2040 gacgtggaac ggaggaagcc atggttcagg agtcttatca aatatcctat ctaaattgct 2100 atttagattg tggaactgaa ataaatattt ttctgggatt ttctctctaa ttatctgcag 2160 ggcttcttct ttggaagaag catttaacgc ctctgctgca gcgtcgttag atgtttggca 2220 acctcctcta gcacttcgac cgtcgacctg gaattctccc catacaagag tatctccgtc 2280 tttgtcgatg tacgtcttga cgtcggaaga cgatttagct ctctgaatgt ttggatggaa 2340 atgtgctgac cttgttgggg ataccaggtc gaagaatcgt tgattttggc agcagtattt 2400 tccctcgaac tgaataagca cgtggaggtg aggttgccca tcttcatgaa gctctctgca 2460 gatttttatg aattttttgt taatcggagt gtttagggct tgtaattgag aaagtgattc 2520 ttctttggac aaggagcact gaggatatgt aagaaaataa tttttggcat ttatttgaaa 2580 ccgttttg 2588 //