ID DQ641688; SV 1; circular; genomic DNA; STD; VRL; 2677 BP. XX AC DQ641688; XX DT 03-MAY-2007 (Rel. 91, Created) DT 21-DEC-2007 (Rel. 94, Last updated, Version 2) XX DE Corchorus golden mosaic virus segment DNA-A, complete sequence. XX KW . XX OS Corchorus golden mosaic virus OC Viruses; Geminiviridae; Begomovirus. XX RN [1] RP 1-2677 RX DOI; 10.1099/vir.0.83236-0. RX PUBMED; 18089756. RA Ha C., Coombs S., Revill P., Harding R., Vu M., Dale J.; RT "Molecular characterization of begomoviruses and DNA satellites from RT Vietnam: additional evidence that the New World geminiviruses were present RT in the Old World prior to continental separation"; RL J. Gen. Virol. 89(Pt 1):312-326(2008). XX RN [2] RP 1-2677 RA Ha C.V., Coombs S., Revill P.A., Harding R.M., Vu M.T., Dale J.L.; RT ; RL Submitted (18-MAY-2006) to the INSDC. RL Institute of Health and Biomedical Innovation, Queensland University of RL Technology, 2 George Street, Brisbane, Queensland 4001, Australia XX DR MD5; d891f5c31b0ae6ceb0944bc7045225fa. XX FH Key Location/Qualifiers FH FT source 1..2677 FT /organism="Corchorus golden mosaic virus" FT /segment="DNA-A" FT /host="Jute mallow (Corchorus capsularis, Tilliaceae); FT crop" FT /mol_type="genomic DNA" FT /country="Viet Nam:Hanoi" FT /note="acronym: CoGMV" FT /db_xref="taxon:390436" FT gene 225..968 FT /gene="AV1" FT CDS 225..968 FT /codon_start=1 FT /gene="AV1" FT /product="CP protein" FT /db_xref="GOA:A5H140" FT /db_xref="InterPro:IPR000263" FT /db_xref="InterPro:IPR000650" FT /db_xref="InterPro:IPR029053" FT /db_xref="UniProtKB/TrEMBL:A5H140" FT /protein_id="ABG26006.1" FT /translation="MKREAPWRTNAGTSKVRRALNFSPRSGLGPKASAWVNRPMYRKPR FT IYRTYRSPDVPKGCEGPCKVQSFEQRHDISHVGKVMCISDVTRGNGITHRVGKRFCIKS FT VYILGKVWMDDNIKLKNHTNSVMFWLVRDRRPYGTPMDFGQVFNMYDNEPSTATIKNDL FT RDRYQVLHRFASKVTGGQYASNEQSLVRRFWKVNNHVVYNHQEAAKYDNHTENALLLYM FT ACTHASNPVYATLKIRIYFYDSISN" FT gene complement(965..1357) FT /gene="AC3" FT CDS complement(965..1357) FT /codon_start=1 FT /gene="AC3" FT /product="REn protein" FT /db_xref="GOA:A5H141" FT /db_xref="InterPro:IPR000657" FT /db_xref="UniProtKB/TrEMBL:A5H141" FT /protein_id="ABG26009.1" FT /translation="MDLRTGDPITAYQAESSVFTWRVPNPLYFKIISVTDLGARHLLLL FT QIRFNHNLRKALQLHKCYLNFRVRTTSRWRISSFFQVFSSRIRRYLDDLGAIGINNVIR FT AIAHAADTLVYVHVEEENHIIKFNIY" FT gene complement(1095..1610) FT /gene="AC2" FT CDS complement(1095..1610) FT /codon_start=1 FT /gene="AC2" FT /product="TrAP protein" FT /db_xref="GOA:A5H142" FT /db_xref="InterPro:IPR000942" FT /db_xref="UniProtKB/TrEMBL:A5H142" FT /protein_id="ABG26008.1" FT /translation="MEGFQQSSSAILGLLHHIRSSSKRKRTKQSMTGPKKTSSTSPSKS FT HSLIPQIKNRHRHSKKVIRRRRIDLECGCSIYVSIHCRDDGFTHRGSHHCISGREFRFY FT LEGSKSPLFQDNICHRSRCETPTIAADKIQPQPEESTSAPQVLPEFQGSDDLQMEDFEF FT LSSIFFPN" FT gene complement(1405..2484) FT /gene="AC1" FT CDS complement(1405..2484) FT /codon_start=1 FT /gene="AC1" FT /product="rep protein" FT /db_xref="GOA:A5H143" FT /db_xref="InterPro:IPR001191" FT /db_xref="InterPro:IPR001301" FT /db_xref="InterPro:IPR022690" FT /db_xref="InterPro:IPR022692" FT /db_xref="UniProtKB/TrEMBL:A5H143" FT /protein_id="ABG26007.1" FT /translation="MSGRFKKQGVSFFLTWPKCPVTKESALDQIQALTLPTNIVYIRVC FT EEKHQDGSPHLHALVQFQKKFICTNCRLFDLSHPQNSRQFHCHIETARSSSDAKSYIEK FT DGVFCEWGTFQVDGRSARGGQQTVNEAYAKALNSGSKDEALNIIKELVPKDYVLQFHNL FT NQNLERIFAPPVNVFEPPFPLSSFNNVPAVINQWVNDNIMDAAARPFRPISIIIEGPSR FT TGKTLWARSLGRHNYLCGHLDLSPKVYSNEAWYNVIDDVDPHYLKHMKEFMGAQRDWQS FT NCKYGKPIQINGGIPTIFLCNPGPTSSYKEFFEEEKNKAINDWAKKNVIYVTIEEPFFN FT TTNQESTSALEESNSSETN" FT gene complement(2031..2333) FT /gene="AC4" FT CDS complement(2031..2333) FT /codon_start=1 FT /gene="AC4" FT /product="AC4 protein" FT /db_xref="InterPro:IPR002488" FT /db_xref="UniProtKB/TrEMBL:A5H144" FT /protein_id="ABG26010.1" FT /translation="MGRLTCMPWFNSRKNSSAQIVDYSTSPIHRTHVNSTVTLKLRAPP FT PTPNHISKKMECSANGALSRWTEDRREAVNRQLMRLMQRRLTAEVRTRLLISLRS" XX SQ Sequence 2677 BP; 723 A; 536 C; 589 G; 829 T; 0 other; accgtgcagc agccgccgct ttttccgtac actttaattt aaaatgaaat tgaaattgat 60 tggaacttta ctttagatgt ggccaatgat ataacacgtg gtgggtcatt ttagacgctt 120 ttatccttat gacataagtt tcaatttcat ttgaacttta tagcgctata aatttaaatt 180 tgaatgaatt tcaagtactg tcagtgttat agttcgaaaa gatgatgaaa cgtgaggccc 240 catggcgtac gaatgctggg acctccaagg tacgtcgcgc tttaaatttc tcccctcgta 300 gtggattggg cccaaaagcg tctgcttggg ttaatcggcc catgtataga aagcccagga 360 tttatcgaac gtatagatca cctgatgttc caaagggctg tgaaggccct tgtaaggtac 420 agtcatttga acagcgtcac gacatttctc atgtcggcaa ggtcatgtgt atatccgatg 480 tcacacgtgg taatggtatc acgcatcgtg ttggtaaacg tttttgtatt aagtctgttt 540 atatcctagg taaagtatgg atggacgata atattaaact taagaaccac actaacagtg 600 ttatgttctg gttagttagg gataggagac cgtatggtac tcccatggat tttggacagg 660 tttttaacat gtatgataac gagccgagta ccgctactat caagaacgat ctccgtgatc 720 gttaccaggt tttgcatagg tttgcgtcga aggttactgg tggacagtat gctagcaacg 780 aacagtctct tgtgagacga ttctggaagg tgaacaacca tgtggtgtac aaccatcaag 840 aagctgctaa gtacgataat cacactgaga atgcattatt attgtatatg gcatgtactc 900 atgcctctaa tcctgtgtat gctactttaa agatacggat ctatttctat gattcgatat 960 caaattaata aatattgaat tttattatat gattttcttc ttcaacatgt acatatacga 1020 gcgtatcagc agcatgtgca atcgctctaa ttacattgtt tatgccaatg gcacctaaat 1080 catctaagta acgtctaatt cgggaagaaa atacttgaaa gaaactcgaa atcctccatc 1140 tggaggtcgt ccgaaccctg aaattcaggt agcacttgtg gagctgaagt gctttcctca 1200 ggttgtggtt gaatcttatc tgcagcaata gtaggtgtct cgcacctaga tctgtgacag 1260 atattatctt gaaatagagg ggatttggaa ccctccaagt aaaaacggaa ctctctgcct 1320 gatatgcagt gatgggatcc cctgtgcgta aatccatcgt ctctgcagtg gatgctcaca 1380 tatatggagc aaccgcattc caggtcaatt cgtctccgac gaattacttt cttcgagtgc 1440 cgatgtcgat tcttgatttg tggtattaaa gaatggctct tcgatggtga cgtagatgac 1500 gttttttttg gcccagtcat tgattgcttt gttcttttcc tcttcgaaga actccttata 1560 tgatgaagta ggcccaggat tgcagaggaa gattgttgga atccctccat ttatttgaat 1620 tggtttgccg tacttgcagt tggactgcca gtctctctgg gcccccataa attccttcat 1680 atgctttaga tagtgcgggt cgacatcatc aatgacgttg taccaggcct catttgaata 1740 cacttttggg cttagatcaa gatgtccaca aagataattg tgacgaccca aacttctggc 1800 ccataatgtt ttccctgtcc tagatggacc ttcaatgata atagatatcg gcctaaaggg 1860 ccgcgcagcg gcatccatta tattatcgtt aacccactga ttaataacag caggaacgtt 1920 attaaacgat gatagtggga acggtggctc aaaaacatta actggaggag caaagatcct 1980 ctccaaattt tgattaagat tatgaaattg taatacataa tcctttggaa ctaactcctt 2040 aatgatatta agagcctcgt ccttacttcc gctgttaagc gcctttgcat aagcctcatt 2100 aactgtctgt tgaccgcctc tcgccgatct tccgtccacc tggaaagtgc cccattcgca 2160 gaacactcca tctttttcga tatatgattt ggcgtcggag gaggagcgcg cagtttcaat 2220 gtgacagtgg aattgacgtg agttctgtgg atgggagagg tcgaataatc gacaatttgt 2280 gcagatgaat tttttctgga attgaaccaa ggcatgcagg tgaggcgacc catcttgatg 2340 tttttcctca cagactctaa tataaacgat gttggttgga agggttaatg cttgtatttg 2400 gtctaaggct gactctttag taactgggca ttttggccac gtgaggaaaa aagaaacgcc 2460 ttgtttctta aaacgaccgc tcatcttgcg ttttaaatcg gggacaatca aagtctctga 2520 tatccgatat atcggggaca atatataggt ctcccaatat ttgtactaag agcgtgcaga 2580 gcctttttat acggacgcga agggcattat agtcatttcc cttagtaatt cagcgtgttt 2640 tttgggttcc aatccgctgc tgcacgctcc tattatt 2677 //