ID M10376; SV 1; circular; genomic DNA; STD; VRL; 8016 BP. XX AC M10376; J02047; XX DT 09-MAR-1987 (Rel. 11, Created) DT 04-MAR-2000 (Rel. 63, Last updated, Version 5) XX DE Cauliflower mosaic virus (altered virulence isolate D/H), complete genome. XX KW coat protein; complete genome. XX OS Cauliflower mosaic virus OC Viruses; Ortervirales; Caulimoviridae; Caulimovirus. XX RN [1] RP 1-8016 RX DOI; 10.1016/0378-1119(82)90013-0. RX PUBMED; 7152260. RA Balazs E., Guilly H., Jonard G., Richards K.; RT "Nucleotide sequence of DNA from an altered-virulence isolate D/H of the RT cauliflower mosaic virus"; RL Gene 19(3):239-249(1982). XX DR MD5; bea76f3158cb688f4f96a85192b40cf5. DR EPD; EP07015; CAMV_35MJ. DR EuropePMC; PMC1440423; 16571798. DR EuropePMC; PMC4498817; 26162084. XX CC The beta-strand is shown below. XX FH Key Location/Qualifiers FH FT source 1..8016 FT /organism="Cauliflower mosaic virus" FT /isolate="Cabb-D/H" FT /mol_type="genomic DNA" FT /db_xref="taxon:10641" FT misc_feature join(7323..8016,1..364) FT /note="long intergenic region" FT CDS 13..303 FT /codon_start=1 FT /note="ORF7; putative" FT /db_xref="UniProtKB/TrEMBL:Q83163" FT /protein_id="AAA46344.1" FT /translation="MNRSMTKTQEDKTSPKYQRVLNSKNKRSFKIKNSSLTPVTDRFTT FT VRFQNNIECVYANFDSQLKSSYDGRSKKIKTLSLKNLRCYETFLRKYLLEQ" FT CDS 365..1348 FT /codon_start=1 FT /note="ORF1; putative" FT /db_xref="GOA:P03547" FT /db_xref="InterPro:IPR028919" FT /db_xref="UniProtKB/Swiss-Prot:P03547" FT /protein_id="AAA46345.1" FT /translation="MDLYPEENTQSEQSQNSENNMQIFKSETSDGFSSDLKISNDQLKN FT ISKTQLTLEKEKIFKMPNVLSQVMKKAFSRKNEILYCVSTKELSVDIHDATGKVYLPLI FT TKEEINKRLSSLKPEVRRTMSMVHLGAVKILLKAQFRNGIDTPIKIALIDDRINSRRDC FT LLGAAKGNLAYGKFMFTVYPKFGISLNTQRLNQTLSLIHDFENKNLMNKGDKVMTITYI FT VGYALTNSHHSIDYQSNATIELEDVFQEIGNIQQSEFCTIQNDECNWAIDIAQNKALLG FT AKTKTQIGNSLQIGNIASSSSTENELARVSQNIDLLKNKLKEICGE" FT CDS 1345..1824 FT /codon_start=1 FT /note="ORF2; putative" FT /db_xref="GOA:P03550" FT /db_xref="InterPro:IPR004917" FT /db_xref="UniProtKB/Swiss-Prot:P03550" FT /protein_id="AAA46346.1" FT /translation="MSITGQPHVYKKDTIIRLKPLSLNSNNRSYVFSSSKGNIQNIINH FT LNNLNKIVGRSLLGIWKINSYFGLSKDPSESKSKNPSVFNTAKTIFKSGGVDYSSQPKE FT IKSLLEAQNTRIKSLEKAIQSLDEKIEPEPLTKEEVKELKESINSIKEGLKNIIG" FT CDS 1826..2215 FT /codon_start=1 FT /note="ORF3; putative" FT /db_xref="GOA:P03553" FT /db_xref="InterPro:IPR004986" FT /db_xref="UniProtKB/Swiss-Prot:P03553" FT /protein_id="AAA46347.1" FT /translation="MANLNQIQKEVSEILSDQKSMKADIKAILELLGSQNPIKESLETV FT AAKIVNDLTKLINDCPCNKEILEALGNQPKEQLIGQPKEKGKGLNLGKYSYPNYGVGNE FT ELGSSGNPKALTWPFKAPAGWPNQY" FT CDS 2197..3669 FT /codon_start=1 FT /note="coat protein (gene IV)" FT /db_xref="GOA:P03544" FT /db_xref="InterPro:IPR001878" FT /db_xref="InterPro:IPR001988" FT /db_xref="InterPro:IPR036875" FT /db_xref="UniProtKB/Swiss-Prot:P03544" FT /protein_id="AAA46348.1" FT /translation="MAESILDRTINRFWYKLGDDCLSESQFDLMIRLMEESLDGDQIID FT LTSLPSDNLQVEQVMTTTEDSISEEESEFLLAIGETSEEESDSGEEPEFEQVRMDRTGG FT TEIPKEEDGGEPSRYNERKRKTTEDRYFPTQPKTIPGQKQTTMGMLNIDCQANRRTLID FT DWAAEIGLIVKTNREDYLDPETILLLMEHKTSGIAKELIRNTRWNRTTGDIIEQVIDAM FT YTMFLGLNYSDNKVAEKIEEQEKAKIRMTKLQLCDICYLEEFTCDYEKNMYKTELADFP FT GYINQYLSKIPIIGEKALTRFRHEANGTSIYSLGFAAKIVKEELSKICDLTKKQKKLKK FT FNKKCCSIGEASVEYGCKKTSKKKYHKRYKKKYKAYKPYKKKKKFRSGKYFKPKEKKGS FT KQKYCPKGKKDCRCWICNIEGHYANECPNRQSSEKAHILQQAEKLGLQPIEEPYEGVQE FT VFILEYKEEEEETSTEEDDGSSTSEDSDSESD" FT CDS 3260..3583 FT /codon_start=1 FT /note="ORF8; putative" FT /db_xref="UniProtKB/TrEMBL:Q83164" FT /protein_id="AAA46349.1" FT /translation="MDARRHPRRSIIKDTRKNIRLINLIRRRRNSGQENTSSPKKRRAL FT SKSIAQRARKTADVGSAISKAITPTNVLIDKAQRRLTSFNKQRNWVSSPSKNPTKEFKK FT YSS" FT CDS 3623..5650 FT /codon_start=1 FT /note="ORF5; putative" FT /db_xref="GOA:P03556" FT /db_xref="InterPro:IPR000477" FT /db_xref="InterPro:IPR000588" FT /db_xref="InterPro:IPR021109" FT /db_xref="InterPro:IPR041373" FT /db_xref="UniProtKB/Swiss-Prot:P03556" FT /protein_id="AAA46350.1" FT /translation="MMDHLLQKTQIQNQTEQVMNITNPNSIYIKGRLYFKGYKKIELHC FT FVDTGASLCIASKFVIPEEHWINAERPIMVKIADGSSITINKVCRDIDLIIAGEIFHIP FT TVYQQESGIDFIIGNNFCQLYEPFIQFTDRVIFTKDRTYPVHIAKLTRAVRVGTEGFLE FT SMKKRSKTQQPEPVNISTNKIAILSEGRRLSEEKLFITQQRMQKIEELLEKVCSENPLD FT PNKTKQWMKASIKLSDPSKAIKVKPMKYSPMDREEFDKQIKELLDLKVIKPSKSPHMAP FT AFLVNNEAEKRRGKKRMVVNYKAMNKATVGDAYNPPNKDELLTLIRGKKIFSSFHCNSG FT FWQVLLDQESRPLTAFTCPQGHYEWNVVPFGLKQAPSIFQRHMDEAFRVFRKFCCVYVD FT DILVFSNNEEDHLLHVAMILQKCNQHGIILSKKKAQLFKKKINFLGLEIDEGTHKPQGH FT ILEHINKFPDTLEDKKQLQRFLGILTYASDYIPKLAQIRKPLQAKLKENVPWKWTKEDT FT LYMQKVKKNLQAFPPLHHPLPEEKLIIETDASDDYWGGMLKAIKINEGTNTELICRYAS FT GSFKAAEKNYHSNDKETLAVINTIKKFSIYLTPVHFLIRTDNTHFKSFVNLNYKGDSKL FT GRNIRWQAWLSHYSFDVEHIKGTDNHFADFLSREFNRVNS" FT misc_feature 5651..5753 FT /note="small intergenic region" FT mRNA 5679..>8016 FT /note="major viral mRNA" FT CDS 5754..7322 FT /codon_start=1 FT /product="inclusion body protein" FT /note="major virus-specific in vitro translation product; FT gene VI)" FT /db_xref="GOA:P03557" FT /db_xref="InterPro:IPR009027" FT /db_xref="InterPro:IPR011320" FT /db_xref="InterPro:IPR037056" FT /db_xref="UniProtKB/Swiss-Prot:P03557" FT /protein_id="AAA46351.1" FT /translation="MENIEKLLMQEKILMLELDLVRAKISLARANGSSQQGELSLHRET FT PEKEVAVHSALVTFTPTQVKAIPEQTAPGKESTNPLMASILPKDMNPVQTGTRLAVPSD FT FLRPHQGIPIPQKSELSSTVVPLRAESGIQHPHINYYVVYNGPHAGIYDDWGCTKAATN FT GVPGVAHKKFATITEARAAADAYTTRQQTDRLNFIPKGEAQLKPKSFAEALTSPPKQKA FT HWLTLGTKKPSSDPAPKEISFAPEITMDDFLYLYDLVRKFDGEGDDTMFTTDNEKISLF FT NFRKNANPQMVREAYAAGLIKTIYPSNNLQEIKYLPKKVKDAVKRFRTNCIKNTEKDIF FT LKIRSTIPVWTIQGLLHKPRQVIEIGVSKKVIPTESKAMESRIQIEDLTELAVKTGEQF FT IQSLLRLNDKKKIFVNMVEHDTLVYSKNIKETDSEDQRAIETFQQRVISGNLLGFHCPA FT ICHFIMKTVEKEGGAYKCHHCDKGKAIVQDASADEGTTDKSGPPPTRSIVEKEDVPNTS FT SKQVD" FT mRNA 7419..>8016 FT /note="major viral mRNA" XX SQ Sequence 8016 BP; 2939 A; 1653 C; 1562 G; 1862 T; 0 other; ggtatcagag ccatgaatag gtctatgacc aaaactcaag aggataaaac ctcaccaaaa 60 taccaaagag ttcttaactc taaaaataaa agatctttca agatcaaaaa tagttccctc 120 acaccggtga ccgacaggtt taccaccgta aggtttcaga acaacatcga atgcgtttac 180 gccaacttcg actctcagct caagtcgtcg tacgatggta gatctaaaaa gatcaagact 240 ctaagcctta aaaatcttag atgttacgaa accttcctca ggaagtacct tttggaacaa 300 taaaatctct ctgagaatag tactctattg agtatccaca gaaaaaataa tcttctgtgt 360 tgagatggat ttgtatccag aagaaaacac ccaaagcgag caatcgcaaa attctgaaaa 420 taatatgcaa atatttaaat cagaaacttc ggatggattc tcctccgatt taaagatctc 480 aaacgatcaa ttaaaaaata tctcaaaaac ccaattaact ttggaaaaag aaaagatatt 540 taagatgcct aacgttttat ctcaagttat gaaaaaagcg tttagcagga aaaacgagat 600 tctctactgc gtctcgacaa aagaattatc ggtggacatt catgatgcca caggtaaggt 660 atatcttcct ttaatcacta aagaggaaat taataaaaga ctttccagct taaaacctga 720 agtcagaaga accatgtcca tggtccattt gggcgcggtc aaaatattgc ttaaagctca 780 atttagaaat gggattgata ccccaatcaa aattgcttta atcgatgata gaatcaattc 840 tagaagagat tgtcttcttg gtgcagccaa aggtaatctc gcatacggta agtttatgtt 900 tactgtatac cccaagtttg gaataagcct taatacccaa agacttaacc aaaccttaag 960 ccttattcat gattttgaga ataaaaatct tatgaataaa ggtgataaag ttatgaccat 1020 aacctatatc gtaggatatg cattaacaaa tagtcatcat agcatagatt atcaatcgaa 1080 tgctacaatt gaactagaag acgtatttca agaaattgga aatatccagc aatctgagtt 1140 ctgtacaata cagaatgatg aatgcaattg ggccattgat atagcccaaa acaaagcctt 1200 attaggagct aaaaccaaaa cccaaattgg taatagtctt caaataggaa atattgcatc 1260 atcctctagt actgaaaatg aattagctag ggtgagccaa aacatagatc ttttaaaaaa 1320 taaattaaaa gaaatctgtg gagaatgagc ataacgggtc aaccgcatgt ttataaaaaa 1380 gatactatta ttagactaaa accattgtct cttaatagta ataatagaag ttatgttttt 1440 agttcctcaa aagggaacat tcaaaatata attaatcatc ttaacaacct caataagatt 1500 gtaggaagaa gcttactcgg aatatggaag atcaactcat acttcggact aagcaaagac 1560 ccttcggagt ccaaatcgaa aaacccgtca gtttttaata ctgcaaaaac catttttaag 1620 agtggggggg ttgattactc gagccaacca aaggaaataa aatccctttt agaagctcaa 1680 aatactagaa ttaaaagtct agaaaaagca attcaatcct tagatgaaaa gattgaacca 1740 gagcccttaa ctaaagaaga agttaaagag cttaaagaat cgattaactc gatcaaagaa 1800 ggattaaaga atattattgg ctgaaatggc taatcttaat caaatccaaa aagaagtctc 1860 tgaaatcctc agtgaccaaa aatccatgaa agcggatata aaagctatct tagaattatt 1920 aggatcccaa aatcctatta aagaaagctt agaaaccgtt gcagcgaaaa tcgttaatga 1980 cttaaccaag ctcatcaatg attgtccttg taacaaagag atattagaag ccttaggcaa 2040 ccaacctaaa gagcaactaa taggacaacc taaagaaaaa ggcaaaggcc ttaatcttgg 2100 aaaatactct taccccaatt acggagtagg aaatgaagaa ttaggatcct ctggaaaccc 2160 taaagcttta acctggccct tcaaagctcc agcaggatgg ccgaatcaat attagaccga 2220 actattaata ggttctggta taaactggga gatgattgtc tctcagaaag tcaatttgac 2280 cttatgataa ggttaatgga agagtccctt gacggggacc aaattattga tctaacctct 2340 ctacctagtg acaatttgca ggttgaacag gttatgacaa caaccgaaga ctcgatctcg 2400 gaagaagaat cagaattcct tctagcaata ggagaaacgt ctgaagaaga aagcgattca 2460 ggagaagaac ctgaattcga acaagttcga atggatcgaa caggaggaac ggagattccc 2520 aaagaagaag atggcggaga accatctaga tataatgaga gaaagagaaa gaccactgaa 2580 gatcggtact ttccaactca accaaagacc attccaggcc aaaagcaaac gaccatggga 2640 atgctcaaca ttgactgcca agccaatcgg agaactctaa tcgacgattg ggcagcagaa 2700 atcggattga tagtcaagac caatagagaa gactatcttg atccagaaac aatcctactt 2760 ctgatggaac ataaaacatc aggaatagcc aaggagttaa tccgaaacac aagatggaac 2820 cgcactaccg gcgacatcat agaacaggtg atcgatgcaa tgtacaccat gttcctagga 2880 cttaactact ccgacaacaa ggtcgccgag aagatcgaag agcaagagaa ggccaaaatc 2940 agaatgacca agcttcagct ctgcgacatc tgctaccttg aagaatttac atgtgattat 3000 gagaagaaca tgtacaagac agaactggcg gatttcccag gatatatcaa ccagtacctg 3060 tcaaaaatcc ccatcattgg agaaaaagcg ttaacacgct ttaggcatga agccaacgga 3120 accagcatct acagtttagg tttcgcggca aagatagtaa aagaagaact atctaaaatc 3180 tgcgacttga ccaagaagca gaagaagttg aagaaattca acaagaagtg ctgtagcatc 3240 ggagaagctt cagtagaata tggatgcaag aagacatcca agaagaagta tcataaaaga 3300 tacaagaaaa aatataaggc ttataaacct tataagaaga agaagaaatt ccggtcagga 3360 aaatacttca agcccaaaga aaagaagggc tctaagcaaa agtattgccc aaagggcaag 3420 aaagactgca gatgttggat ctgcaatatc gaaggccatt acgccaacga atgtcctaat 3480 cgacaaagct cagagaaggc tcacatcctt caacaagcag agaaactggg tctccagccc 3540 atcgaagaac cctacgaagg agttcaagaa gtattcatcc tagaatacaa agaagaggaa 3600 gaagaaacct ctacagaaga agatgatgga tcatctactt cagaagactc agattcagaa 3660 tcagactgag caggtgatga acatcaccaa tcccaattcg atctacatca agggaagact 3720 ctacttcaag ggatacaaga agatagagct tcactgtttt gtagacacgg gagcaagttt 3780 atgcatagca tccaagttcg tcataccaga agaacattgg atcaatgcag aaagaccaat 3840 catggtcaaa attgcagatg gaagttcgat caccatcaac aaagtctgca gagacattga 3900 cctgatcata gccggagaaa tattccatat tcccaccgtc tatcaacagg aaagtggaat 3960 cgatttcatc atcggcaaca acttctgtca gttgtatgaa cctttcatac aatttacaga 4020 tagagttatc ttcacaaagg acagaacata ccctgttcat attgcgaagc taacaagagc 4080 agtgcgagta ggcacagaag gattcctaga atccatgaag aaacgttcaa agactcagca 4140 accggagcct gtgaacattt caacaaacaa aattgctatt ctttcagagg ggaggaggtt 4200 atcagaagaa aaacttttca tcactcagca aagaatgcaa aaaatcgaag aactacttga 4260 gaaagtatgt tcagaaaatc cattagatcc taacaagact aagcaatgga tgaaagcttc 4320 aatcaagctc agcgacccaa gcaaagctat caaggttaaa cccatgaagt atagcccaat 4380 ggatcgtgaa gaatttgata agcaaatcaa agaattactg gatctaaaag tcatcaagcc 4440 cagtaaaagc cctcacatgg caccagcctt cttggtcaac aatgaagccg agaagcgaag 4500 aggaaagaaa cgtatggtag tcaactacaa agctatgaac aaagccactg taggagacgc 4560 ttacaatcct cccaacaaag acgagttact tacactcatt cgaggaaaga agatcttttc 4620 ttccttccac tgtaactcag gattctggca ggttctgcta gatcaagaat caagacctct 4680 aacggcattc acatgtcccc aaggtcacta tgaatggaat gtggtacctt tcggcttaaa 4740 gcaagctcca tccatattcc aaagacacat ggacgaagct ttccgtgtgt tcagaaagtt 4800 ctgttgcgtt tatgtcgacg acattctcgt attcagtaac aatgaagaag atcacctact 4860 tcacgtagca atgatcttac aaaagtgcaa tcaacatgga attatccttt ccaagaagaa 4920 agcacaactc ttcaagaaga agataaactt ccttggtcta gaaatagatg aaggaacaca 4980 caagcctcaa ggacacatct tggaacatat caacaaattc ccagataccc ttgaagataa 5040 gaagcaactt cagagattct taggcatact cacatatgcc tcagattata ttccgaagct 5100 agcgcaaatc agaaagcctc tgcaagccaa gcttaaggag aacgttccat ggaaatggac 5160 aaaagaggac accctctaca tgcaaaaggt gaagaaaaat ctgcaagcat ttcctccact 5220 acatcatccc ttaccagaag agaagttgat tatcgagacc gacgcatcag atgactactg 5280 gggaggtatg ttaaaagcta tcaaaattaa cgaaggtact aatactgagt taatttgcag 5340 atacgcatct ggaagcttta aagctgcaga aaagaattac cacagcaatg acaaagagac 5400 actggcggta ataaatacta taaagaaatt tagtatttat ctaactcctg ttcattttct 5460 gatcagaaca gataatactc atttcaagag ttttgttaat ctcaattaca aaggagattc 5520 gaaacttgga agaaacatca gatggcaagc atggcttagc cattattcat ttgatgttga 5580 acacattaaa ggaaccgaca accactttgc ggacttcctt tcaagagaat tcaatagggt 5640 taattcctaa ttgaaatccg aagataagat tcccacacac ttgtggctga tatcaaaagg 5700 ctactgccta tataaacaca tctctggaga ctgagaaaat cagacctcca agcatggaga 5760 acatagaaaa actcctcatg caagagaaaa tactaatgct agagctcgat ctagtaagag 5820 caaaaataag cttagcaaga gctaacggct cttcgcaaca aggagaactc tctctccacc 5880 gtgaaacacc ggaaaaagaa gtagcagttc attctgcact ggtcactttt acgccaactc 5940 aagtaaaggc tattccagag caaacggctc ctggtaaaga atcaacaaat ccgttgatgg 6000 ctagtatctt gccaaaagat atgaacccag ttcagactgg gacaaggcta gcagtgccat 6060 cggacttttt acgtcctcat cagggaattc caatcccaca aaaatctgag cttagcagca 6120 cagttgttcc tctcagagca gaatcgggta ttcaacaccc tcatatcaac tactacgttg 6180 tgtataacgg tccacatgcc ggtatatacg atgactgggg ttgtacaaag gcagcaacaa 6240 acggcgtccc cggagttgcg cataagaagt ttgccactat tacagaggca agagcagcag 6300 ctgacgcgta tacaacaaga cagcaaacag ataggttgaa ctttatcccc aaaggagaag 6360 ctcaactcaa gcccaagagc tttgctgagg ccttaacaag cccaccaaag caaaaagccc 6420 actggctcac gctaggaacc aaaaagccca gcagtgatcc agccccaaaa gagatctcct 6480 ttgccccgga gatcacaatg gacgacttcc tctatctcta tgatctagtc aggaagttcg 6540 acggagaagg tgacgatacc atgttcacca ctgacaatga gaagattagc ctcttcaatt 6600 tcagaaagaa cgctaaccca cagatggtta gagaggccta cgcagcagga ctcattaaga 6660 cgatctaccc gagcaataat ctccaggaga tcaaatacct tcccaagaag gttaaagatg 6720 cagtcaaaag attcaggact aactgcatca agaacacaga gaaagatata tttctcaaga 6780 tcagaagtac tattccagta tggacgattc aaggcttgct tcacaaacca aggcaagtaa 6840 tagagattgg agtctctaaa aaggtaattc ctacagaatc aaaggccatg gagtcaagga 6900 ttcaaattga ggatctaaca gaactcgccg tgaagactgg cgaacagttc atacagagtc 6960 tcttacgact caatgacaag aagaaaatct tcgtcaacat ggtggagcac gacactctcg 7020 tctactccaa gaatatcaag gaaacagact cagaagacca aagggcaatt gagactttcc 7080 aacaaagggt aatttcggga aacctcctcg gattccattg cccagctatc tgtcacttca 7140 tcatgaagac agtagaaaag gaaggtggcg cctacaaatg tcaccattgc gataaaggaa 7200 aggctatcgt tcaagatgcc tctgccgacg aagggaccac agacaaaagt ggacctccac 7260 ccacgaggag catcgtagaa aaagaagacg ttcccaacac gtcttcaaag caagtggatt 7320 gatgtgatat ctccactgac gtaagggatg acgcacaatc ccactatcct tcgcaagacc 7380 cttcctctat ataaggaagt tcatttcatt tggagaggac acgctgaaat caccagtctc 7440 tctctacaac tctctctctc tctacatttc cataataatg tgtgagtagt tcccagataa 7500 gggaattagg gttcttatag ggtttcgctc atgtgttgag catataagaa acccttagta 7560 tgtatttgta tttgtaaaat acttctatca ataaaatttc taattcctaa aaccaaaatc 7620 cagtactaaa atccagatct cctaaagtcc ctatagatct ttgtggtgaa tataaaccag 7680 acacgagacg actaaacctg gagcccagac gccgtttgaa gctagaagta ccgcttaggc 7740 aggaggccgt tagggaaaag atgctaaggc agggttggtt acgttgactc ccccgtaggt 7800 ttggtttaaa tatcatgaag tggacggaag gaaggaggaa gacaaggaag gataaggttg 7860 caggccctgt gtaaggtaag acgatggaaa tttgatagag gtacgctact atacttatac 7920 tatatgctaa gggaatgctt gtatttaccc tatataccct aataacccct tatcgattta 7980 aagaaataat ccgcataagc ccccgcttaa aaaatt 8016 //