ID X14911; SV 1; linear; genomic DNA; STD; VRL; 3131 BP. XX AC X14911; XX DT 23-NOV-1989 (Rel. 21, Created) DT 12-SEP-1993 (Rel. 36, Last updated, Version 2) XX DE Cauliflower mosaic virus gene VI for the inclusion body matrix protein XX KW matrix protein. XX OS Cauliflower mosaic virus OC Viruses; Ortervirales; Caulimoviridae; Caulimovirus. XX RN [1] RP 1-3131 RA Takahashi H.; RT ; RL Submitted (01-APR-1989) to the INSDC. RL Takahashi H., Faculty of Agriculture, Tohoku University, 1-1 Tsutsumidori RL Amamiyamachi , Sendai , JAPAN 980. XX RN [2] RP 1-3131 RX DOI; 10.1093/nar/17.19.7981. RX PUBMED; 2798138. RA Takahashi H., Shimamoto K., Suzuki M., Ehara Y.; RT "DNA sequence of gene VI of cauliflower mosaic virus Japanese strain S RT (CaMV S-Japan)"; RL Nucleic Acids Res. 17(19):7981-7981(1989). XX DR MD5; c89df9dacd1f2ff95c6876990088cadb. DR EPD; EP07015; CAMV_35MJ. XX CC *source :strain=CaMV S-Japan; CC CC Data kindly reviewed (19-OCT-1989) by Takahashi H. XX FH Key Location/Qualifiers FH FT source 1..3131 FT /organism="Cauliflower mosaic virus" FT /mol_type="genomic DNA" FT /db_xref="taxon:10641" FT regulatory 753..759 FT /note="TATA box (19S promoter)" FT /regulatory_class="promoter" FT CDS 798..2366 FT /note="inclusion body matrix protein (AA 1-522)" FT /db_xref="GOA:P13218" FT /db_xref="InterPro:IPR009027" FT /db_xref="InterPro:IPR011320" FT /db_xref="InterPro:IPR037056" FT /db_xref="UniProtKB/Swiss-Prot:P13218" FT /protein_id="CAA33037.1" FT /translation="MENIEKLLMQEKILMLELDLVRAKISLARANGSSQQGDLSLHRET FT PVKEEAVHSALATFTPTQVKAIPEQTAPGKESTNPLMASILPKDMNSVQTENRLVKPLD FT FLRPHQGIPIPQKSEPNSSVTLHRVESGIQHPHTNYYVVYNGPHAGIYDDWGCTKAATN FT GVPGVAHKKFATITEARAAADAYTTNQQTGRLNFIPKGEAQLKPKSFAKALISPPKQKA FT HWLTLGTKKPSSDPAPKEISFDPEITMDDFLYLYDLARKFDGEDDGTIFTTDNEKISLF FT NFRKNANPQMVREAYTAGLIKTIYPSNNLQEIKYLPKKVKDAVKRFRTNCIKNTEKDIF FT LKIRSTIPVWTIQGLLHKPRQVIEIGVSKKIVPTESKAMESKIQIEDLTELAVKSGEQF FT IQSLLRLNDKKKIFVNMVEHDTLVYSKNIKDTVSEDQRAIETFQQRVISGNLLGFHCPA FT ICHFIMKTVEKEGGAYKCHHCEKGKAIVKDASTDRGTTDKDGPPPTRSIVEKEDVPTTS FT SKQVD" FT regulatory 2432..2438 FT /note="TATA box (35S promoter)" FT /regulatory_class="promoter" FT misc_feature 2634..2639 FT /note="pot. polyA signal" XX SQ Sequence 3131 BP; 1099 A; 694 C; 615 G; 723 T; 0 other; tctagaaata gatgaaggaa cacacaagcc tcaaggacat atcttggaac acatcaacaa 60 attcccagat acccttgaag acaagaagca acttcagaga ttcttaggca tcctaacata 120 tgcctctgat tatatcccga agctagctca aatcagaaag cctctgcaag ccaagcttaa 180 agaaaatgtt ccatggaaat ggacaaagga ggacaccctc tacatgcaaa aggtgaagaa 240 aaatctgcaa ggatttcctc cactacatca tcccttacca gaggaaaagc tgatcatcga 300 gaccgacgca tcagacaact actggggagg tatgttaaaa gctatcaaaa ttaacgaagg 360 tactaatact gagttaattt gcagatacgc atctggaagc tttaaagctg cagaaaagaa 420 ttaccacagc aatgacaaag agacattggc ggtaataaat actataaaga aattcagtat 480 ttatctaact cctgttcatt ttctgattag gcaggataat actcatttca agagtttcgt 540 taatctcaat tacaaaggag attcaaagct tggaagaaac atcagatggc aagcatggct 600 tagccactat tcatttgatg ttgaacatat taaaggaacc gacaaccact ttgcggactt 660 cctttcaaga gaattcaata aggttaattc ctaattgaaa tccgaagata agattcccac 720 acacttgtgg ctgatatcaa aaggctactg cctatataaa cacatctctg gaaactgaga 780 aaatcagacc tccaagcatg gagaacatag aaaaactcct catgcaagag aaaatactaa 840 tgctagagct cgatctagta agagcgaaaa taagcttagc aagagctaac ggctcttcgc 900 aacaaggaga cctctctctc caccgtgaaa caccggtaaa agaagaagca gttcattctg 960 cactggccac ttttacgcca actcaagtaa aggctattcc agagcaaacg gctcctggta 1020 aagaatcaac aaatccgttg atggctagta tcttgccaaa agatatgaat tcagttcaaa 1080 ctgaaaatag gcttgtaaag ccattggact ttctacgtcc tcatcaggga attccaatcc 1140 cacaaaaatc tgagcctaac agctctgtta ctcttcacag agtagaatcg ggtattcaac 1200 accctcatac caactactac gtcgtgtata acggaccaca tgccggtata tacgatgact 1260 ggggttgtac aaaggcggcg acaaacggcg ttcccggagt tgcacacaag aagtttgcca 1320 ctattacaga ggcaagagca gcagctgacg cgtatacaac aaatcagcaa acaggaaggt 1380 tgaacttcat ccccaaagga gaagctcaac tcaaacccaa gagctttgcc aaggccttaa 1440 tcagcccacc aaagcaaaaa gcccactggc tcacgctagg aaccaaaaag cccagcagtg 1500 atcctgcccc aaaagagatc tcctttgacc cagagatcac aatggacgac ttcctctatc 1560 tctatgatct agcaaggaag ttcgacggag aagatgacgg taccatcttc actactgata 1620 atgagaagat tagcctcttc aatttcagaa agaacgctaa cccacagatg gttagagagg 1680 cctacacagc aggtctcatc aaaacgatct acccgagtaa caatctccag gagatcaaat 1740 accttcccaa aaaggttaaa gatgcagtca aaagattcag gactaactgc atcaagaaca 1800 cagagaaaga tatatttctc aagatcagaa gtactattcc agtatggacg attcaaggct 1860 tgcttcataa accaaggcaa gtaatagaga ttggagtctc taagaagata gttcctacag 1920 aatcaaaggc catggagtca aagattcaaa ttgaggatct aacagaactg gcagtaaagt 1980 ctggcgaaca gttcatacag agtctcttac gactcaatga caagaagaaa atcttcgtca 2040 atatggtgga gcacgacact ctggtctact ccaaaaatat caaggataca gtctcagaag 2100 accaaagggc aattgagact tttcaacaaa gggtaatttc aggaaacctc ctcggattcc 2160 attgcccagc tatctgtcac ttcatcatga agacagtaga aaaggaaggt ggcgcctaca 2220 aatgtcatca ttgcgaaaaa ggaaaggcta tcgttaaaga tgcctctacc gacagaggga 2280 ccaccgacaa agatggaccc ccacccacga ggagcatcgt ggaaaaagaa gacgttccaa 2340 ccacgtcttc taagcaagtg gattgatgtg acatctccac tgacgtaagg gatgacgcac 2400 aatcccacta tccttcgcaa gacccttcct ctatataagg aagttcattt catttggaga 2460 ggacacgctg aaatcaccag tctctctcta caaatctatc tctctctatt ttctccataa 2520 taatgagtga gtagtctccg ataagggaat gagggttctt atagggtttc gctcatgtgt 2580 tgagcatata agaaaccctt agtatgtatt tgtatttgta aaatacttct atcaataaaa 2640 tttctaattc ctaaaaccaa aatccagcac taaaattcca gaaatcccaa gtccctatag 2700 atctttgtgg tgaatataaa ccagacacga gacgactaaa cctggagccc agatgccgct 2760 tgaagctaga agtaccgctt aggcaggagg ccgttaggga aaagaagcta aggcagggtt 2820 ggttacgttg actcccccgt aggtttggtt taaatatcat gaagtggact gaagaaagaa 2880 ggaagacatg gaaggataag gttgcaggcc ctgaacaagg taagacgatg gaaatttgat 2940 agaggtacgc tactatactc atactatacg ctaagggaat acttatattt ataccctata 3000 cccctaataa ccccttatca atttaagaaa taatccgcac aaaccccagt ttaaacgatt 3060 ggtatcatag caataccatg tttaaaaaat tggtatcaga gccatgaatg gtttaaaaac 3120 caaactctag a 3131 //